Return-Path: X-Original-To: apmail-mahout-user-archive@www.apache.org Delivered-To: apmail-mahout-user-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 3061D11D0B for ; Thu, 9 May 2013 12:53:46 +0000 (UTC) Received: (qmail 88821 invoked by uid 500); 9 May 2013 12:53:44 -0000 Delivered-To: apmail-mahout-user-archive@mahout.apache.org Received: (qmail 88652 invoked by uid 500); 9 May 2013 12:53:44 -0000 Mailing-List: contact user-help@mahout.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@mahout.apache.org Delivered-To: mailing list user@mahout.apache.org Received: (qmail 88640 invoked by uid 99); 9 May 2013 12:53:43 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 09 May 2013 12:53:43 +0000 X-ASF-Spam-Status: No, hits=2.4 required=5.0 tests=FREEMAIL_ENVFROM_END_DIGIT,HK_RANDOM_ENVFROM,HTML_MESSAGE,RCVD_IN_DNSWL_LOW,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (nike.apache.org: domain of mattmcclain1@gmail.com designates 209.85.223.170 as permitted sender) Received: from [209.85.223.170] (HELO mail-ie0-f170.google.com) (209.85.223.170) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 09 May 2013 12:53:37 +0000 Received: by mail-ie0-f170.google.com with SMTP id aq17so5477044iec.1 for ; Thu, 09 May 2013 05:53:16 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=x-received:mime-version:in-reply-to:references:from:date:message-id :subject:to:content-type; bh=A3wPeLtuLoTyxWRCD7K4J/wSpKtbPeGfFaG3RS3QrEY=; b=OOlRA4Ng/NMpiFGtPISvm1+Lrv5EkRBodPKdPZEh4eabvsAVqJErj77XValZd7GBRZ Lh85Oyzqn+Y/eQbrNVb4g1fPx6cg9aco7ABoWC7J4IuKsDuntt57RZIoMHbvChbN3IKd GeGDACrmWAW8Skt8PLhCH8jQiGroWuECARye3kypUkXULKyHL6h/2RrEsPhVJxQy4j9e PnCxLhsk6J8mHoE5dT1efOTwekD3xhYsnTOTzLiItWzBYhdfLxzOdd0/jW+k4qHXGSA7 hlmdSp3SZAErocypMpu74pKwBmKzp4yrje3WactYEsCPtqIVFP2E2bgR24YSlV4kDRAe INAA== X-Received: by 10.50.78.232 with SMTP id e8mr4929203igx.72.1368103996116; Thu, 09 May 2013 05:53:16 -0700 (PDT) MIME-Version: 1.0 Received: by 10.64.106.72 with HTTP; Thu, 9 May 2013 05:52:56 -0700 (PDT) In-Reply-To: References: From: Matthew McClain Date: Thu, 9 May 2013 07:52:56 -0500 Message-ID: Subject: Re: Which is the right approach to follow? To: user@mahout.apache.org Content-Type: multipart/alternative; boundary=089e013c6a20b4547304dc4888d8 X-Virus-Checked: Checked by ClamAV on apache.org --089e013c6a20b4547304dc4888d8 Content-Type: text/plain; charset=ISO-8859-1 Karan, Without knowing why clustering didn't work, it's hard to say what a better approach would be. Any other information you can give about the problem you're working on would probably help, too. In particular, how did you come up with your four categories? Typically, categories are not defined directly in terms of the data, because the problem is to find the relationship between the categories and data. Matt On Wed, May 8, 2013 at 5:44 AM, Karan wrote: > Hi All, > > I have some numerical data in pairs say X & Y and I want to divide(cluster, > may be) into four groups as LowX-LowY,LowX-HighY,HighX-LowY & HighX-HighY. > I > tried with clustering but unable to identify clusters(and i think is not > the > best way to achieve it). Can someone suggest any good(non-trivial) approach > to proceed. > > Thanks, > Karan > > --089e013c6a20b4547304dc4888d8--