From: mail2abin
To: mahout-user@lucene.apache.org
Date: Mon, 9 May 2011 12:38:13 -0700 (PDT)
Message-ID: <1304969893258-2920165.post@n3.nabble.com>
Subject: Clustering boolean vectors

Hi,

I was trying to run ItemBasedRecommender on the GroupLens movie sample data, which requires ratings (user preferences) as input. But suppose I do not have ratings (user preferences); instead, I have a boolean attribute vector for each item [e.g. Godfather - 0|1|0|0|0|0|1], where the two 1's might stand for Crime and Drama. ItemBasedRecommender requires a DataModel, which I do not have.
Instead, as I understand it, I should cluster the items on their boolean attribute vectors and later retrieve the items that belong to a given cluster. Please give me pointers to the right clustering API. I have seen TanimotoCluster etc., but I am not sure whether they are suitable for boolean vectors.

Abin
Software Developer
NY

--
View this message in context: http://lucene.472066.n3.nabble.com/Clustering-boolean-vectors-tp2920165p2920165.html
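
For reference, the Tanimoto (Jaccard) coefficient mentioned above is defined on the sets of attributes that are switched on, so it can be computed directly from boolean vectors like the one in the message. The following is only a minimal sketch in plain Java that does not use Mahout's API; the class name `BooleanTanimoto`, the method `tanimoto`, and the second item's vector are hypothetical and chosen purely for illustration.

```java
public class BooleanTanimoto {

    /**
     * Tanimoto (Jaccard) coefficient of two boolean vectors encoded as 0/1 ints:
     * (number of positions set in both) / (number of positions set in either).
     * Returns 0.0 when neither vector has any attribute set.
     */
    public static double tanimoto(int[] a, int[] b) {
        if (a.length != b.length) {
            throw new IllegalArgumentException("vectors must have the same length");
        }
        int both = 0;   // positions where both vectors are 1
        int either = 0; // positions where at least one vector is 1
        for (int i = 0; i < a.length; i++) {
            boolean x = a[i] != 0;
            boolean y = b[i] != 0;
            if (x && y) {
                both++;
            }
            if (x || y) {
                either++;
            }
        }
        return either == 0 ? 0.0 : (double) both / either;
    }

    public static void main(String[] args) {
        // Vector from the message: two attributes set (say, Crime and Drama).
        int[] godfather = {0, 1, 0, 0, 0, 0, 1};
        // Hypothetical second item sharing both attributes plus one more.
        int[] otherItem = {0, 1, 0, 0, 1, 0, 1};

        // 2 shared attributes out of 3 distinct ones, i.e. a similarity of 2/3.
        System.out.println(tanimoto(godfather, otherItem));
    }
}
```

A distance suitable for clustering can be taken as one minus this similarity, so items with identical attribute sets are at distance 0 and items with no attribute in common are at distance 1.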