Return-Path: Delivered-To: apmail-mahout-user-archive@www.apache.org Received: (qmail 66828 invoked from network); 9 Feb 2011 19:11:10 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (140.211.11.3) by minotaur.apache.org with SMTP; 9 Feb 2011 19:11:10 -0000 Received: (qmail 53642 invoked by uid 500); 9 Feb 2011 19:11:09 -0000 Delivered-To: apmail-mahout-user-archive@mahout.apache.org Received: (qmail 53440 invoked by uid 500); 9 Feb 2011 19:11:06 -0000 Mailing-List: contact user-help@mahout.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@mahout.apache.org Delivered-To: mailing list user@mahout.apache.org Received: (qmail 53432 invoked by uid 99); 9 Feb 2011 19:11:05 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 09 Feb 2011 19:11:05 +0000 X-ASF-Spam-Status: No, hits=3.7 required=5.0 tests=FREEMAIL_ENVFROM_END_DIGIT,FREEMAIL_FROM,HTML_MESSAGE,RCVD_IN_DNSWL_LOW,SPF_PASS,T_TO_NO_BRKTS_FREEMAIL X-Spam-Check-By: apache.org Received-SPF: pass (athena.apache.org: domain of shotsoncloud9@gmail.com designates 209.85.210.42 as permitted sender) Received: from [209.85.210.42] (HELO mail-pz0-f42.google.com) (209.85.210.42) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 09 Feb 2011 19:10:58 +0000 Received: by pzk9 with SMTP id 9so76988pzk.1 for ; Wed, 09 Feb 2011 11:10:38 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=gamma; h=domainkey-signature:mime-version:date:message-id:subject:from:to :content-type; bh=BLvFWKjQfjBRDOK9xexfYa8W4nEJHxNiBHeuKXzzFW4=; b=gZGR50oI+jV8b33WlHto9HhoX196l3fQbU+6XmPbGS5E6OlZygSc88gZ1FmW9E7X5M YidtwE6/0lnOK+fRiONlDcEtj0XzbdBTdXI+RB0N9NX/4QymE4jmChA5vkGPvy2QY714 D1V6uVGcll7vaajCLFwNk3LvBCiuB+6HebZ+U= DomainKey-Signature: a=rsa-sha1; c=nofws; d=gmail.com; s=gamma; h=mime-version:date:message-id:subject:from:to:content-type; b=garBIAZwBKt2IAKoyi5Q6hHagfYGv0ejtQjxCD2wwLLusGe4h9ia74G8+VEYtccfH5 +ddPpcFsSmZBP9YnwvMdhsEN+Q89tceBXXtK9Bsmpyf+HF19Xi/4ymzEODUQ4z/H/5H1 ChdW1sNT5ZliVEWXRmrwMvU8NuhuwO/QW9unU= MIME-Version: 1.0 Received: by 10.142.187.2 with SMTP id k2mr18881843wff.97.1297278637976; Wed, 09 Feb 2011 11:10:37 -0800 (PST) Received: by 10.142.169.10 with HTTP; Wed, 9 Feb 2011 11:10:37 -0800 (PST) Date: Wed, 9 Feb 2011 11:10:37 -0800 Message-ID: Subject: New to Mahout From: Sharath Jagannath To: user@mahout.apache.org Content-Type: multipart/alternative; boundary=000e0cd2dcec642ced049bde38e3 --000e0cd2dcec642ced049bde38e3 Content-Type: text/plain; charset=ISO-8859-1 Hi all, I am new to Mahout and Machine Learning in general. Have been playing with mahout command line tools and am able to cluster text documents, following the quick start. I am able to get the top term associated with a cluster using the clusterDumper. Is there a way to tell to which cluster a particular document belongs to? I can see them in the tf-idf vectors. Is there a tool/interface or should I write it on my own? Thanks, --MT --000e0cd2dcec642ced049bde38e3--