From: Ted Dunning
Date: Tue, 5 Nov 2013 18:04:28 -0800
Subject: Re: [jira] [Updated] (MAHOUT-1351) Adding DenseVector support to AbstractCluster
To: Mahout Dev List

Dave,

This is great. Do you know if the new streaming k-means has the same problem?

On Tue, Nov 5, 2013 at 3:02 PM, Dave DeBarr (JIRA) wrote:

>
> [
> https://issues.apache.org/jira/browse/MAHOUT-1351?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel]
>
> Dave DeBarr updated MAHOUT-1351:
> --------------------------------
>
>     Status: Patch Available  (was: Open)
>
> This simple "svn diff" (patch) resolves issue MAHOUT-1351
>
> > Adding DenseVector support to AbstractCluster
> > ---------------------------------------------
> >
> >                 Key: MAHOUT-1351
> >                 URL: https://issues.apache.org/jira/browse/MAHOUT-1351
> >             Project: Mahout
> >          Issue Type: Improvement
> >          Components: Clustering
> >    Affects Versions: 0.8
> >            Reporter: Dave DeBarr
> >            Priority: Minor
> >              Labels: performance
> >             Fix For: 0.9
> >
> >         Attachments: MAHOUT-1351.patch
> >
> >   Original Estimate: 1h
> >  Remaining Estimate: 1h
> >
> > This improvement reduces runtime by 80% when performing k-means
> > clustering of Scale Invariant Feature Transform (SIFT) descriptors to
> > derive visual words for computer vision. Unlike sparse document vectors,
> > SIFT descriptors are dense. This improvement involves updating the
> > org.apache.mahout.clustering.AbstractCluster(Vector point, int id2)
> > constructor to use "point.clone()" instead of "new
> > RandomAccessSparseVector(point)" for creating the centroid. Also added
> > testKMeansSeqJobDenseVector() test for DenseVector processing.
>
>
>
> --
> This message was sent by Atlassian JIRA
> (v6.1#6144)
>
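
For readers of the archive, the constructor change described in the quoted issue can be illustrated with a minimal, self-contained sketch. This is not the attached patch and does not use Mahout itself: Vec, DenseVec, SparseVec, centroidBefore and centroidAfter are hypothetical stand-ins for Mahout's Vector, DenseVector, RandomAccessSparseVector and the two versions of the centroid initialization, and the input values are arbitrary. It only shows that a clone keeps the input's representation, while copying into a sparse vector always yields a hash-backed one, even for dense input.

```java
import java.util.HashMap;
import java.util.Map;

public class CentroidCopySketch {

    /** Hypothetical stand-in for a vector interface such as Mahout's Vector. */
    public interface Vec {
        int size();
        double get(int i);
        Vec copy(); // plays the role of clone(): same representation as the source
    }

    /** Array-backed vector (stand-in for a dense vector). */
    public static final class DenseVec implements Vec {
        private final double[] values;
        public DenseVec(double[] values) { this.values = values.clone(); }
        public int size() { return values.length; }
        public double get(int i) { return values[i]; }
        public Vec copy() { return new DenseVec(values); }
    }

    /** Hash-backed vector (stand-in for a random-access sparse vector). */
    public static final class SparseVec implements Vec {
        private final int size;
        private final Map<Integer, Double> nonZeros = new HashMap<>();
        public SparseVec(Vec other) {
            size = other.size();
            for (int i = 0; i < size; i++) {
                double v = other.get(i);
                if (v != 0.0) {
                    nonZeros.put(i, v); // every non-zero entry becomes a boxed map entry
                }
            }
        }
        public int size() { return size; }
        public double get(int i) { return nonZeros.getOrDefault(i, 0.0); }
        public Vec copy() { return new SparseVec(this); }
    }

    /** Old behavior as described in the issue: always copy into a sparse vector. */
    public static Vec centroidBefore(Vec point) { return new SparseVec(point); }

    /** New behavior as described in the issue: keep the point's own representation. */
    public static Vec centroidAfter(Vec point) { return point.copy(); }

    public static void main(String[] args) {
        // Arbitrary toy values standing in for a dense descriptor.
        Vec point = new DenseVec(new double[] {12.0, 0.0, 7.0, 31.0});
        System.out.println(centroidBefore(point).getClass().getSimpleName()); // SparseVec
        System.out.println(centroidAfter(point).getClass().getSimpleName());  // DenseVec
    }
}
```

Both variants hold the same values. The difference is only in the backing storage, which is where the reported speedup for dense input would come from; the sketch does not attempt to measure or reproduce the 80% figure.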