Return-Path: Delivered-To: apmail-lucene-mahout-user-archive@minotaur.apache.org Received: (qmail 56352 invoked from network); 22 Aug 2009 20:18:06 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (140.211.11.3) by minotaur.apache.org with SMTP; 22 Aug 2009 20:18:06 -0000 Received: (qmail 50241 invoked by uid 500); 22 Aug 2009 20:18:26 -0000 Delivered-To: apmail-lucene-mahout-user-archive@lucene.apache.org Received: (qmail 50167 invoked by uid 500); 22 Aug 2009 20:18:26 -0000 Mailing-List: contact mahout-user-help@lucene.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: mahout-user@lucene.apache.org Delivered-To: mailing list mahout-user@lucene.apache.org Received: (qmail 50157 invoked by uid 99); 22 Aug 2009 20:18:26 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Sat, 22 Aug 2009 20:18:26 +0000 X-ASF-Spam-Status: No, hits=-0.0 required=10.0 tests=SPF_HELO_PASS,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (athena.apache.org: domain of lists@nabble.com designates 216.139.236.158 as permitted sender) Received: from [216.139.236.158] (HELO kuber.nabble.com) (216.139.236.158) by apache.org (qpsmtpd/0.29) with ESMTP; Sat, 22 Aug 2009 20:18:17 +0000 Received: from isper.nabble.com ([192.168.236.156]) by kuber.nabble.com with esmtp (Exim 4.63) (envelope-from ) id 1Mex2C-0002nz-B1 for mahout-user@lucene.apache.org; Sat, 22 Aug 2009 13:17:56 -0700 Message-ID: <25097395.post@talk.nabble.com> Date: Sat, 22 Aug 2009 13:17:56 -0700 (PDT) From: Tim Hughes To: mahout-user@lucene.apache.org Subject: Re: Custom Algorithm (C/C++) ? In-Reply-To: MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Transfer-Encoding: 7bit X-Nabble-From: thughes@troglobyte.com References: <25096676.post@talk.nabble.com> <25097210.post@talk.nabble.com> X-Virus-Checked: Checked by ClamAV on apache.org We are looking to do a query of documents & abstracts from a legacy system, then retrieve the docs for clustering & classification via Mahout. Expected volume is something on the order of 2,000 - 3,000 documents. Ted Dunning wrote: > > Can you say more about your application? > > Mahout is a very young project and is known to be sub-standard in a number > of respects due to youth. Depending on what you need, it might be > excellent, or seriously deficient (at the moment). The deficiencies will > be > addressed over time, but full disclosure now is important. > > Depending on what you need, an on-line learning system like vowpal might > be > much better for you. > > On Sat, Aug 22, 2009 at 12:59 PM, Tim Hughes > wrote: > >> We're looking specifically at Mahout (on top of the other supporting >> Apache >> projects). One of the roadblocks to moving in that direction is the >> concern >> about Java performance. We could not go the Mahout direction if there was >> no >> way to use C/C++; since there is, we can bypass the "premature >> optimization" >> and run Mahout as designed, yet have the ability to fall back to custom C >> code if the user's expectations are not met. >> > > > > -- > Ted Dunning, CTO > DeepDyve > > -- View this message in context: http://www.nabble.com/Custom-Algorithm-%28C-C%2B%2B%29---tp25096676p25097395.html Sent from the Mahout User List mailing list archive at Nabble.com.