Return-Path: X-Original-To: apmail-mahout-user-archive@www.apache.org Delivered-To: apmail-mahout-user-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id B144F7F90 for ; Sun, 25 Dec 2011 21:13:51 +0000 (UTC) Received: (qmail 92409 invoked by uid 500); 25 Dec 2011 21:13:50 -0000 Delivered-To: apmail-mahout-user-archive@mahout.apache.org Received: (qmail 92373 invoked by uid 500); 25 Dec 2011 21:13:50 -0000 Mailing-List: contact user-help@mahout.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@mahout.apache.org Delivered-To: mailing list user@mahout.apache.org Received: (qmail 92365 invoked by uid 99); 25 Dec 2011 21:13:50 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Sun, 25 Dec 2011 21:13:50 +0000 X-ASF-Spam-Status: No, hits=1.5 required=5.0 tests=HTML_MESSAGE,RCVD_IN_DNSWL_LOW,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (athena.apache.org: domain of ted.dunning@gmail.com designates 209.85.210.170 as permitted sender) Received: from [209.85.210.170] (HELO mail-iy0-f170.google.com) (209.85.210.170) by apache.org (qpsmtpd/0.29) with ESMTP; Sun, 25 Dec 2011 21:13:44 +0000 Received: by iafj26 with SMTP id j26so36546927iaf.1 for ; Sun, 25 Dec 2011 13:13:23 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=gamma; h=mime-version:in-reply-to:references:from:date:message-id:subject:to :content-type; bh=t63y5A9gFwJBUGi/yJLAmx3jXhqKJyhq+QVlykJdWr8=; b=n6eptTPVqdJGn3oR2HdCBxULmrB2aX06g4eCH16YAzTI33Yp9PrsgDPadQhzzhqe9y dIXFKy9Vjdx3pasWpZIL2u5Op2xedcnRI8Weoi7+s6inqbtwPAAOVAQnpsmIglfRYnml 8Uug/KbV/fEfP3kBnGx3eqTB43wS2tYptRHBc= Received: by 10.42.150.130 with SMTP id a2mr22860758icw.43.1324847603138; Sun, 25 Dec 2011 13:13:23 -0800 (PST) MIME-Version: 1.0 Received: by 10.50.197.161 with HTTP; Sun, 25 Dec 2011 13:13:02 -0800 (PST) In-Reply-To: <1324821545.72643.YahooMailNeo@web121704.mail.ne1.yahoo.com> References: <1324821545.72643.YahooMailNeo@web121704.mail.ne1.yahoo.com> From: Ted Dunning Date: Sun, 25 Dec 2011 13:13:02 -0800 Message-ID: Subject: Re: Mahout classifier on Hadoop To: user@mahout.apache.org, Lingxiang Cheng Content-Type: multipart/alternative; boundary=90e6ba21220bc4400104b4f11e96 --90e6ba21220bc4400104b4f11e96 Content-Type: text/plain; charset=UTF-8 Random forest works as a map-reduce program, but that does not produce arbitrary scalability. The Naive Bayes classifier is relatively natural as a map-reduce program and has a map-reduce version. The linear classifiers like linear regression do not have map-reduce versions (yet) since there is some difficulty in getting these to work well. On Sun, Dec 25, 2011 at 5:59 AM, Lingxiang Cheng wrote: > Hi, > > I am a newbie to Mahout. When I was reading the book "Mahout in > Action", I found chapters talking about how clustering naturally fit into > Map/Reduce framework, but I did not see the same claim for classifiers. > Does it involve a lot of work to make classifiers like random forest work > with Hadoop? > > Thanks! > Lingxiang Cheng --90e6ba21220bc4400104b4f11e96--