From: Andy Twigg
Date: Sun, 2 Mar 2014 21:52:50 -0800
Subject: Re: [jira] [Commented] (MAHOUT-1153) Implement streaming random forests
To: Mahout-Dev

Yes, we could also consider committing it into the current Mahout code base. There are probably some advantages over the current implementation. What direction are you thinking of?

On 2 March 2014 13:57, Suneel Marthi (JIRA) wrote:
>
> [ https://issues.apache.org/jira/browse/MAHOUT-1153?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13917581#comment-13917581 ]
>
> Suneel Marthi commented on MAHOUT-1153:
> ---------------------------------------
>
> [~andytwigg] I understand this has been implemented on Spark and that an implementation is available at http://featurestream.io. Do you think we should start the conversation about rolling this into Mahout?
>
>> Implement streaming random forests
>> ----------------------------------
>>
>> Key: MAHOUT-1153
>> URL: https://issues.apache.org/jira/browse/MAHOUT-1153
>> Project: Mahout
>> Issue Type: New Feature
>> Components: Classification
>> Reporter: Andy Twigg
>> Labels: features
>> Fix For: Backlog
>>
>>
>> The current random forest implementations are in-core and not scalable. This issue is to add an out-of-core, scalable, streaming implementation. Initially it could be based on [1], using mappers in a master-worker style.
>> [1] http://jmlr.csail.mit.edu/papers/volume11/ben-haim10a/ben-haim10a.pdf