Return-Path: Delivered-To: apmail-lucene-mahout-user-archive@minotaur.apache.org Received: (qmail 10872 invoked from network); 22 Apr 2010 10:02:46 -0000 Received: from unknown (HELO mail.apache.org) (140.211.11.3) by 140.211.11.9 with SMTP; 22 Apr 2010 10:02:46 -0000 Received: (qmail 28658 invoked by uid 500); 22 Apr 2010 10:02:46 -0000 Delivered-To: apmail-lucene-mahout-user-archive@lucene.apache.org Received: (qmail 28043 invoked by uid 500); 22 Apr 2010 10:02:43 -0000 Mailing-List: contact mahout-user-help@lucene.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: mahout-user@lucene.apache.org Delivered-To: mailing list mahout-user@lucene.apache.org Received: (qmail 28029 invoked by uid 99); 22 Apr 2010 10:02:42 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 22 Apr 2010 10:02:42 +0000 X-ASF-Spam-Status: No, hits=2.2 required=10.0 tests=FREEMAIL_FROM,HTML_MESSAGE,RCVD_IN_DNSWL_NONE,SPF_PASS,T_TO_NO_BRKTS_FREEMAIL X-Spam-Check-By: apache.org Received-SPF: pass (athena.apache.org: domain of dokondr@gmail.com designates 74.125.83.48 as permitted sender) Received: from [74.125.83.48] (HELO mail-gw0-f48.google.com) (74.125.83.48) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 22 Apr 2010 10:02:37 +0000 Received: by gwb1 with SMTP id 1so1327748gwb.35 for ; Thu, 22 Apr 2010 03:02:16 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=gamma; h=domainkey-signature:mime-version:received:in-reply-to:references :date:received:message-id:subject:from:to:content-type; bh=SnB3mCfbMF6cd+LbbOlVnyB0M5DMwlugPnXExGUrTMI=; b=igIa75EVLOQ2rtDjOt9ruU28M2u6gdfpWLzDr6ATy6apo4YNv01FtHAfWQFImpwAew 0m/g56EOfJTF56XDt4UI7YeA5wKk4smSy4m6cjbAfuET4FsZMy4haEwtv1O8Ju9PihI1 htK8DV8TvsN30mAvBwgWgXCFUpdzX/rcibjyE= DomainKey-Signature: a=rsa-sha1; c=nofws; d=gmail.com; s=gamma; h=mime-version:in-reply-to:references:date:message-id:subject:from:to :content-type; b=TSczpuv5xNFH35fbstH7SSfJekquOqklV7+ErLOsKTBl4P/Hu7ztd/4xHusQUI8LI6 VNdC8J7htNGvTuxhLjjfJ029xzelfHl0854GED92sNKc//7j/wbv53MFGrx/vwiZIivH xZDuV/nRq9pCTJ9PxxB60hCJv9rw1a8EIUWgs= MIME-Version: 1.0 Received: by 10.90.92.5 with HTTP; Thu, 22 Apr 2010 03:02:16 -0700 (PDT) In-Reply-To: References: Date: Thu, 22 Apr 2010 14:02:16 +0400 Received: by 10.90.20.38 with SMTP id 38mr4875326agt.10.1271930536679; Thu, 22 Apr 2010 03:02:16 -0700 (PDT) Message-ID: Subject: Re: Function approximation in Mahout? From: "Dmitri O.Kondratiev" To: mahout-user@lucene.apache.org Content-Type: multipart/alternative; boundary=0016361e8418d159670484d0670d --0016361e8418d159670484d0670d Content-Type: text/plain; charset=ISO-8859-1 I have 165000 observations. Each observation is 180 component vector. One component in this vector is an integer in range from 0 to 200. Every vector is constructed from sensor readings taken at 180 consecutive times. Readings may fail at some unpredictable times, thus some vector components may be undefined. 40% - 60% of random vector components may be undefined in every vector. I am trying to find an approximation function that will allow me to predict sensor reading at any time. There may be different correlations between some of these vectors. In fact these sensors read traffic speed on different streets of a big city during a period of a month. What other methods for prediction with incomplete data can be used for the task I described? Thanks! >>From Ted Dunning >Subject Re: Function approximation in Mahout? >Date Wed, 21 Apr 2010 14:34:58 GMT > >Mahout does not have a lot of regression capabilities at this time, other >than various forms of binomial regression (SVM, logistic regression, >decision forests) but other forms of regression are relatively lacking. > >Commons math has some capabilities, but not in a particularly scalable form. > >What size is your problem? > >On Tue, Apr 20, 2010 at 2:07 PM, Dmitri O.Kondratiev wrote: > >> Hello, >> Does Mahout support any function approximation frameworks, such as greedy >> function approximation with gradient boosting (TreeNet)? >> http://en.wikipedia.org/wiki/TreeNet#Names >> >> Thanks! >> --0016361e8418d159670484d0670d--