I have 165000 observations. Each observation is 180 component vector. One
component in this vector is an integer in range from 0 to 200.
Every vector is constructed from sensor readings taken at 180 consecutive
times.
Readings may fail at some unpredictable times, thus some vector components
may be undefined.
40%  60% of random vector components may be undefined in every vector.
I am trying to find an approximation function that will allow me to predict
sensor reading at any time. There may be different correlations between some
of these vectors.
In fact these sensors read traffic speed on different streets of a big city
during a period of a month.
What other methods for prediction with incomplete data can be used for the
task I described?
Thanks!
>>From Ted Dunning <ted.dunn...@gmail.com>
>Subject Re: Function approximation in Mahout?
>Date Wed, 21 Apr 2010 14:34:58 GMT
>
>Mahout does not have a lot of regression capabilities at this time, other
>than various forms of binomial regression (SVM, logistic regression,
>decision forests) but other forms of regression are relatively lacking.
>
>Commons math has some capabilities, but not in a particularly scalable
form.
>
>What size is your problem?
>
>On Tue, Apr 20, 2010 at 2:07 PM, Dmitri O.Kondratiev <dokondr@gmail.com
>wrote:
>
>> Hello,
>> Does Mahout support any function approximation frameworks, such as greedy
>> function approximation with gradient boosting (TreeNet)?
>> http://en.wikipedia.org/wiki/TreeNet#Names
>>
>> Thanks!
>>
