mahout-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Ted Dunning (Commented) (JIRA)" <>
Subject [jira] [Commented] (MAHOUT-943) Improbe the way to make the split point on DF.
Date Wed, 11 Jan 2012 12:43:45 GMT


Ted Dunning commented on MAHOUT-943:

Also, that isn't a particularly good way to compute variance in the first place.

Better to use Welford's method.  Better, use something like the OnlineSummarizer.

> Improbe the way to make the split point on DF.
> ----------------------------------------------
>                 Key: MAHOUT-943
>                 URL:
>             Project: Mahout
>          Issue Type: Improvement
>          Components: Classification
>            Reporter: Ikumasa Mukai
>              Labels: DecisionForest
> The numericalSplit() on OptIgSplit adopts the way to regard the attribute value having
the best IG as the split point.
> But I think this is a little too strict and think it is better on some situation to 
use the average value which is calced with the best IG value and the 2nd value.

This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators:!default.jspa
For more information on JIRA, see:


View raw message