mahout-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Isabel Drost (Commented) (JIRA)" <>
Subject [jira] [Commented] (MAHOUT-716) Implement Boosting
Date Tue, 20 Mar 2012 21:47:39 GMT


Isabel Drost commented on MAHOUT-716:

After not much activity - took a brief look at the patch. Some comments (to be taken with
a grain of salt after I didn't have the cycles to follow the project as closely as I would
have liked in the past months):

You mentioned a forked git repo on github - is it still online?

So far looks like a rather isolated change. Would it make sense to integrate it with existing
classification APIs e.g. org.apache.mahout.classifier.AbstractVectorClassifier?

Also some more documentation and a usage example for the un-initiated would be great: In addition
to links to one or two publications the implementation is based on it's always great to have
some information on the strengths and weaknesses of the implemented solution (yes, I know
we are doing pretty badly along these lines with other bits and pieces we have - still would
be nice to have).
> Implement Boosting
> ------------------
>                 Key: MAHOUT-716
>                 URL:
>             Project: Mahout
>          Issue Type: New Feature
>          Components: Classification
>    Affects Versions: 0.5
>            Reporter: Hector Yee
>            Assignee: Ted Dunning
>            Priority: Minor
>              Labels: features
>         Attachments: MAHOUT-716.patch, MAHOUT-716.patch
>   Original Estimate: 72h
>  Remaining Estimate: 72h
> Implement boosting (grad boost variant) with l1-regularization and induction.
> The gradient part is scalable and parallel and the induction part allows stochastic hypothesis
generation for speed.

This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators:!default.jspa
For more information on JIRA, see:


View raw message