mahout-dev mailing list archives

From "Isabel Drost (Commented) (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (MAHOUT-716) Implement Boosting
Date Tue, 20 Mar 2012 21:47:39 GMT

    [ https://issues.apache.org/jira/browse/MAHOUT-716?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13233818#comment-13233818 ]

Isabel Drost commented on MAHOUT-716:
-------------------------------------

After a period of little activity on this issue, I took a brief look at the patch. Some comments
(to be taken with a grain of salt, since I haven't had the cycles to follow the project as
closely as I would have liked in the past months):

You mentioned a forked Git repo on GitHub. Is it still online?

So far this looks like a rather isolated change. Would it make sense to integrate it with the
existing classification APIs, e.g. org.apache.mahout.classifier.AbstractVectorClassifier?

Also, some more documentation and a usage example for the uninitiated would be great. In addition
to links to the one or two publications the implementation is based on, it's always great to have
some information on the strengths and weaknesses of the implemented solution (yes, I know
we are doing pretty badly along these lines with other bits and pieces we have; it would still
be nice to have).
                
> Implement Boosting
> ------------------
>
>                 Key: MAHOUT-716
>                 URL: https://issues.apache.org/jira/browse/MAHOUT-716
>             Project: Mahout
>          Issue Type: New Feature
>          Components: Classification
>    Affects Versions: 0.5
>            Reporter: Hector Yee
>            Assignee: Ted Dunning
>            Priority: Minor
>              Labels: features
>         Attachments: MAHOUT-716.patch, MAHOUT-716.patch
>
>   Original Estimate: 72h
>  Remaining Estimate: 72h
>
> Implement boosting (grad boost variant) with l1-regularization and induction.
> The gradient part is scalable and parallel and the induction part allows stochastic hypothesis
> generation for speed.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        
