mahout-dev mailing list archives

From "Deneche A. Hakim (JIRA)" <>
Subject [jira] Updated: (MAHOUT-145) PartialData mapreduce Random Forests
Date Mon, 17 Aug 2009 08:34:14 GMT


Deneche A. Hakim updated MAHOUT-145:

    Attachment: partial_August_17.patch

*GSoC latest patch*
* DONE: move rf.ref.examples.BreimanExample to examples/
* DONE: move rf.ref.examples.CpuTest to core/tests (tools package)
* DONE: move rf.ref.examples.MemoryUsage to core/tests (tools package)
* DONE: move rf.ref.examples.PartialStep2Test to core/tests (tools package), becomes PartialStep2Check
* DONE: move content of rf.ref.examples.UciDescriptors to ExampleUtils

* DONE: org.apache.mahout.rf becomes org.apache.mahout.df (Decision Forest)

* DONE: Check that all files contain Apache License

* DONE: add a link to Andrew's tutorial in DefaultTreeBuilder

This should be the last patch concerning GSoC. The next ones will target the 0.2 release.

> PartialData mapreduce Random Forests
> ------------------------------------
>                 Key: MAHOUT-145
>                 URL:
>             Project: Mahout
>          Issue Type: New Feature
>          Components: Classification
>            Reporter: Deneche A. Hakim
>            Priority: Minor
>         Attachments: partial_August_10.patch, partial_August_13.patch, partial_August_15.patch,
>                      partial_August_17.patch, partial_August_2.patch, partial_August_9.patch
>
> This implementation is based on a suggestion by Ted:
> "modify the original algorithm to build multiple trees for different portions of the
> data. That loses some of the solidity of the original method, but could actually do better
> if the splits exposed non-stationary behavior."
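
For readers unfamiliar with the idea in Ted's suggestion, here is a rough sketch of what "multiple trees for different portions of the data" could look like. It is not the code in the attached patches and does not use any Mahout API. The class PartialForestSketch, its Stump type, and the methods train, buildForest and predict are all hypothetical names made up for this illustration. To keep it short, a one-feature threshold classifier (a decision stump) stands in for a full decision tree, labels are assumed to be binary (0 or 1), and the partitions are plain contiguous row ranges rather than mapreduce input splits.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Random;

public class PartialForestSketch {

    /** One-feature threshold classifier; a stand-in for a full decision tree. */
    static final class Stump {
        final int feature;
        final double threshold;
        final int leftLabel;   // label predicted when value <= threshold
        final int rightLabel;  // label predicted when value > threshold

        Stump(int feature, double threshold, int leftLabel, int rightLabel) {
            this.feature = feature;
            this.threshold = threshold;
            this.leftLabel = leftLabel;
            this.rightLabel = rightLabel;
        }

        int classify(double[] x) {
            return x[feature] <= threshold ? leftLabel : rightLabel;
        }
    }

    /** Trains a stump using ONLY rows [start, end), on one randomly chosen feature. */
    static Stump train(double[][] data, int[] labels, int start, int end, Random rng) {
        int feature = rng.nextInt(data[0].length);
        Stump best = null;
        int bestErrors = Integer.MAX_VALUE;
        for (int i = start; i < end; i++) {
            for (int left = 0; left <= 1; left++) {
                Stump candidate = new Stump(feature, data[i][feature], left, 1 - left);
                int errors = 0;
                for (int j = start; j < end; j++) {
                    if (candidate.classify(data[j]) != labels[j]) {
                        errors++;
                    }
                }
                if (errors < bestErrors) {
                    bestErrors = errors;
                    best = candidate;
                }
            }
        }
        return best;
    }

    /** Splits the rows into contiguous partitions and grows trees on each one separately. */
    static List<Stump> buildForest(double[][] data, int[] labels,
                                   int partitions, int treesPerPartition, long seed) {
        if (partitions < 1 || partitions > data.length) {
            throw new IllegalArgumentException("partitions must be in [1, number of rows]");
        }
        Random rng = new Random(seed);
        List<Stump> forest = new ArrayList<>();
        int n = data.length;
        for (int p = 0; p < partitions; p++) {
            int start = p * n / partitions;
            int end = (p + 1) * n / partitions;
            for (int t = 0; t < treesPerPartition; t++) {
                forest.add(train(data, labels, start, end, rng));
            }
        }
        return forest;
    }

    /** Majority vote over all trees from all partitions; ties go to label 0. */
    static int predict(List<Stump> forest, double[] x) {
        int votesForOne = 0;
        for (Stump s : forest) {
            votesForOne += s.classify(x);
        }
        return 2 * votesForOne > forest.size() ? 1 : 0;
    }

    public static void main(String[] args) {
        double[][] data = {{1}, {8}, {2}, {9}, {3}, {10}, {4}, {11}};
        int[] labels = {0, 1, 0, 1, 0, 1, 0, 1};
        List<Stump> forest = buildForest(data, labels, 2, 3, 42L);
        System.out.println("trees: " + forest.size());
        System.out.println("predict(0.0)  = " + predict(forest, new double[] {0.0}));
        System.out.println("predict(12.0) = " + predict(forest, new double[] {12.0}));
    }
}
```

No tree in this sketch ever sees rows outside its own partition, which is the trade-off the quote describes: each tree learns from less data, but trees from different partitions can pick up different local behavior before the final vote combines them.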

This message is automatically generated by JIRA.
You can reply to this email to add a comment to the issue online.
