Universal input file format for classifier algorithms in Mahout
---------------------------------------------------------------
Key: MAHOUT-785
URL: https://issues.apache.org/jira/browse/MAHOUT-785
Project: Mahout
Issue Type: Improvement
Components: Classification
Affects Versions: 0.6
Reporter: XiaoboGu
I think a universal input file format would be much more convenient for users, especially
command-line users, and we should also consider using a common set of command-line options
for the classification algorithms, such as options for the target and predictor variables
and their types. Users could then prepare their data once and build different models to find
the best one. Currently we should consider the following:
1. SGD LogisticRegression
2. NaiveBayes
3. Bayes
4. Random Forest
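
As a rough illustration of the idea, the sketch below shows one way a shared front end could describe an input file once and hand the same examples to any of the algorithms above. It is written in Python only for brevity. The option names (--target, --predictors, --types), the type labels "numeric" and "word", the InputSpec class and both function names are assumptions made for illustration, and a CSV file with a header row is assumed as the common format.

```python
import argparse
import csv
import io
from dataclasses import dataclass


@dataclass
class InputSpec:
    """Hypothetical description of a shared classifier input file."""
    target: str        # name of the column holding the class label
    predictors: list   # names of the columns used as features
    types: dict        # predictor name -> "numeric" or "word"


def parse_spec(argv):
    """Parse the (assumed) common command-line options into an InputSpec."""
    parser = argparse.ArgumentParser(prog="classifier-input")
    parser.add_argument("--target", required=True)
    parser.add_argument("--predictors", nargs="+", required=True)
    parser.add_argument("--types", nargs="+", required=True,
                        choices=["numeric", "word"])
    args = parser.parse_args(argv)
    if len(args.types) != len(args.predictors):
        parser.error("need exactly one type per predictor")
    return InputSpec(args.target, args.predictors,
                     dict(zip(args.predictors, args.types)))


def read_examples(text, spec):
    """Yield (features, label) pairs from CSV text that has a header row."""
    for row in csv.DictReader(io.StringIO(text)):
        features = {}
        for name in spec.predictors:
            value = row[name]
            # Numeric predictors become floats; "word" predictors stay strings.
            features[name] = float(value) if spec.types[name] == "numeric" else value
        yield features, row[spec.target]
```

With something like this, a call such as parse_spec(["--target", "color", "--predictors", "x", "shape", "--types", "numeric", "word"]) would be made once, and the resulting examples could be passed to whichever trainer is being compared.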
--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira