mahout-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "XiaoboGu (JIRA)" <j...@apache.org>
Subject [jira] [Created] (MAHOUT-785) Universal input file format for classifier algorithms in Mahout
Date Sun, 14 Aug 2011 04:42:27 GMT
Universal input file format for classifier algorithms in Mahout
---------------------------------------------------------------

                 Key: MAHOUT-785
                 URL: https://issues.apache.org/jira/browse/MAHOUT-785
             Project: Mahout
          Issue Type: Improvement
          Components: Classification
    Affects Versions: 0.6
            Reporter: XiaoboGu


I think a universal input file format is much more convinient for users, especially command
line users, and we should even consider use some universal command line options for the classification
algorithms, such as options for target/predictor variables and their types. Then users can
prepare their data once, and build different models to get the best one. Currentlly we should
consider the following:
1. SGD LogisticRegression
2. NaiveBayes
3. Bayes
4. Random Forest

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

Mime
View raw message