mahout-dev mailing list archives

From "Andrew Palumbo (JIRA)" <j...@apache.org>
Subject [jira] [Reopened] (MAHOUT-1493) Port Naive Bayes to the Spark DSL
Date Thu, 26 Mar 2015 22:35:53 GMT

     [ https://issues.apache.org/jira/browse/MAHOUT-1493?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Andrew Palumbo reopened MAHOUT-1493:
------------------------------------

Reopening to finish some work and clean-up here:
 - fix #78 based on Pat's comments and push
 - add CLI options for alpha_i and -overwrite (save the regex key parser option for 0.10.1)
 - Currently testNB brings all code into memory up front and runs sequentially (would like to get this fixed for 0.10.0)
      -- Work is done, except that the Classifier classes need to be fully serializable so they can be used in mapBlock closures
 - Clean up code, comments, etc.
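
Illustration of the serialization point above (not part of the original message): on the JVM, a closure shipped to workers can only be serialized if everything it captures is serializable. The sketch below uses plain Java serialization to show this. ToyClassifier, OpaqueClassifier and the helper methods are hypothetical names made up for the example; they are not Mahout or Spark APIs, and the actual fix in MAHOUT-1493 may look different.

```java
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.io.ObjectInputStream;
import java.io.ObjectOutputStream;
import java.io.Serializable;
import java.util.function.Function;

public class SerializableCheck {

    // Hypothetical stand-in for a classifier model; NOT a Mahout class.
    static class ToyClassifier implements Serializable {
        private static final long serialVersionUID = 1L;
        private final double[] weights;

        ToyClassifier(double[] weights) {
            this.weights = weights.clone();
        }

        // Dot product of the stored weights with the input vector.
        double score(double[] x) {
            double s = 0.0;
            for (int i = 0; i < weights.length; i++) {
                s += weights[i] * x[i];
            }
            return s;
        }
    }

    // Same idea, but it does not implement Serializable.
    static class OpaqueClassifier {
        double score(double[] x) {
            return 0.0;
        }
    }

    static byte[] serialize(Object o) throws IOException {
        ByteArrayOutputStream bos = new ByteArrayOutputStream();
        try (ObjectOutputStream out = new ObjectOutputStream(bos)) {
            out.writeObject(o);
        }
        return bos.toByteArray();
    }

    static Object deserialize(byte[] bytes) throws IOException, ClassNotFoundException {
        try (ObjectInputStream in = new ObjectInputStream(new ByteArrayInputStream(bytes))) {
            return in.readObject();
        }
    }

    // True if Java serialization accepts the object, false if it throws an IOException
    // (NotSerializableException is a subclass of IOException).
    static boolean isSerializable(Object o) {
        try {
            serialize(o);
            return true;
        } catch (IOException e) {
            return false;
        }
    }

    // A serializable lambda that captures a serializable classifier.
    static Function<double[], Double> closureOverToy(ToyClassifier c) {
        return (Function<double[], Double> & Serializable) x -> c.score(x);
    }

    // The lambda itself is marked Serializable, but the captured object is not.
    static Function<double[], Double> closureOverOpaque(OpaqueClassifier c) {
        return (Function<double[], Double> & Serializable) x -> c.score(x);
    }

    public static void main(String[] args) throws Exception {
        ToyClassifier good = new ToyClassifier(new double[] {1.0, 2.0});
        System.out.println("serializable closure: " + isSerializable(closureOverToy(good)));
        System.out.println("opaque closure: " + isSerializable(closureOverOpaque(new OpaqueClassifier())));

        // Round trip: the deserialized copy scores the same as the original.
        ToyClassifier copy = (ToyClassifier) deserialize(serialize(good));
        System.out.println("score after round trip: " + copy.score(new double[] {3.0, 4.0}));
    }
}
```

A check like isSerializable can be run in a unit test before a classifier is handed to distributed code, so a missing Serializable marker shows up locally rather than as a task failure on the cluster.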
 

> Port Naive Bayes to the Spark DSL
> ---------------------------------
>
>                 Key: MAHOUT-1493
>                 URL: https://issues.apache.org/jira/browse/MAHOUT-1493
>             Project: Mahout
>          Issue Type: Bug
>          Components: Classification
>            Reporter: Sebastian Schelter
>            Assignee: Andrew Palumbo
>              Labels: DSL, h2o, scala
>             Fix For: 0.10.0
>
>         Attachments: MAHOUT-1493.patch, MAHOUT-1493.patch, MAHOUT-1493.patch, MAHOUT-1493.patch, MAHOUT-1493a.patch
>
>
> Port our Naive Bayes implementation to the new Spark DSL. Shouldn't require more than a few lines of code.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
