mahout-dev mailing list archives

From "Andrew Palumbo (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (MAHOUT-1655) Refactor module dependencies
Date Tue, 31 Mar 2015 22:56:54 GMT

    [ https://issues.apache.org/jira/browse/MAHOUT-1655?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14389570#comment-14389570 ]

Andrew Palumbo commented on MAHOUT-1655:
----------------------------------------

[~pferrel] I'm actually getting an error in mahout spark-testnb (sorry, I'd only tested spark-trainnb
before on this branch).

It looks like the t-digest jars are not being picked up in the dependency-reduced jar. I'm
not sure if it's from this refactoring. Maybe best to address it after you're done with this?


{code}
 Exception in thread "main" java.lang.NoClassDefFoundError: com/tdunning/math/stats/TDigest
        at org.apache.mahout.classifier.stats.ResultAnalyzer.<init>(ClassifierStats.scala:64)
        at org.apache.mahout.classifier.naivebayes.NaiveBayes$class.test(NaiveBayes.scala:303)
        at org.apache.mahout.classifier.naivebayes.NaiveBayes$.test(NaiveBayes.scala:336)
        at org.apache.mahout.drivers.TestNBDriver$.process(TestNBDriver.scala:105)
        at org.apache.mahout.drivers.TestNBDriver$$anonfun$main$1.apply(TestNBDriver.scala:77)
        at org.apache.mahout.drivers.TestNBDriver$$anonfun$main$1.apply(TestNBDriver.scala:75)
        at scala.Option.map(Option.scala:145)
...
{code}
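
A quick way to confirm a missing-class diagnosis like the one above is to look for the class entry inside the assembled jar and to try resolving it on the running classpath. The following is only a minimal sketch: the class name `JarClassCheck` and its helper methods are hypothetical and not part of Mahout, and the jar path has to be supplied by the caller because the actual artifact name is not given in this message.

```java
import java.io.IOException;
import java.util.jar.JarFile;

// Hypothetical diagnostic helper, not part of Mahout.
public class JarClassCheck {

    /** Converts a binary class name (dots) to the entry path used inside a jar. */
    public static String entryName(String className) {
        return className.replace('.', '/') + ".class";
    }

    /** Returns true if the jar at jarPath contains an entry for className. */
    public static boolean jarContains(String jarPath, String className) throws IOException {
        try (JarFile jar = new JarFile(jarPath)) {
            return jar.getEntry(entryName(className)) != null;
        }
    }

    /** Returns true if className resolves on the current classpath (without initializing it). */
    public static boolean onClasspath(String className) {
        try {
            Class.forName(className, false, JarClassCheck.class.getClassLoader());
            return true;
        } catch (ClassNotFoundException | LinkageError e) {
            return false;
        }
    }

    public static void main(String[] args) throws IOException {
        // Class name taken from the stack trace above.
        String cls = "com.tdunning.math.stats.TDigest";
        if (args.length > 0) {
            // args[0] is the path to whichever jar should bundle t-digest (caller-supplied).
            System.out.println(args[0] + " contains " + cls + ": " + jarContains(args[0], cls));
        }
        System.out.println("on classpath: " + onClasspath(cls));
    }
}
```

If the jar check prints `false`, the class was not bundled at build time. If it prints `true` but the classpath check still fails, the jar is probably not on the classpath the driver is launched with.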

> Refactor module dependencies
> ----------------------------
>
>                 Key: MAHOUT-1655
>                 URL: https://issues.apache.org/jira/browse/MAHOUT-1655
>             Project: Mahout
>          Issue Type: Improvement
>          Components: mrlegacy
>    Affects Versions: 0.9
>            Reporter: Pat Ferrel
>            Assignee: Andrew Musselman
>            Priority: Critical
>             Fix For: 0.10.0
>
>
> Make a new module, call it mahout-hadoop. Move anything there that is currently in mrlegacy
> but used in math-scala or spark. Remove dependencies on mrlegacy altogether if possible by
> using other core classes.
> The goal is to have math-scala and spark module depend on math, and a small module called
> mahout-hadoop (much smaller than mrlegacy).



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
