mahout-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Ted Dunning (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (MAHOUT-708) Update to Hadoop 0.21
Date Sat, 21 May 2011 21:50:47 GMT

    [ https://issues.apache.org/jira/browse/MAHOUT-708?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13037463#comment-13037463
] 

Ted Dunning commented on MAHOUT-708:
------------------------------------

Frankly, I think that the next version of Hadoop that provides any compelling features for
large enterprises is likely to be 0.23 where MR nextgen comes into play.  Ironically, the
major reason that version will be exciting is that it allows for compatibility with old API's.
 The ability to have old and new side-by-side is a pre-requisite for any large cluster upgrade.

> Update to Hadoop 0.21
> ---------------------
>
>                 Key: MAHOUT-708
>                 URL: https://issues.apache.org/jira/browse/MAHOUT-708
>             Project: Mahout
>          Issue Type: Task
>          Components: Classification, Clustering, Collaborative Filtering, Frequent Itemset/Association
Rule Mining
>    Affects Versions: 0.5
>            Reporter: Sean Owen
>            Assignee: Sean Owen
>              Labels: hadoop
>             Fix For: 0.6
>
>
> I suggest we should move to Hadoop 0.21 for the next release. It is the current release,
soon to be superseded by 0.22. It matches more closely what CDH3/4 users use. It has bug fixes,
and crucially some features that make joins much less painful.
> The drawback is that EMR does not yet support it. I still suggest we forge ahead as one
imagines it will be supported by the time we release 0.6.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

Mime
View raw message