mahout-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Suneel Marthi <suneel.mar...@gmail.com>
Subject Re: does Mahout support Hadoop 2.5.0 ?
Date Fri, 22 Aug 2014 20:57:22 GMT
u would see that error with KMeans if u do not provide an input centroids
or ask KMeans to randomly generate initial centroids.
look at the wiki page for running Mahout KMeans and ensure that u r not
missing anything.


On Sat, Aug 23, 2014 at 2:22 AM, Wei Zhang <wei.wisc@gmail.com> wrote:

> Thanks a lot Pat for the pointer! I tried with the latest trunk against
> Hadoop 2.5.0 for the k-means cluster example, it succesfully finished the
> text to tifidf vectorization MR jobs but failed at the actual clustering
> complaining
>
>  Exception in thread "main" java.lang.IllegalStateException: No input
> clusters found in ...  Check your -c argument.
>
> I had the same problem before when running against Hadoop 1.2.1, but the
> problem went away after I reformatted HDFS. I didn't get lucky with HDFS
> reformat this time. I'll do a bit more investigation later.
>
> Thanks again for the pointers!
>
> Wei
>
>
> On Fri, Aug 22, 2014 at 11:20 AM, Pat Ferrel <pat@occamsmachete.com>
> wrote:
>
> > I’m no expert here since I’m stuck on hadoop 1.2.1 but the latest master
> > on github is meant to be used with hadoop 2.x as the default.
> > Mahout 0.9 had some experimental support but is AFAIK not recommended for
> > h 2. No clue about hadoop 2.5 specifically but if it truly is a minor
> > backwards compatible release you may be ok. Mahout now uses Spark, which
> > claims support through hadoop 2.4.x.
> >
> > On Aug 20, 2014, at 3:03 PM, Wei Zhang <wei.wisc@gmail.com> wrote:
> >
> > Hello,
> >
> > After a system upgrade, we have a Hadoop 2.5.0 cloud instance. We are
> > trying to run Mahout on top of it. We are using Mahout 0.9 (downloaded
> from
> > http://mahout.apache.org/general/downloads.html )
> >
> > https://issues.apache.org/jira/browse/MAHOUT-1329 indicates Mahout
> > supports
> > Hadoop 2.2.0
> >
> > But even with mvn clean install -Dhadoop2.version=2.2.0 -DskipTests=true,
> > it doesn't appear Hadoop 2.x jar is downloaded anywhere (i.e., there is
> > only Hadoop 1.2.1 jar downloaded, which is what the pom.xml dictates).
> >
> > Further, even a simple HDFS I/O call would fail (i.e., creating an HDFS
> > file). It seems Hadoop 2.2.0 is not quite compatible with Mahout 0.9
> >
> > My question is:
> > (1) Does Mahout support Hadoop 2.5.0 or should I check out the code from
> > git (as suggested at
> > http://mahout.apache.org/developers/buildingmahout.html)
> > in order to get a Hadoop 2.x friendly Mahout ?
> >
> > Thanks!
> >
> > Wei
> >
> >
>

Mime
  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message