mahout-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Robin Anil <robin.a...@gmail.com>
Subject Re: Does it make sense to use Mahout for text classification when I have a huge number of documents but a small number of labels?
Date Wed, 17 Apr 2013 21:58:46 GMT
You wont its tiny amount of data. Mapper are determined by the split size
and input shards. Either shard the input more than 10 or reduce the map
split size.

Robin Anil | Software Engineer | +1 312 869 2602 | Google Inc.


On Wed, Apr 17, 2013 at 3:32 PM, Ryan Compton <compton.ryan@gmail.com>wrote:

> Any ideas where to look? Does anyone get more than 20 mappers when
> running the 20 news groups data?
>
> On Tue, Apr 16, 2013 at 9:04 PM, Robin Anil <robin.anil@gmail.com> wrote:
> > Sounds like a config issue. the Mr version should be able to parallelize
> > based on the size of the input.
>

Mime
  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message