mahout-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Sean Owen <sro...@gmail.com>
Subject Re: 20news
Date Tue, 05 Jul 2011 07:20:17 GMT
I committed a change to make the parsing bits I found in .bayes. use space
and tab. You can try again. I confess I don't know this code and there's a
lot of little pieces of parsing here and there so don't know if this is the
heart of the issue.

On Mon, Jul 4, 2011 at 4:08 PM, Vijay Santhanam
<vijay.santhanam@gmail.com>wrote:

> Hi Sean,
>
> Thanks for responding.
>
> I would expect the sequential classifer tokenizer to be identical to what's
> used in the parallel classifier tokenizer.
>
> If that's not possible, then NGrams should perhaps be configurable with
> where it finds it's first token (i.e. the label).
>
> I'm very new to hadoop and this world, so I'm not sure what I'm looking at
> when it the classifier goes into mapreduce execution.
>
> -V
>
>
>

Mime
  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message