mahout-dev mailing list archives

From Robin Anil <robin.a...@gmail.com>
Subject Re: SparseVectors writing out a lot of data
Date Tue, 12 Jan 2010 09:08:10 GMT
Hi Sean, could you take a look at the patch and comment?

Robin

On Mon, Jan 11, 2010 at 10:39 PM, Sean Owen <srowen@gmail.com> wrote:

> If one needs a Reader based on the contents of a String, the
> StringReader is a far better way of doing this. This also has
> potential character set issues if the platform's default encoding
> varies. Definitely switch.
>
> On Mon, Jan 11, 2010 at 4:30 PM, Robin Anil <robin.anil@gmail.com> wrote:
> > Aargh. Found the bug. Can you spot it?
> >
> >      Text value = values.next();
> >      TokenStream ts =
> >          analyzer.tokenStream(key.toString(), new InputStreamReader(
> >              new ByteArrayInputStream(value.getBytes()),
> >              DictionaryVectorizer.CHARSET));
> >
>
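
The difference between the two ways of building a Reader can be sketched in plain Java, as a hedged illustration rather than the actual patch. ReusableBuffer is a hypothetical stand-in for a reusable value object such as Hadoop's Text, whose getBytes() returns a backing array that is only valid up to getLength(). Whether that is the exact bug in this thread is an assumption; the thread itself only states that StringReader is the better choice. UTF-8 is likewise assumed here in place of DictionaryVectorizer.CHARSET.

```java
import java.io.ByteArrayInputStream;
import java.io.IOException;
import java.io.InputStreamReader;
import java.io.Reader;
import java.io.StringReader;
import java.io.UncheckedIOException;
import java.nio.charset.StandardCharsets;

public class ReaderFromText {

    // Hypothetical stand-in for a reusable buffer: the backing array is
    // kept between calls and may be longer than the valid content.
    static final class ReusableBuffer {
        private byte[] bytes = new byte[0];
        private int length;

        void set(String s) {
            byte[] encoded = s.getBytes(StandardCharsets.UTF_8);
            if (encoded.length > bytes.length) {
                bytes = new byte[encoded.length];
            }
            System.arraycopy(encoded, 0, bytes, 0, encoded.length);
            length = encoded.length;
        }

        byte[] getBytes() { return bytes; }  // raw backing array
        int getLength() { return length; }   // number of valid bytes

        @Override
        public String toString() {
            // Decodes only the valid prefix of the backing array.
            return new String(bytes, 0, length, StandardCharsets.UTF_8);
        }
    }

    // Drains a Reader into a String.
    static String readAll(Reader reader) {
        try {
            StringBuilder sb = new StringBuilder();
            char[] buf = new char[256];
            int n;
            while ((n = reader.read(buf)) != -1) {
                sb.append(buf, 0, n);
            }
            return sb.toString();
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }

    // Pattern from the quoted code: wraps the whole backing array,
    // so stale bytes past getLength() are read as well.
    static String viaByteStream(ReusableBuffer value) {
        return readAll(new InputStreamReader(
            new ByteArrayInputStream(value.getBytes()), StandardCharsets.UTF_8));
    }

    // Suggested alternative: decode once, then read the String directly.
    static String viaStringReader(ReusableBuffer value) {
        return readAll(new StringReader(value.toString()));
    }

    public static void main(String[] args) {
        ReusableBuffer value = new ReusableBuffer();
        value.set("a much longer first record");
        value.set("short");  // reuses the larger backing array
        System.out.println("byte stream:   " + viaByteStream(value));
        System.out.println("string reader: " + viaStringReader(value));
    }
}
```

In this sketch the byte-stream version returns the second value followed by leftover bytes from the first, while the StringReader version returns only the current value. If the byte-oriented route were kept, ByteArrayInputStream's three-argument constructor (array, offset, length) with getLength() would bound the read.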
