mahout-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Ted Dunning <ted.dunn...@gmail.com>
Subject Re: LDA from Lucene Indexes
Date Wed, 04 May 2011 18:40:17 GMT
Good point.

On Wed, May 4, 2011 at 11:31 AM, Jake Mannix <jake.mannix@gmail.com> wrote:

> On Wed, May 4, 2011 at 10:46 AM, Ted Dunning <ted.dunning@gmail.com>
> wrote:
>
> > Pipelining is good for abstraction and really bad for performance (in the
> > map-reduce world).
> >
> > My thought is that we could have a multipurpose tool.  Input would be a
> > lucene index and the program would read term vectors or original text as
> > available.  Output would be either sequence file full of text or sequence
> > file full of vectors.
> >
>
> Ok, sure, then this is modifying the lucene.vectors code, not the
> seq2sparse code, right?
>
>  -jake
>

Mime
  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message