mahout-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Ted Dunning <>
Subject Re: Memory and Speed Questions for Item-Based-Recommender
Date Fri, 10 Jul 2009 19:57:53 GMT
Don't think so.  Sean should comment definitively.

It is actually very easy to do.  The output of the recommendation off-line
process (in my case, statistical filtering of the coocurrence matrix, in
other cases something different) is generally a sparse matrix of item-item
links.  Each line of this sparse matrix can be considered a document in
creating a Lucene index.  You will have to use a correct analyzer and a line
by line document segmenter, but that is trivial.

Then recommendation is a simple query step.

You guys at Linked-in have nice ability to present Lucene results in
real-time so the part after gettting the item-item matrix should be dead
simple for you.

On Fri, Jul 10, 2009 at 12:48 PM, Jason Rutherglen <> wrote:

> Is there an example of this (using Lucene to store item-item links in
> Lucene) in Mahout?  Sounds interesting.
> On Fri, Jul 10, 2009 at 11:35 AM, Ted Dunning <>
> wrote:
> > Storing the item-item links in Lucene and forming a query with recent
> > history is a pretty easy way to get real-time recommendations.  This can
> > also get rid of the cache because standard measures applied to make
> Lucene
> > fast will work on this.
> >

  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message