mahout-user mailing list archives

From Ted Dunning
Subject Re: mahout PLSI (with some lucene, thrown in)
Date Tue, 23 Jun 2009 17:00:12 GMT
Yes.  This can be done.  It isn't necessarily real simple to do.

See for an
old (but still pretty good) example.

On Tue, Jun 23, 2009 at 6:45 AM, Paul Jones wrote:

> Imagine we have crawled 100K webpages, and we have 100 pages which show
> "red", 100 which show "crimson", and then 100 which show both "red" and
> "crimson". Is there a way to deduce that there may be an (albeit weak)
> relationship between red AND crimson? Of course we can pre-seed this info,
> which then gets weighted by actual results.
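
To make the counts in the question concrete, here is a minimal sketch of one common way to score such a relationship from co-occurrence counts alone: a 2x2 contingency table scored with pointwise mutual information and a log-likelihood ratio (G^2) statistic. This is not necessarily what the example referenced above does. The function names are hypothetical, and the reading of the counts (100 pages with only "red", 100 with only "crimson", 100 with both, out of 100,000) is an assumption.

```python
import math


def llr(k11, k12, k21, k22):
    """Log-likelihood ratio (G^2) for a 2x2 table of page counts.

    k11: pages with both terms     k12: pages with term A only
    k21: pages with term B only    k22: pages with neither term
    """
    n = k11 + k12 + k21 + k22
    rows = (k11 + k12, k21 + k22)
    cols = (k11 + k21, k12 + k22)
    cells = ((k11, k12), (k21, k22))
    g = 0.0
    for i in range(2):
        for j in range(2):
            k = cells[i][j]
            if k > 0:  # an empty cell contributes nothing
                expected = rows[i] * cols[j] / n  # count under independence
                g += k * math.log(k / expected)
    return 2.0 * g


def pmi(k11, k12, k21, k22):
    """Pointwise mutual information (base 2) of the two terms co-occurring."""
    n = k11 + k12 + k21 + k22
    p_both = k11 / n
    p_a = (k11 + k12) / n
    p_b = (k11 + k21) / n
    return math.log2(p_both / (p_a * p_b))


# Assumed reading of the question: 100 "red" only, 100 "crimson" only,
# 100 with both, and the rest of the 100,000 crawled pages with neither.
both, red_only, crimson_only = 100, 100, 100
neither = 100_000 - (both + red_only + crimson_only)

print("LLR:", llr(both, red_only, crimson_only, neither))
print("PMI:", pmi(both, red_only, crimson_only, neither))
```

A score near zero means the two terms co-occur about as often as chance would predict; a large positive score suggests an association worth keeping. Pre-seeded knowledge, as suggested in the question, could be blended in as prior counts added to the table before scoring, though that is a design choice and not something this thread specifies.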
