mahout-dev mailing list archives

From Ted Dunning <ted.dunn...@gmail.com>
Subject Re: [jira] Commented: (MAHOUT-305) Combine both cooccurrence-based CF M/R jobs
Date Mon, 26 Apr 2010 21:38:25 GMT
On Mon, Apr 26, 2010 at 1:46 PM, Sean Owen (JIRA) <jira@apache.org> wrote:

> Ted, how do you like to pick which items to pay attention to for
> co-occurrence? I'm looking for something simple to start.
>

LLR (the log-likelihood ratio test) is my standard answer.
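
For readers unfamiliar with the test, here is a minimal sketch of the log-likelihood ratio score for one item pair, written from the standard 2x2 contingency-table formulation. The function names and the count layout (k11 = users with both items, k12 and k21 = users with only one of them, k22 = users with neither) are assumptions made for this illustration, not code from the thread or from the job under discussion.

```python
import math


def x_log_x(x):
    # Convention: 0 * log(0) is treated as 0.
    return 0.0 if x == 0 else x * math.log(x)


def unnormalized_entropy(*counts):
    # N * log(N) - sum(k * log(k)), i.e. Shannon entropy scaled by the total N.
    return x_log_x(sum(counts)) - sum(x_log_x(c) for c in counts)


def log_likelihood_ratio(k11, k12, k21, k22):
    # k11: both items, k12: first item only,
    # k21: second item only, k22: neither item.
    row = unnormalized_entropy(k11 + k12, k21 + k22)
    col = unnormalized_entropy(k11 + k21, k12 + k22)
    mat = unnormalized_entropy(k11, k12, k21, k22)
    # Clamp at zero to absorb tiny negative values from floating-point rounding.
    return max(0.0, 2.0 * (row + col - mat))
```

A pair whose co-occurrence is exactly what independence predicts scores close to zero, while a pair that co-occurs far more or far less often than expected scores high. One plausible use is to keep, for each item, only the co-occurring items with the largest scores.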


>
> Though it's running pretty well (well a lot better than it was) at the
> moment, with the aggressive combiner chucking out low-frequency
> co-occurrence.
>

That still worries me.  I would expect better results from down-sampling
high-frequency items than from discarding low-frequency co-occurrences.
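
The message does not say how the down-sampling would be done. As one possible reading, the sketch below caps the number of interactions kept per item by taking a uniform random sample for any item above the cap. The function name, the `max_per_item` parameter, and the (user, item) pair layout are all hypothetical, chosen only for this illustration.

```python
import random
from collections import defaultdict


def downsample_frequent_items(interactions, max_per_item, seed=0):
    # interactions: iterable of (user, item) pairs (assumed layout).
    rng = random.Random(seed)
    users_by_item = defaultdict(list)
    for user, item in interactions:
        users_by_item[item].append(user)

    kept = []
    for item, users in users_by_item.items():
        if len(users) > max_per_item:
            # Frequent item: keep a uniform random sample of its interactions.
            users = rng.sample(users, max_per_item)
        # Rare items pass through untouched.
        kept.extend((user, item) for user in users)
    return kept
```

Unlike dropping low-frequency co-occurrences, this leaves every interaction with a rare item in place and only thins out the items that would otherwise dominate the pair counts.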
