mahout-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Ted Dunning <>
Subject Re: [jira] Commented: (MAHOUT-305) Combine both cooccurrence-based CF M/R jobs
Date Mon, 26 Apr 2010 21:38:25 GMT
On Mon, Apr 26, 2010 at 1:46 PM, Sean Owen (JIRA) <> wrote:

> Ted how do you like to pick which items to pay attention to for
> co-occurrence? I'm looking for something simple to start.

LLR is my standard answer.

> Though it's running pretty well (well a lot better than it was) at the
> moment, with the aggressive combiner chucking out low-frequency
> co-occurrence.

That still worries me.  I would expect that you would get better by
down-sampling high frequency items.

  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message