mahout-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Sean Owen (JIRA)" <>
Subject [jira] Commented: (MAHOUT-305) Combine both cooccurrence-based CF M/R jobs
Date Tue, 27 Apr 2010 12:17:31 GMT


Sean Owen commented on MAHOUT-305:

I see, fair enough. Even for this simplistic initial system, something better is called for.
Perhaps the mappers can keep a count of how many times each item has been seen and favor co-occurrences
among items that have *not* been seen. they wouldn't have a global count but such a simple
heuristic may be efficient and effective. For now I might arbitrarily prune, say, for user
vectors with more than 50 preferences.

> Combine both cooccurrence-based CF M/R jobs
> -------------------------------------------
>                 Key: MAHOUT-305
>                 URL:
>             Project: Mahout
>          Issue Type: Improvement
>          Components: Collaborative Filtering
>    Affects Versions: 0.2
>            Reporter: Sean Owen
>            Assignee: Ankur
>            Priority: Minor
> We have two different but essentially identical MapReduce jobs to make recommendations
based on item co-occurrence:{item,cooccurrence}. They ought
to be merged. Not sure exactly how to approach that but noting this in JIRA, per Ankur.

This message is automatically generated by JIRA.
You can reply to this email to add a comment to the issue online.

View raw message