mahout-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Ankur (JIRA)" <>
Subject [jira] Commented: (MAHOUT-103) Co-occurence based nearest neighbourhood
Date Thu, 12 Nov 2009 10:15:39 GMT


Ankur commented on MAHOUT-103:

>Re-post an updated patch ....

Sure I'll have the updated code coming by early next week.

>If it's basically sound I'd like to mention it ....

+10, The more people know about it the better chances it has of being used :-)  

>I use the GroupLens, Jester, Netflix data sets regularly. Indeed, just drop the rating

Simply dropping the rating might introduce too much noise. I was thinking of keeoing only
those that have ratings > 2.5 (or 2 to be more liberal). 

> Co-occurence based nearest neighbourhood
> ----------------------------------------
>                 Key: MAHOUT-103
>                 URL:
>             Project: Mahout
>          Issue Type: New Feature
>          Components: Collaborative Filtering
>            Reporter: Ankur
>            Assignee: Ankur
>         Attachments: jira-103.patch
> Nearest neighborhood type queries for users/items can be answered efficiently and effectively
by analyzing the co-occurrence model of a user/item w.r.t another. This patch aims at providing
an implementation for answering such queries based upon simple co-occurrence counts.

This message is automatically generated by JIRA.
You can reply to this email to add a comment to the issue online.

View raw message