mahout-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Ankur (JIRA)" <j...@apache.org>
Subject [jira] Commented: (MAHOUT-103) Co-occurence based nearest neighbourhood
Date Thu, 12 Nov 2009 10:15:39 GMT

    [ https://issues.apache.org/jira/browse/MAHOUT-103?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12776939#action_12776939
] 

Ankur commented on MAHOUT-103:
------------------------------

>Re-post an updated patch ....

Sure I'll have the updated code coming by early next week.

>If it's basically sound I'd like to mention it ....

+10, The more people know about it the better chances it has of being used :-)  

>I use the GroupLens, Jester, Netflix data sets regularly. Indeed, just drop the rating
...

Simply dropping the rating might introduce too much noise. I was thinking of keeoing only
those that have ratings > 2.5 (or 2 to be more liberal). 

> Co-occurence based nearest neighbourhood
> ----------------------------------------
>
>                 Key: MAHOUT-103
>                 URL: https://issues.apache.org/jira/browse/MAHOUT-103
>             Project: Mahout
>          Issue Type: New Feature
>          Components: Collaborative Filtering
>            Reporter: Ankur
>            Assignee: Ankur
>         Attachments: jira-103.patch
>
>
> Nearest neighborhood type queries for users/items can be answered efficiently and effectively
by analyzing the co-occurrence model of a user/item w.r.t another. This patch aims at providing
an implementation for answering such queries based upon simple co-occurrence counts.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


Mime
View raw message