mahout-user mailing list archives

From Ted Dunning <ted.dunn...@gmail.com>
Subject Re: Email and Collab. Filtering
Date Mon, 22 Aug 2011 15:29:52 GMT
On Mon, Aug 22, 2011 at 8:21 AM, Daniel Xiaodan Zhou <danithaca@gmail.com> wrote:

> I think this is reasonable. Some suggestions:
>
> 1. Instead of using the total number of interactions as cell value, map the
> number to a 1-5 score based on histogram
>

I would map to {0,1} rather than a fake rating scale.
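
A minimal sketch of the {0,1} mapping, assuming the interactions arrive as (sender, recipient, count) tuples. The function name, the threshold parameter, and the sample data are hypothetical and only for illustration; this is not Mahout code.

```python
def binarize(interactions, threshold=1):
    """Map raw interaction counts to {0,1} preferences.

    Pairs whose count reaches `threshold` get value 1 and are stored;
    every pair that is not stored is implicitly 0.
    """
    prefs = {}
    for user, item, count in interactions:
        if count >= threshold:
            prefs.setdefault(user, set()).add(item)
    return prefs


# Hypothetical sample data: (sender, recipient, number of emails).
interactions = [
    ("alice", "bob", 12),
    ("alice", "carol", 1),
    ("bob", "carol", 0),
    ("dave", "bob", 3),
]

prefs = binarize(interactions)
for user in sorted(prefs):
    print(user, sorted(prefs[user]))
```

Storing only the 1s keeps the representation as sparse as the data itself, and it avoids implying that 12 emails means "rated 5 stars".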


> 2. Use item-item algorithm, which is supposed to work for sparse data.
>

Item-item sort of works with sparse data.
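
To make the caveat concrete, here is a rough sketch of item-item similarity on {0,1} data. It assumes the Tanimoto (Jaccard) coefficient as the similarity measure, which is my choice for illustration and not something specified in this thread; the function name and the sample data are hypothetical.

```python
from itertools import combinations


def item_item_tanimoto(prefs):
    """Tanimoto (Jaccard) similarity for every pair of items that
    shares at least one user. `prefs` maps user -> set of items."""
    # Invert user -> items into item -> users.
    users_by_item = {}
    for user, items in prefs.items():
        for item in items:
            users_by_item.setdefault(item, set()).add(user)

    sims = {}
    for a, b in combinations(sorted(users_by_item), 2):
        users_a, users_b = users_by_item[a], users_by_item[b]
        overlap = len(users_a & users_b)
        if overlap:  # pairs with no common user get no entry at all
            sims[(a, b)] = overlap / len(users_a | users_b)
    return sims


# Hypothetical binary preferences.
prefs = {
    "u1": {"x", "y"},
    "u2": {"x", "y", "z"},
    "u3": {"z"},
}

sims = item_item_tanimoto(prefs)
for pair in sorted(sims):
    print(pair, round(sims[pair], 3))
```

With very sparse data, most item pairs have no user in common and so get no similarity at all, and the pairs that do often rest on one or two users. That is one way to read the "sort of".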


> 3. I think the best algorithm to handle sparse data is the SVD algorithm.
>

Yes.
