mahout-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Ted Dunning <ted.dunn...@gmail.com>
Subject Re: Setting up a recommender
Date Mon, 12 Aug 2013 21:32:58 GMT
Yes.  That would be interesting.




On Mon, Aug 12, 2013 at 1:25 PM, Gokhan Capan <gkhncpn@gmail.com> wrote:

> A little digression: Might a Matrix implementation backed by a Solr index
> and uses SolrJ for querying help at all for the Solr recommendation
> approach?
>
> It supports multiple fields of String, Text, or boolean flags.
>
> Best
> Gokhan
>
>
> On Wed, Aug 7, 2013 at 9:42 PM, Pat Ferrel <pat.ferrel@gmail.com> wrote:
>
> > Also a question about user history.
> >
> > I was planning to write these into separate directories so Solr could
> > fetch them from different sources but it occurs to me that it would be
> > better to join A and B by user ID and output a doc per user ID with three
> > fields, id, A item history, and B item history. Other fields could be
> added
> > for users metadata.
> >
> > Sound correct? This is what I'll do unless someone stops me.
> >
> > On Aug 7, 2013, at 11:25 AM, Pat Ferrel <pat@occamsmachete.com> wrote:
> >
> > Once you have a sample or example of what you think the
> > "log file" version will look like, can you post it? It would be great to
> > have example lines for two actions with or without the same item IDs.
> I'll
> > make sure we can digest it.
> >
> > I thought more about the ingest part and I don't think the one-item-space
> > is actually a problem. It just means one item dictionary. A and B will
> have
> > the right content, all I have to do is make sure the right ranks are
> input
> > to the MM,
> > Transpose, and RSJ. This in turn is only one extra count of the # of
> items
> > in A's item space. This should be a very easy change If my thinking is
> > correct.
> >
> >
> > On Aug 7, 2013, at 8:09 AM, Ted Dunning <ted.dunning@gmail.com> wrote:
> >
> > On Tue, Aug 6, 2013 at 7:57 AM, Pat Ferrel <pat.ferrel@gmail.com> wrote:
> >
> > > 4) To add more metadata to the Solr output will be left to the consumer
> > > for now. If there is a good data set to use we can illustrate how to do
> > it
> > > in the project. Ted may have some data for this from musicbrainz.
> >
> >
> > I am working on this issue now.
> >
> > The current state is that I can bring in a bunch of track names and links
> > to artist names and so on.  This would provide the basic set of items
> > (artists, genres, tracks and tags).
> >
> > There is a hitch in bringing in the data needed to generate the logs
> since
> > that part of MB is not Apache compatible.  I am working on that issue.
> >
> > Technically, the data is in a massively normalized relational form right
> > now, but it isn't terribly hard to denormalize into a form that we need.
> >
> >
> >
>

Mime
  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message