mahout-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Andrew Musselman <andrew.mussel...@gmail.com>
Subject Re: Popularity of recommender items
Date Fri, 14 Feb 2014 20:43:53 GMT
Oh yes.  I do have a small team I could enlist to do things like this; is
there a starting point somewhere on Github, Ted?


On Fri, Feb 14, 2014 at 12:39 PM, Ted Dunning <ted.dunning@gmail.com> wrote:

> Yes!
>
> But it is very hard to find the time.
>
>
>
> On Fri, Feb 14, 2014 at 11:51 AM, Andrew Musselman <
> andrew.musselman@gmail.com> wrote:
>
> > I'd like to see cross-recommendations added too.
> >
> > But I also want some automation of the steps required to build a simple
> > recommender like the solr/mahout example Ted and Ellen have in their
> > pamphlet.
> >
> > Lowering the barrier to entry by providing a sample pipeline would help a
> > lot of folks get started and hopefully would keep them interested.
>  Perhaps
> > in examples/bin?
> >
> >
> > On Fri, Feb 14, 2014 at 10:56 AM, Pat Ferrel <pat@occamsmachete.com>
> > wrote:
> >
> > > There's been work done on the cross-recommender. There is a
> Mahout-style
> > > XRecommenderJob that has two preference models for two actions or
> > > preference types. It uses matrix multiply to get a cooccurrence type
> > > similarity matrix. If we had a cross-row-similarity-job, it could
> pretty
> > > easily be integrated and I'd volunteer to integrate it. The XRSJ is
> > > probably beyond me right now so if we can scare up someone to do that
> > we'd
> > > be a long way down the road.
> > >
> > > I'll put a feature request into Jira and take this to the dev list
> > >
> > > BTW this is already integrated with the solr-recommender.
> > >
> > > On Feb 8, 2014, at 7:19 PM, Ted Dunning <ted.dunning@gmail.com> wrote:
> > >
> > > I have different opinions about each piece.
> > >
> > > I think that cross recommendation is as core as RowSimilarityJob and
> > should
> > > be a parallel implementation or integrated.  Parallel is probably
> easier.
> > > It is even plausible to have a version of RowSimilarityJob that doesn't
> > > support all the different distance measures but does support multiple
> > cross
> > > and direct processing using LLR or related cooccurrence based measures.
> >  It
> > > would be very cool if a single pass over the data could do many kinds
> of
> > co
> > > or cross occurrence operations.
> > >
> > > For dithering, it really is post processing.  That said, it is also the
> > > single largest improvement that anybody typically gets when testing
> > > different options so it is a bit goofy to not have good support for
> some
> > > kinds of dithering.
> > >
> > > For Thompson sampled recommenders, I am not sure where to start hacking
> > on
> > > our current code.
> > >
> > >
> > >
> > >
> > >
> > >
> > > On Sat, Feb 8, 2014 at 4:53 PM, Pat Ferrel <pat@occamsmachete.com>
> > wrote:
> > >
> > > > That was by no means to criticize effort level, which has been
> > impressive
> > > > especially during the release.
> > > >
> > > > It was more a question about the best place to add these things and
> > > > whether they are important. Whether people see these things as custom
> > > post
> > > > processing or core.
> > > >
> > > > On Feb 8, 2014, at 12:13 PM, Ted Dunning <ted.dunning@gmail.com>
> > wrote:
> > > >
> > > > ...
> > > >
> > > > The reason that we aren't adding this like cross-rec and other things
> > is
> > > > that "we" have full-time jobs, mostly.  Suneel is full-time on
> Mahout,
> > > but
> > > > the rest are not.  You seem more active than most.
> > > >
> > > >
> > > >
> > >
> > >
> >
>

Mime
  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message