mahout-user mailing list archives

From Pat Ferrel <...@occamsmachete.com>
Subject Re: Own recommender
Date Fri, 23 Jan 2015 22:39:10 GMT
spark-itemsimilarity uses SimilarityAnalysis.cooccurrence, which does the majority of the work.
Examples of how to use it and handle I/O are in the CLI driver mahout/spark/src/…/drivers/ItemSimilarityDriver.scala.
All of this is available as a library. Spark makes map-reduce and other parallel
operations almost transparent to the programmer, so the code is easier to use as a
library. Calling Scala from Java is a bit tricky because Scala renames some things
(constructs that don't exist in Java) for access from Java. Calling Java from Scala
is very easy. The two languages are mixed in the Mahout codebase.
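
To make the idea of cooccurrence concrete, here is a minimal plain-Java sketch that counts how many users interacted with each pair of items. It is an illustration only and does not call Mahout: the class name CooccurrenceSketch, the method cooccurrences, and the sample data are all hypothetical, and the real SimilarityAnalysis code runs distributed on Spark and does more than raw counting.

```java
import java.util.Arrays;
import java.util.Map;
import java.util.Set;
import java.util.TreeMap;
import java.util.TreeSet;

// Hypothetical illustration only: this is NOT Mahout's API.
final class CooccurrenceSketch {

    // For every ordered pair of distinct items (a, b), count the users
    // whose interaction set contains both a and b.
    static Map<String, Map<String, Integer>> cooccurrences(Map<String, Set<String>> itemsByUser) {
        Map<String, Map<String, Integer>> counts = new TreeMap<>();
        for (Set<String> items : itemsByUser.values()) {
            for (String a : items) {
                for (String b : items) {
                    if (!a.equals(b)) {
                        counts.computeIfAbsent(a, k -> new TreeMap<>())
                              .merge(b, 1, Integer::sum);
                    }
                }
            }
        }
        return counts;
    }

    public static void main(String[] args) {
        // Made-up interaction data: user ID -> items the user interacted with.
        Map<String, Set<String>> itemsByUser = new TreeMap<>();
        itemsByUser.put("u1", new TreeSet<>(Arrays.asList("iphone", "ipad")));
        itemsByUser.put("u2", new TreeSet<>(Arrays.asList("iphone", "ipad", "nexus")));
        itemsByUser.put("u3", new TreeSet<>(Arrays.asList("nexus", "galaxy")));

        // Print, for each item, the items it cooccurs with and how often.
        System.out.println(cooccurrences(itemsByUser));
    }
}
```

The nested loop is quadratic in the number of items per user, which is fine for a toy example but is exactly the kind of work that benefits from running in parallel on larger data.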

On Jan 21, 2015, at 7:49 AM, Ted Dunning <ted.dunning@gmail.com> wrote:

Juanjo,

Using the Taste components, it will be almost impossible to get really high
performance.  For that, using the itemsimilarity program to feed a search
index is the best alternative.

The Spark version of the itemsimilarity program is written in Scala and
could be called fairly easily as a library.  The older map-reduce version
is not easily used as a library.


On Wed, Jan 21, 2015 at 2:20 AM, Juanjo Ramos <jjarmos@gmail.com> wrote:

> Hi Manuel,
> Thanks for the update.
> 
> I'm using Mahout in a simple Java application myself. Following Ted's
> comment a few posts back, I was just concerned about the performance.
> 
> Is performance the only concern when using Taste, or has the algorithm's
> implementation also been improved in the current implementations
> accessible via the CLI?
> 
> Thanks.
> 
> On Wed, Jan 21, 2015 at 10:14 AM, Manuel Blechschmidt <
> Manuel.Blechschmidt@gmx.de> wrote:
> 
>> Hi Juan,
>> 
>>> On 21.01.2015, at 11:05, Juanjo Ramos <jjarmos@gmail.com> wrote:
>>> 
>>> Thanks Pat for the resources.
>>> 
>>> Please correct me if I'm wrong but all Mahout's latest tools are command
>>> line tools only, is that correct?
>> 
>> Yes, this is kind of correct. All tools are command line based. There was
>> some development for an interactive console similar to R
>> 
>> https://issues.apache.org/jira/browse/MAHOUT-1489
>>> I was wondering if there is a library
>>> with the latest implementation that can be used in a Java or Scala
>>> project?
>> 
>> The following project uses Mahout in a full blown simple Java EE
>> application:
>> 
>> https://github.com/ManuelB/facebook-recommender-demo
>>> 
>>> Best.
>> 
>> /Manuel
>> 
>> --
>> Manuel Blechschmidt
>> Twitter: http://twitter.com/Manuel_B
>> 
>> 
> 

