mahout-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Joshua Bronson <jabron...@gmail.com>
Subject Re: mahout for news recommendation?
Date Wed, 01 Apr 2009 20:58:54 GMT
Oh, it just occurred to me I should have said that /usr/local/melk/mahout is
a checkout of http://svn.apache.org/repos/asf/lucene/mahout/trunk/. When I
got to step 2, which says...

> 2. Unpack the archive and copy movies.dat and ratings.dat to
> trunk/taste-web/src/main/resources/org/apache/mahout/cf/taste/example/grouplens under
> the Mahout distribution directory.


...I assumed the instructions had left out the step of running "svn
checkout http://svn.apache.org/repos/asf/lucene/mahout/trunk/". Was this
assumption incorrect?

I did have to "mkdir
-p trunk/taste-web/src/main/resources/org/apache/mahout/cf/taste/example/grouplens"
before I could copy the .dat files there as
the trunk/taste-web/src/main/resources directory of the checkout doesn't
contain anything in it. Did I go off on the wrong track?

On Wed, Apr 1, 2009 at 4:27 PM, Joshua Bronson <jabronson@gmail.com> wrote:

> You mean you're supposed to do step 4 *before* step 8?!? ;p
> I did run mvn install, and though I got a bunch of warnings like the
> following:
>
> [WARNING] Entry:
>> mahout-0.2-SNAPSHOT/usr/local/melk/mahout/core/src/main/java/org/apache/mahout/cf/taste/impl/common/
>> longer than 100 characters.
>>
>
> after a couple hours it said it completed successfully:
>
> [INFO]
>> ------------------------------------------------------------------------
>>
> [INFO] Reactor Summary:
>>
> [INFO]
>> ------------------------------------------------------------------------
>>
> [INFO] Mahout core ........................................... SUCCESS
>> [8:46.665s]
>>
> [INFO] Mahout Taste Webapp ................................... SUCCESS
>> [55.496s]
>>
> [INFO] Mahout examples ....................................... SUCCESS
>> [55.317s]
>>
> [INFO] Apache Lucene Mahout .................................. SUCCESS
>> [2:02:03.392s]
>>
> [INFO]
>> ------------------------------------------------------------------------
>>
> [INFO]
>> ------------------------------------------------------------------------
>>
> [INFO] BUILD SUCCESSFUL
>>
> [INFO]
>> ------------------------------------------------------------------------
>>
> [INFO] Total time: 132 minutes 41 seconds
>>
> [INFO] Finished at: Wed Apr 01 00:59:27 EDT 2009
>>
> [INFO] Final Memory: 61M/80M
>>
> [INFO]
>> ------------------------------------------------------------------------
>>
>
>
> So I proceeded through steps 5, 6, and 7, and then step 8's "mvn package"
> command failed with the output I linked to.
>
> Just for the heck of it I tried "mvn install" again (from the top-level
> directory) and after getting a bunch of the "longer-than-100-characters"
> warnings again, this time after 7 minutes it failed with:
>
> [ERROR] BUILD ERROR
>>
> [INFO]
>> ------------------------------------------------------------------------
>>
> [INFO] Failed to create assembly: Error creating assembly archive project:
>> A tar file cannot include itself.
>>
>
>
> I posted the full transcript of my console session at
> http://melkjug.org/_static/grouplens-install-log.txt. Seems like something
> funky's going on with tar, but I'm not sure what.
>
>
> On Wed, Apr 1, 2009 at 12:11 PM, Grant Ingersoll <gsingers@apache.org>wrote:
>
>> Do a "mvn install" from the top level directory first:
>> http://lucene.apache.org/mahout/taste.html#demo
>>
>> HTH,
>> Grant
>>
>>
>> On Apr 1, 2009, at 11:35 AM, Joshua Bronson wrote:
>>
>>  Thanks all for the good info. Taste definitely sounds like a promising
>>> direction for us to go in for our recommendation service.
>>> I'm working through the installation of the GroupLens demo, but the mvn
>>> package step is failing with the output at
>>> http://paste.pocoo.org/show/110618/. Haven't looked into this yet, just
>>> thought I'd post to the list first with my progress. If anyone else uses
>>> IRC, I've created (and am currently the only one in) the #mahout channel
>>> on
>>> freenode. Hope to see some of you in there!
>>>
>>> Josh
>>>
>>> On Wed, Apr 1, 2009 at 5:48 AM, Sean Owen <srowen@gmail.com> wrote:
>>>
>>>  Couple clarifications -
>>>>
>>>> The CF components are oriented to on-line, real-time use, though of
>>>> course
>>>> one can trivially build a batch job out of that. That is what I did with
>>>> the
>>>> EC2 image that cranks out recommendations for all users.
>>>>
>>>> The CF component is also already parallelized as much as is practical.
>>>> There
>>>> are already Hadoop jobs for parallel, batch operation.
>>>>
>>>> Finally if you have some external notion of item similarity, like text
>>>> similarity between articles, you can and should include this info by
>>>> creating an ItemSimilarity with this knowledge. In that case you want to
>>>> use
>>>> an item-based recommender, since it is only in such a case that
>>>> item-based
>>>> recommenders have a distinct advantage.
>>>>
>>>> On Apr 1, 2009 10:32 AM, "Otis Gospodnetic" <otis_gospodnetic@yahoo.com
>>>> >
>>>> wrote:
>>>>
>>>>
>>>> it's the former.  Taste is still not parallelized, but other parts of
>>>> Mahout
>>>> are, and they make use of Hadoop.
>>>>
>>>>
>> --------------------------
>> Grant Ingersoll
>> http://www.lucidimagination.com/
>>
>> Search the Lucene ecosystem (Lucene/Solr/Nutch/Mahout/Tika/Droids) using
>> Solr/Lucene:
>> http://www.lucidimagination.com/search
>>
>>
>

Mime
  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message