Date: Wed, 15 Jul 2009 13:24:58 +0200
Subject: Re: problems with taste.
From: Laya Patwa
To: mahout-user@lucene.apache.org

Did you get any recommendations?
The preferences are different for each user in tdata.txt. In testdata.csv the
preferences are all 1, but the items are different for each user.
I think the Pearson correlation is working, because the nearest n user list
is working and the code generates the nearest n user list.

On Wed, Jul 15, 2009 at 12:16 PM, Sean Owen wrote:

> OK, hmm well it was kind of a long shot anyway. It should have worked even
> so.
>
> I see one data file attached. There are some blank lines at the top,
> though I don't think that will matter.
>
> You are putting quotes around the names. That means the item IDs have
> quotes in their names, which is not what I think you intend. For
> example, you do not have an item named 'red', you have an item named
> '"red"' in your model. If you are looking for recommendations that
> include the item 'red' you will not find any. But somehow I suspect
> this is not the problem you are talking about.
>
> What algorithm are you using -- one involving a correlation-based
> similarity metric like Pearson?
> I ask because most of your ratings have exactly the same value, which
> will cause correlations to be undefined in some cases. You also have
> relatively little data. It may be that there are simply few or no
> defined similarities between users in the model and so no
> recommendations can be made.
>
> Add more, or more realistic, data and you should perhaps see better
> results.
>
> But I still then can't explain why two copies of the same file give
> different results. I might have to see the code.
> Yes, do not send anything confidential.
>
> On Wed, Jul 15, 2009 at 11:10 AM, Laya Patwa wrote:
> > Hey!
> > I followed your instructions. It doesn't work even when I put each file
> > in a separate directory.
> > Maybe you should have a look at the code and the data files. I am
> > attaching 2 of the data files. I need to get permission for the 3rd one
> > (it is also a bit larger). The code is almost the same as given in the
> > documentation. It is giving recommendations for the file testdata.csv.
> > Cheers,
> > Laya
> >
> > On Wed, Jul 15, 2009 at 11:43 AM, Sean Owen wrote:
> >>
> >> Hmm, I might have guessed there is some file encoding issue, related
> >> to line breaks, since you say copying the file "breaks" it. But that
> >> would explain, I think, why a copy would *work* rather than fail.
> >>
> >> One thing to be careful of is that FileDataModel tries to be clever
> >> and allow you to post incremental updates to the data file by placing
> >> similarly-named files in the same directory. How have you named your
> >> files? To rule this out, put the files in separate directories, just
> >> to make sure.
> >>
> >> Otherwise perhaps you can send me a sample of the data file or a
> >> sample of your code to see what is going on.
> >>
> >> On Wed, Jul 15, 2009 at 10:23 AM, Laya Patwa wrote:
> >> > Hi!
> >> > I am a student and right now I am working on a project named CoEUD.
> >> > My task is to build a recommender system and I am using the Taste
> >> > recommender library that comes with Mahout.
> >> >
> >> > I downloaded the code from Subversion and installed it using Maven and
> >> > Cygwin on Windows and also on Mac OS. I tried the GroupLens demo that
> >> > is given in the Taste documentation and it worked. Then I tried the
> >> > example for the user-based recommender, but it is giving some problems
> >> > as follows:
> >> > 1) I have 3 different data files with me and they are in CSV format,
> >> > having values of userID, itemID and preference. The recommendations
> >> > are generated for 1 data file and not for the others. There are no
> >> > errors. The program runs and generates an empty recommendation list
> >> > for the other data files.
> >> > 2) Another peculiar thing that is happening is that when I make a copy
> >> > of the data file for which the user-based recommender example is
> >> > working and use this copy, the recommendation list generated is empty.
> >> >
> >> > I am stuck with the above 2 problems and can't figure out why the
> >> > recommendations are not generated for all the data files. Can you
> >> > please help me out?
> >> >
> >> > I am using Eclipse.
> >> >
> >> > Cheers,
> >> > Laya
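
Two points raised in the thread above can be checked outside Mahout: a Pearson correlation is undefined when one user's ratings are all identical, and a naive comma split keeps the quote characters as part of the item ID. The following is a minimal Python sketch of both, not Mahout's implementation. The function names and the sample rows are hypothetical, made up for illustration, and `item_ids_naive` is only a stand-in for any parser that does not strip quotes.

```python
import csv
import io
import math


def pearson(xs, ys):
    """Pearson correlation of two equal-length rating lists.

    Returns None when the value is undefined, i.e. when either list
    has zero variance (for example, every preference equals 1).
    """
    n = len(xs)
    if n == 0 or n != len(ys):
        return None
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        # Division by zero: no correlation can be computed.
        return None
    return cov / math.sqrt(var_x * var_y)


def item_ids_naive(text):
    """Split each line on commas; quotes stay inside the item ID."""
    return [line.split(",")[1] for line in text.splitlines() if line.strip()]


def item_ids_csv(text):
    """Parse with the csv module, which removes the surrounding quotes."""
    return [row[1] for row in csv.reader(io.StringIO(text)) if row]


# Hypothetical rows in userID,itemID,preference form (not the real data files).
SAMPLE = 'u1,"red",1\nu1,"blue",1\n'

if __name__ == "__main__":
    print(pearson([1, 1, 1], [1, 1, 1]))  # undefined: all ratings identical
    print(pearson([1, 2, 3], [2, 4, 6]))  # defined: ratings vary
    print(item_ids_naive(SAMPLE))         # IDs still carry the quote characters
    print(item_ids_csv(SAMPLE))           # IDs without quotes
```

If most users in a data file have identical preference values, a check like `pearson(...) is None` on a few user pairs is a quick way to see whether any similarities can be defined at all.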