Return-Path: X-Original-To: apmail-mahout-user-archive@www.apache.org Delivered-To: apmail-mahout-user-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 1BEC9EA40 for ; Sat, 16 Feb 2013 23:12:53 +0000 (UTC) Received: (qmail 26318 invoked by uid 500); 16 Feb 2013 23:12:51 -0000 Delivered-To: apmail-mahout-user-archive@mahout.apache.org Received: (qmail 26266 invoked by uid 500); 16 Feb 2013 23:12:51 -0000 Mailing-List: contact user-help@mahout.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@mahout.apache.org Delivered-To: mailing list user@mahout.apache.org Received: (qmail 26257 invoked by uid 99); 16 Feb 2013 23:12:51 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Sat, 16 Feb 2013 23:12:51 +0000 X-ASF-Spam-Status: No, hits=-0.7 required=5.0 tests=RCVD_IN_DNSWL_LOW,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (nike.apache.org: domain of ted.dunning@gmail.com designates 209.85.220.54 as permitted sender) Received: from [209.85.220.54] (HELO mail-pa0-f54.google.com) (209.85.220.54) by apache.org (qpsmtpd/0.29) with ESMTP; Sat, 16 Feb 2013 23:12:42 +0000 Received: by mail-pa0-f54.google.com with SMTP id fa10so2200679pad.13 for ; Sat, 16 Feb 2013 15:12:20 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=x-received:references:in-reply-to:mime-version :content-transfer-encoding:content-type:message-id:cc:x-mailer:from :subject:date:to; bh=RTcMssgDk5y5eXVcyEBeDGOvI+nYdIelC6E4ookiJUo=; b=gmy2zl6gr5qJJAp3FUmUo54X1bT9uwI74U6NQpFekoQ5KDkM9DpRmmZ0b25QNU2i7x 7iANqLFs631sSn7B7aUZeWfTfhnb65G9ze0n0SAubr0idKEUtG02Pk4Zq4i17d2iA4EA MfhWyS9b/03V9VmFCLUtAiAS0MyIIAhHdZpP9wGFCJjCTCSeoA6Q9+36v+9NVsAS7iP3 ezsYqUrxriv0yo73hScjrtvhD+s3nUlKueXuzVST9WI60Ro8l0SirTnxx+HEp2QiEfYx 5u/hPOd0EFJIo97JQZFrw6bIPSsLDU+Teey3zq1OMzFfVs/6hv4vdryRNGnbthkdfRf1 wpvQ== X-Received: by 10.66.82.67 with SMTP id g3mr23953860pay.58.1361056340610; Sat, 16 Feb 2013 15:12:20 -0800 (PST) Received: from [10.16.69.78] (mobile-166-137-181-107.mycingular.net. [166.137.181.107]) by mx.google.com with ESMTPS id km10sm9233330pbc.9.2013.02.16.15.12.17 (version=TLSv1 cipher=ECDHE-RSA-RC4-SHA bits=128/128); Sat, 16 Feb 2013 15:12:19 -0800 (PST) References: <1361039400.65635.YahooMailNeo@web140003.mail.bf1.yahoo.com> <1361042819.19712.YahooMailNeo@web140001.mail.bf1.yahoo.com> In-Reply-To: Mime-Version: 1.0 (1.0) Content-Transfer-Encoding: quoted-printable Content-Type: text/plain; charset=us-ascii Message-Id: Cc: "user@mahout.apache.org" X-Mailer: iPhone Mail (9A406) From: Ted Dunning Subject: Re: Problems with Mahout's RecommenderIRStatsEvaluator Date: Sat, 16 Feb 2013 16:12:11 -0700 To: "user@mahout.apache.org" X-Virus-Checked: Checked by ClamAV on apache.org There are a variety of common time based effects which make time splits best= in many practical cases. Having the training data all be from the past emu= lates this better than random splits.=20 For one thing, you can have the same user under different names in training a= nd test. For another thing, in real life you get data from the past of the u= ser under consideration. As a third consideration, topical events can influe= nce all users in common. =20 These all mean that random training splits can have very large error in esti= mated performance.=20 Sent from my iPhone On Feb 16, 2013, at 1:41 PM, Tevfik Aytekin wrote= : > What I mean is you can choose ratings randomly and try to recommend > the ones above the threshold >=20 > On Sat, Feb 16, 2013 at 10:32 PM, Sean Owen wrote: >> Sure, if you were predicting ratings for one movie given a set of ratings= >> for that movie and the ratings for many other movies. That isn't what the= >> recommender problem is. Here, the problem is to list N movies most likely= >> to be top-rated. The precision-recall test is, in turn, a test of top N >> results, not a test over prediction accuracy. We aren't talking about RMS= E >> here or even any particular means of generating top N recommendations. Yo= u >> don't even have to predict ratings to make a top N list. >>=20 >>=20 >> On Sat, Feb 16, 2013 at 9:28 PM, Tevfik Aytekin wrote: >>=20 >>> No, rating prediction is clearly a supervised ML problem >>>=20 >>> On Sat, Feb 16, 2013 at 10:15 PM, Sean Owen wrote: >>>> This is a good answer for evaluation of supervised ML, but, this is >>>> unsupervised. Choosing randomly is choosing the 'right answers' randoml= y, >>>> and that's plainly problematic. >>>>=20 >>>>=20 >>>> On Sat, Feb 16, 2013 at 8:53 PM, Tevfik Aytekin < >>> tevfik.aytekin@gmail.com>wrote: >>>>=20 >>>>> I think, it is better to choose ratings of the test user in a random >>>>> fashion. >>>>>=20 >>>>> On Sat, Feb 16, 2013 at 9:37 PM, Sean Owen wrote: >>>>>> Yes. But: the test sample is small. Using 40% of your data to test is= >>>>>> probably quite too much. >>>>>>=20 >>>>>> My point is that it may be the least-bad thing to do. What test are >>> you >>>>>> proposing instead, and why is it coherent with what you're testing? >>>>>>=20 >>>>>=20 >>>=20