From user-return-3731-apmail-mahout-user-archive=mahout.apache.org@mahout.apache.org Thu Jun 17 13:15:23 2010 Return-Path: Delivered-To: apmail-mahout-user-archive@www.apache.org Received: (qmail 29055 invoked from network); 17 Jun 2010 13:15:23 -0000 Received: from unknown (HELO mail.apache.org) (140.211.11.3) by 140.211.11.9 with SMTP; 17 Jun 2010 13:15:23 -0000 Received: (qmail 2534 invoked by uid 500); 17 Jun 2010 07:48:43 -0000 Delivered-To: apmail-mahout-user-archive@mahout.apache.org Received: (qmail 2243 invoked by uid 500); 17 Jun 2010 07:48:40 -0000 Mailing-List: contact user-help@mahout.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@mahout.apache.org Delivered-To: mailing list user@mahout.apache.org Received: (qmail 2235 invoked by uid 99); 17 Jun 2010 07:48:40 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 17 Jun 2010 07:48:40 +0000 X-ASF-Spam-Status: No, hits=2.2 required=10.0 tests=FREEMAIL_FROM,HTML_MESSAGE,RCVD_IN_DNSWL_NONE,SPF_PASS,T_TO_NO_BRKTS_FREEMAIL X-Spam-Check-By: apache.org Received-SPF: pass (nike.apache.org: domain of gkhncpn@gmail.com designates 74.125.83.42 as permitted sender) Received: from [74.125.83.42] (HELO mail-gw0-f42.google.com) (74.125.83.42) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 17 Jun 2010 07:48:32 +0000 Received: by gwj21 with SMTP id 21so523119gwj.1 for ; Thu, 17 Jun 2010 00:48:11 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=gamma; h=domainkey-signature:mime-version:received:received:in-reply-to :references:date:message-id:subject:from:to:content-type; bh=vofODzTZgb30yEigFNX8S6Ag22JGt1oEPEz438kNw4w=; b=xpEiwn/hRsC60cROnVbDsTjx3zljkZ4brHMvYCcuOHQMWh/pwgcrBMosAzcQtslui6 zRUZNuVDo+O9hZPBTADAPzUYA0Oig+v+zl0lSGwflmOggMuLb8YzWMamgO8YbFG1Efnh 2GE894+dH0mN2EEviBsCPmGLcs/LI7cNZmKAA= DomainKey-Signature: a=rsa-sha1; c=nofws; d=gmail.com; s=gamma; h=mime-version:in-reply-to:references:date:message-id:subject:from:to :content-type; b=Upu79SEzOgjMnhv49EdkOJffSXfKHquqruBRPcHKIssaKySwy4QwQsi6F37/F2bwT2 6++RclFc79cn96wwdKfw3jiJgNaEXCvzCeZhUKZLsDZ9ITBfQLBXsI6CChdEVGvE9Bnm +cYcCBRxHC0WCWwSduChkNPIhxFNaquK/E8s4= MIME-Version: 1.0 Received: by 10.101.5.22 with SMTP id h22mr8333498ani.104.1276760889542; Thu, 17 Jun 2010 00:48:09 -0700 (PDT) Received: by 10.100.209.3 with HTTP; Thu, 17 Jun 2010 00:48:09 -0700 (PDT) In-Reply-To: References: Date: Thu, 17 Jun 2010 10:48:09 +0300 Message-ID: Subject: Re: Predicting Successor Item From: =?UTF-8?B?R8O2a2hhbiDDh2FwYW4=?= To: user@mahout.apache.org Content-Type: multipart/alternative; boundary=00504502d82448c7700489350f75 X-Virus-Checked: Checked by ClamAV on apache.org --00504502d82448c7700489350f75 Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: quoted-printable Sean, Ted, Thanks for sharing your ideas. I will try things we have talked about, compare with current recommendation techniques in terms of performance and accuracy, and share the results with all Mahouts. On Wed, Jun 16, 2010 at 7:05 PM, Ted Dunning wrote: > I would follow Sean's suggestion and try simpler methods first. My guess > is > that the important structure of the HMM may be much easier to learn by > sparsification techniques. > > Sequence aware methods also have potential for harm in that they may just > be > reverse-engineer your current link structure. Reverse links might well b= e > just as useful to the customer. > > I generally subtract the users history from recommendations in any case. > That gives most of the benefit of sequence based recommenders (in the > areas > of most historical interest for me ... music and videos). Even without > direct sequence information, I found that a simple item based recommende= r > would discern the episodic structure of video series without any problem. > The top link would be the next episode and the second best link would > typically be the previous episode, but if the person had just come from t= he > previous episode, it wouldn't appear. > > On Wed, Jun 16, 2010 at 12:44 AM, G=C3=B6khan =C3=87apan wrote: > > > My first HMM ideas came > > after this, but Sean said that it may be overkill. What do you think > about > > HMM, is it worth to try? > > > --=20 G=C3=B6khan =C3=87apan --00504502d82448c7700489350f75--