Date: Wed, 27 Jul 2011 09:44:06 +0100
Subject: Re: Advice request
From: Sean Owen
To: user@mahout.apache.org
Cc: Srinivas Kasturi

Sounds good. In that case, the surprise-n-coincidence counterpart you are
probably looking for is LogLikelihoodSimilarity, which implements
ItemSimilarity. Use it with a GenericBooleanPrefItemBasedRecommender and you
can recommend new words to use.

On Wed, Jul 27, 2011 at 9:01 AM, Ted Dunning wrote:

> Actually, I think that recommending words to people and then doing the
> search may add some mileage.
>
> On Wed, Jul 27, 2011 at 12:38 AM, Sean Owen wrote:
>
> > It's just a search problem as Ted says -- minus
> > even the recommendation phase.
> >
> > Is that all you want? Then try Lucene, probably.
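
For readers unfamiliar with the statistic behind the LogLikelihoodSimilarity suggestion in this reply, here is a minimal, self-contained sketch of a log-likelihood ratio computed from a 2x2 co-occurrence table of two words. It is not Mahout's code: the class name LlrSketch, the method names, and the example counts are all hypothetical, and the final mapping of the ratio into a 0..1 score is just one common squashing choice, assumed here for illustration.

```java
public class LlrSketch {

    // x * ln(x), with the usual convention that 0 * ln(0) = 0.
    static double xLogX(long x) {
        return x == 0 ? 0.0 : x * Math.log(x);
    }

    // Unnormalized Shannon entropy of a set of counts: N*ln(N) - sum(k*ln(k)).
    static double entropy(long... counts) {
        long total = 0;
        double sum = 0.0;
        for (long c : counts) {
            sum += xLogX(c);
            total += c;
        }
        return xLogX(total) - sum;
    }

    // Log-likelihood ratio (G^2) for a 2x2 contingency table:
    //   k11 = users who used both word A and word B
    //   k12 = users who used A but not B
    //   k21 = users who used B but not A
    //   k22 = users who used neither
    static double logLikelihoodRatio(long k11, long k12, long k21, long k22) {
        double rowEntropy = entropy(k11 + k12, k21 + k22);
        double columnEntropy = entropy(k11 + k21, k12 + k22);
        double matrixEntropy = entropy(k11, k12, k21, k22);
        // Clamp at zero so floating-point rounding cannot yield a negative score.
        return Math.max(0.0, 2.0 * (rowEntropy + columnEntropy - matrixEntropy));
    }

    // Assumed squashing of the ratio into [0, 1); larger means more surprising.
    static double similarity(long k11, long k12, long k21, long k22) {
        double llr = logLikelihoodRatio(k11, k12, k21, k22);
        return 1.0 - 1.0 / (1.0 + llr);
    }

    public static void main(String[] args) {
        // Hypothetical counts: two words used together far more often than chance.
        System.out.println(similarity(40, 10, 10, 940));
        // Hypothetical counts: two words whose usage is statistically independent.
        System.out.println(similarity(10, 10, 10, 10));
    }
}
```

A score near zero means the two words co-occur about as often as chance predicts; a large ratio flags a surprising coincidence, which is what makes the pair a useful basis for recommending one word to users of the other.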