Return-Path: Delivered-To: apmail-lucene-java-user-archive@www.apache.org Received: (qmail 70291 invoked from network); 16 Dec 2009 02:08:46 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (140.211.11.3) by minotaur.apache.org with SMTP; 16 Dec 2009 02:08:46 -0000 Received: (qmail 66043 invoked by uid 500); 16 Dec 2009 02:08:44 -0000 Delivered-To: apmail-lucene-java-user-archive@lucene.apache.org Received: (qmail 65951 invoked by uid 500); 16 Dec 2009 02:08:44 -0000 Mailing-List: contact java-user-help@lucene.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: java-user@lucene.apache.org Delivered-To: mailing list java-user@lucene.apache.org Received: (qmail 65941 invoked by uid 99); 16 Dec 2009 02:08:44 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 16 Dec 2009 02:08:44 +0000 X-ASF-Spam-Status: No, hits=-2.6 required=5.0 tests=BAYES_00,HTML_MESSAGE X-Spam-Check-By: apache.org Received-SPF: pass (athena.apache.org: domain of ww.wang.cs@gmail.com designates 209.85.211.185 as permitted sender) Received: from [209.85.211.185] (HELO mail-yw0-f185.google.com) (209.85.211.185) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 16 Dec 2009 02:08:42 +0000 Received: by ywh15 with SMTP id 15so583729ywh.5 for ; Tue, 15 Dec 2009 18:08:21 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=gamma; h=domainkey-signature:mime-version:received:in-reply-to:references :date:message-id:subject:from:to:content-type; bh=D0gYGn0sP8Ei2Gz515qcSVnPACJni5dEQyTKQj/yjqc=; b=eQuhJzOKsBVpH64xVCcy8IK0y5iIR76AFHsRIgypvMO/1zVyqu+SjPE1KtXO/0yajw /1WvYQDRqfDeHNMPlf/p1HGMrN2c5QF9XumZOxFzsWxckKL5lfqJuzgowUWAWJpjNJeh aPBL6e9lbLmtv6iEftyzrUAIey5DmZr/B/Ux0= DomainKey-Signature: a=rsa-sha1; c=nofws; d=gmail.com; s=gamma; h=mime-version:in-reply-to:references:date:message-id:subject:from:to :content-type; b=Yt2T1sI6xqYPBQZC4gOKSFL9bYUkMPhgygi6GKKFZQ8hU6Mb7G7dSu8DORAjRAUrFo /jphVH3ExUGvjRnlSuQ8CsSvDO9bRZIT1iGxdimjLZbfV9g+LWIYpq3xKA6x9Lmy3ik9 P5Zs+gESE0bOJapFkzbDuK1jGUi33WKpczoRs= MIME-Version: 1.0 Received: by 10.90.11.30 with SMTP id 30mr543960agk.42.1260929301112; Tue, 15 Dec 2009 18:08:21 -0800 (PST) In-Reply-To: <76c1202b0912151747n19021b03n998154d3843d43f7@mail.gmail.com> References: <76c1202b0912142023i683e2596mf99b1f7def49b233@mail.gmail.com> <76c1202b0912151747n19021b03n998154d3843d43f7@mail.gmail.com> Date: Wed, 16 Dec 2009 10:08:20 +0800 Message-ID: <7d94dcde0912151808t2b07d482u9dd590db96f6c81a@mail.gmail.com> Subject: Re: Document category identification in query From: Weiwei Wang To: java-user@lucene.apache.org Content-Type: multipart/alternative; boundary=00163616421f1465f8047acefb1d --00163616421f1465f8047acefb1d Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: quoted-printable I think you can do this with search suggestion like algorithms. First, you should categorize the search log, e.g. Thai Restaurant or Chines= e Restaurant or KFC should be assigned categories including Restaurant. When user is typing, figure out from the search log which keyword is neares= t to the input and take that keyword's categories as the user input's category. BTW, I do not understand why you need to know the category of user input On Wed, Dec 16, 2009 at 9:47 AM, Alex wrote: > Can anybody help me or maybe point me to relevant resources I could learn > from ? > > Thanks. > --=20 Weiwei Wang Alex Wang =E7=8E=8B=E5=B7=8D=E5=B7=8D Room 403, Mengmin Wei Building Computer Science Department Gulou Campus of Nanjing University Nanjing, P.R.China, 210093 Homepage: http://cs.nju.edu.cn/rl/weiweiwang --00163616421f1465f8047acefb1d--