From: Dawid Weiss
Date: Fri, 6 May 2011 11:30:14 +0200
Subject: Re: Using Solr's (Auto)suggest with plain lucene
To: java-user@lucene.apache.org

So you'd like case-sensitive lookup rather than case-insensitive lookup.
FSTLookup takes characters from the input as they are and performs no
normalization, so if you feed it cased terms, you'll get cased suggestions
back. If you need cased suggestions but provide normalized (lowercased)
prefixes, you'll get nothing back, although such a feature would be
relatively easy to implement on top of the automaton code currently in SVN.

Dawid

On Fri, May 6, 2011 at 11:18 AM, Clemens Wyss wrote:

> Thx Dawid,
> Could you be more precise on 2.? I'd like to leave the suggested terms
> untouched, i.e. cased.
>
> Clemens
>
> > -----Original Message-----
> > From: Dawid Weiss [mailto:dawid.weiss@gmail.com]
> > Sent: Friday, 6 May 2011 11:12
> > To: java-user@lucene.apache.org
> > Subject: Re: Using Solr's (Auto)suggest with plain lucene
> >
> > 1. FSTLookup is in the trunk only, it has not been released.
> > 2. Case-insensitive lookups are possible: you'd just need to build the
> >    dictionary from normalized tokens and then apply the same
> >    normalization when you query for suggestions.
> > 3. "http://search-lucene.com/m/586gA4ccL11". I have no idea.
> >
> > Dawid
> >
> > On Fri, May 6, 2011 at 11:06 AM, Clemens Wyss wrote:
> >
> > > I have come across TSTLookup.
> > > In which jar do I find FSTLookup? Do any of these lookup classes
> > > allow case-insensitive lookups? My terms are (unfortunately) cased.
> > >
> > > What Lookup "sits" behind this suggester
> > > http://search-lucene.com/m/586gA4ccL11
> > > here?
> > >
> > > > -----Original Message-----
> > > > From: Dawid Weiss [mailto:dawid.weiss@gmail.com]
> > > > Sent: Thursday, 5 May 2011 15:00
> > > > To: java-user@lucene.apache.org
> > > > Subject: Re: Using Solr's (Auto)suggest with plain lucene
> > > >
> > > > If you check out the source code of solr/lucene, look at the
> > > > FSTLookup class and FSTLookupTest: you can populate FSTLookup
> > > > manually with terms/phrases from your index and then use the
> > > > resulting automaton for suggestions.
> > > >
> > > > Dawid
> > > >
> > > > On Thu, May 5, 2011 at 2:54 PM, Clemens Wyss wrote:
> > > >
> > > > > I have implemented my index (in fact it's a pluggable indexing
> > > > > API) in "plain Lucene". I tried to implement a term suggestion
> > > > > mechanism on my own, but I am not too happy with it so far.
> > > > > At
> > > > > http://search-lucene.com/m/0QBv41ssGlh/suggestion&subj=Auto+Suggest
> > > > > I have seen Solr's auto suggestion for search terms. Apart from
> > > > > not being "fuzzy" it looks quite promising.
> > > > >
> > > > > How easily can I integrate Solr's suggestion with my Lucene index?
> > > > >
> > > > > Thx
> > > > > Clemens
> > > > >
> > > > > ---------------------------------------------------------------------
> > > > > To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
> > > > > For additional commands, e-mail: java-user-help@lucene.apache.org
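
The normalization idea discussed in this thread (build the dictionary from
normalized tokens, then normalize the query prefix the same way) can be
sketched in plain Java. This is only an illustration under stated
assumptions, not FSTLookup and not any Lucene or Solr API: a
java.util.TreeMap stands in for the automaton, and the names CasedSuggester,
add and suggest are hypothetical. To return cased suggestions for lowercased
prefixes, the sketch stores the original terms as values under their
normalized keys.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Locale;
import java.util.TreeMap;

// Hypothetical stand-in for an automaton-based lookup: a sorted map from
// normalized (lowercased) keys to the original, cased terms.
final class CasedSuggester {
    private final TreeMap<String, List<String>> byNormalized = new TreeMap<>();

    // The same normalization must be used when building and when querying.
    static String normalize(String s) {
        return s.toLowerCase(Locale.ROOT);
    }

    // Index a term under its normalized form, keeping the original casing.
    void add(String term) {
        byNormalized.computeIfAbsent(normalize(term), k -> new ArrayList<>()).add(term);
    }

    // Return up to max original (cased) terms whose normalized form starts
    // with the normalized prefix, in order of the normalized keys.
    List<String> suggest(String prefix, int max) {
        String lo = normalize(prefix);
        // Upper bound of the key range that shares the prefix (assumes no
        // key contains the character U+FFFF right after the prefix).
        String hi = lo + Character.MAX_VALUE;
        List<String> out = new ArrayList<>();
        for (List<String> originals : byNormalized.subMap(lo, true, hi, false).values()) {
            for (String original : originals) {
                if (out.size() == max) {
                    return out;
                }
                out.add(original);
            }
        }
        return out;
    }

    public static void main(String[] args) {
        CasedSuggester s = new CasedSuggester();
        s.add("Lucene");
        s.add("LUCID");
        s.add("Solr");
        System.out.println(s.suggest("lu", 10)); // prints [Lucene, LUCID]
    }
}
```

A sorted map is far less compact than an FST, so this only shows where the
normalization happens and how the cased forms can be carried along; it says
nothing about how FSTLookup stores its data.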