From: Dawid Weiss
Date: Thu, 7 Jul 2011 21:22:11 +0200
Subject: Re: Autocompletion on large index
To: java-user@lucene.apache.org

You can actually make a (relatively easy) change to FSTLookup to allow
infix matches (or word-boundary matches). This should have little impact
on memory and nearly zero impact on performance. This issue tracks it:

https://issues.apache.org/jira/browse/SOLR-2479

I should have implemented it a while ago, but I've been swamped with
other work, sorry.

Dawid

On Thu, Jul 7, 2011 at 7:16 PM, Michael McCandless wrote:
> On Thu, Jul 7, 2011 at 7:00 AM, Dawid Weiss wrote:
>> Another option to trade off size and mem is to do an LRU-like cache of suffix
>> nodes/registry. I'm still working on that API replacement patch so any
>> changes to FST right now scare me...
>
> That sounds cool too!
>
> I opened LUCENE-3289 to allow controlling how hard the Builder tries
> to share suffixes... ie tradeoff CPU/RAM usage while building against
> final FST size.
>
> Mike McCandless
>
> http://blog.mikemccandless.com
>
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
> For additional commands, e-mail: java-user-help@lucene.apache.org
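
To illustrate what "word-boundary matches" means in the reply at the top of this message, here is a minimal sketch of one generic way to get them out of a prefix-only lookup: register each phrase under every suffix that begins at a word boundary. This is an assumption for illustration only. It is not the change described in SOLR-2479 and it does not use FSTLookup; the class `InfixSuggestSketch`, its methods, and the `TreeMap` standing in for the prefix structure are all hypothetical. A naive version like this stores extra keys, so it says nothing about the memory or performance figures mentioned above.

```java
import java.util.ArrayList;
import java.util.LinkedHashSet;
import java.util.List;
import java.util.Locale;
import java.util.Map;
import java.util.Set;
import java.util.TreeMap;

// Hypothetical illustration; not part of Lucene or Solr.
final class InfixSuggestSketch {
    // Sorted map standing in for a prefix-only lookup structure (assumption).
    private final TreeMap<String, Set<String>> index = new TreeMap<>();

    /** Registers the phrase under every suffix that starts at a word boundary. */
    void add(String phrase) {
        String lower = phrase.toLowerCase(Locale.ROOT);
        for (int i = 0; i < lower.length(); i++) {
            boolean atBoundary = (i == 0) || lower.charAt(i - 1) == ' ';
            if (atBoundary && lower.charAt(i) != ' ') {
                index.computeIfAbsent(lower.substring(i), k -> new LinkedHashSet<>())
                     .add(phrase);
            }
        }
    }

    /** Returns phrases that contain a word starting with the given prefix. */
    List<String> lookup(String prefix) {
        String p = prefix.toLowerCase(Locale.ROOT);
        Set<String> hits = new LinkedHashSet<>();
        // Keys sharing a prefix are contiguous in sorted order, starting at p.
        for (Map.Entry<String, Set<String>> e : index.tailMap(p, true).entrySet()) {
            if (!e.getKey().startsWith(p)) {
                break;
            }
            hits.addAll(e.getValue());
        }
        return new ArrayList<>(hits);
    }

    public static void main(String[] args) {
        InfixSuggestSketch s = new InfixSuggestSketch();
        s.add("Apache Lucene");
        s.add("Lucene in Action");
        // "luc" matches at the start of one phrase and at a word boundary in the other.
        System.out.println(s.lookup("luc"));
        // "ene" occurs only inside a word, so it is not a word-boundary match.
        System.out.println(s.lookup("ene"));
    }
}
```

A true infix match (matching inside a word) would need every suffix indexed, not just those at word boundaries, which is where the size tradeoffs discussed in the quoted thread would start to matter.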