lucene-java-user mailing list archives

From "Erick Erickson" <>
Subject Re: substring indexing to avoid 'TooManyClauses' exception
Date Wed, 14 Nov 2007 21:52:24 GMT

Since your use-case is so restricted, I'd recommend that you
just construct a filter. I think you'll find it's much faster than
you'd think at first glance. Of course, "your mileage may
vary". Is there an equivalent phrase like "your kilometerage
may vary" <G>?

Most of the discussion in the archives has to do with the
more general case, so much of it probably doesn't apply to
your specific case.
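
To make the filter suggestion concrete, here is a minimal sketch of
the idea in plain Java. It is not Lucene API: the class
PrefixFilterSketch and its methods addName and prefixBits are
hypothetical names used only for illustration, and a TreeMap stands
in for the sorted term dictionary. The point it shows is that a
right-hand truncated search can seek to the prefix in the sorted
terms, walk forward while terms still start with it, and OR the
matching documents into one bit set, so no per-term query clauses
are built and the BooleanQuery clause limit never comes into play.

```java
import java.util.BitSet;
import java.util.Locale;
import java.util.Map;
import java.util.TreeMap;

/** Plain-Java stand-in for a prefix filter; hypothetical names, not Lucene API. */
public class PrefixFilterSketch {
    // Sorted term dictionary: term -> ids of the documents containing that term.
    private final TreeMap<String, BitSet> postings = new TreeMap<>();
    private int docCount = 0;

    /** Indexes a name as lowercased, whitespace-separated terms; returns its doc id. */
    public int addName(String name) {
        int docId = docCount++;
        for (String token : name.toLowerCase(Locale.ROOT).trim().split("\\s+")) {
            if (token.isEmpty()) {
                continue; // blank input yields one empty token; skip it
            }
            postings.computeIfAbsent(token, t -> new BitSet()).set(docId);
        }
        return docId;
    }

    /** Returns a bit set whose set bits mark documents with a term starting with prefix. */
    public BitSet prefixBits(String prefix) {
        String p = prefix.toLowerCase(Locale.ROOT);
        BitSet bits = new BitSet(docCount);
        // Terms sharing a prefix are contiguous in sorted order, so seek to the
        // first term >= prefix and stop at the first term that no longer matches.
        for (Map.Entry<String, BitSet> e : postings.tailMap(p, true).entrySet()) {
            if (!e.getKey().startsWith(p)) {
                break;
            }
            bits.or(e.getValue());
        }
        return bits;
    }
}
```

The resulting bit set can then be intersected with the hits of
whatever other query is being run. The trade-off is that a filter
only says "matches or not" and contributes nothing to scoring.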


On Nov 14, 2007 4:25 PM, Hardy Ferentschik <> wrote:

> On Tue, 13 Nov 2007 16:12:26 +0100, Erick Erickson
> <> wrote:
> Thanks for your help.
> > I'm certainly not an expert on ranking and scoring, but I've got to
> > assume that this approach influences scoring.
> No doubt. The question is if it matters for this particular use case. For
> this particular field I will only ever have a simple right-hand truncated
> search. The user cannot use span or phrase queries against this field, not
> even explicit AND. I don't think this approach makes much sense when
> indexing a whole block of text. I only want to use it for indexing a
> simple name which at most consists of a few words. I guess what I want to
> do here is comparable to a single-column SQL LIKE query, e.g. SELECT * FROM
> COMPANY WHERE COMPANY.NAME LIKE 'M%'. Of course this is only the simple
> case. There are other queries where I combine the name search with other
> fields which are indexed using for example a SnowballAnalyzer.
> > There are several approaches. There is a thread titled "I just don't
> > understand wildcards at all" that has a bunch of information about
> > wildcards, and searching the archive for "wildcards" will turn up a
> > wealth of information.
> Great. I will look into it.
> Thanks again.
> -- Hardy
> --
> Hartmut Ferentschik
> Ekholmsv.339 ,1, 127 45 Skärholmen, Sweden
> Phone: +46 855 923 676 (h); +46 704 225 097 (m)
