lucene-java-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Mader, Volker" <VMa...@heiler.com>
Subject AW: AW: Performance with 5 Millions indexed items
Date Tue, 10 Sep 2002 12:10:13 GMT
Did you test FuzzyQuery? It is one part, which is terribly slow, when using WildcardQuery (With
wildcard at the end!), my query is  m u c h  faster.
Any experience with FuzzyQuery?

> -----Urspr√ľngliche Nachricht-----
> Von: tate@pisoftware.com [mailto:tate@pisoftware.com]
> Gesendet: Dienstag, 10. September 2002 11:09
> An: Mader, Volker
> Cc: lucene-user@jakarta.apache.org; tate@pisoftware.com
> Betreff: Re: AW: Performance with 5 Millions indexed items
> 
> 
> We used the standard demo supplied with lucene.  Check out the
> lucene-demos-1.2.war source for fields and indexing classes.
> MergeFactor=10.
> Dual CPU 1.2MGhz running Linux with jdk1.4
> 
> Performed boolean searches, phrases, wildcards and word proximity.
> 
> Can you send a fragment of your indexing and searching source?
> 
> > It's a completely local installation.
> > We used the standard mergeFactor.
> >
> > Could you please describe your scenario? What classes/methods do you
> > use for indexing/searching? How big are your indexed fields? What is
> > your hardware?
> >
> > Sorry for this questions, but perhaps the answers help me understand
> > lucene a little more.
> >
> > Thanks
> >
> >
> >> -----Urspr√ľngliche Nachricht-----
> >> Von: tate@pisoftware.com [mailto:tate@pisoftware.com]
> >> Gesendet: Dienstag, 10. September 2002 10:01
> >> An: lucene-user@jakarta.apache.org
> >> Cc: Mader, Volker
> >> Betreff: Re: Performance with 5 Millions indexed items
> >>
> >>
> >> We have been successful at loading 10 million documents with
> >> 3 fields and
> >> performing acceptable search response times (1-2secs).
> >> Using a network / mounted drive?
> >>
> >> What was your mergeFactor?
> >>
> >>
> >> > Hi,
> >> >
> >> > I've got a question about performance with "bigger" 
> indexes. We used
> >> > IndexWriter with GermanAnalyzer to index data with the following
> >> > fields:
> >> >
> >> > Field1: ID (a long value)
> >> > Field2: Description (a free text)
> >> > Field3: Groups (a list of up to 10 long values encoded 
> in a single
> >> > string) Field4: Classes (a list of up to 10 long values 
> encoded in a
> >> > single string)
> >> >
> >> > Documents are created with the 4 fields and then added to the
> >> > Indexwriter. After all the index is optimized.
> >> >
> >> > Searching now for a word in field "Description" using
> >> > IndexSearcher(GermanAnalyzer) with FuzzyQuery leads to
> >> search times up
> >> > to 30 seconds on a Pentium 4 1,4GHz. Also the retrieval with
> >> > hits.doc(..) is very slow.
> >> >
> >> > Any ideas?
> >> >
> >> > Volker
> >> >
> >> > --
> >> > To unsubscribe, e-mail:
> >> > <mailto:lucene-user-unsubscribe@jakarta.apache.org> For 
> additional
> >> > commands, e-mail: <mailto:lucene-user-help@jakarta.apache.org>
> >>
> >>
> >>
> >>
> 
> 
> 
> 

--
To unsubscribe, e-mail:   <mailto:lucene-user-unsubscribe@jakarta.apache.org>
For additional commands, e-mail: <mailto:lucene-user-help@jakarta.apache.org>


Mime
View raw message