lucene-java-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Peter Carlson <carl...@bookandhammer.com>
Subject Re: AW: AW: Performance with 5 Millions indexed items
Date Tue, 10 Sep 2002 15:52:20 GMT
FuzzyQuery is an extremely expensive query and I could understand how 
it could be slow with an index with a lot of terms.

--Peter


On Tuesday, September 10, 2002, at 05:10 AM, Mader, Volker wrote:

> Did you test FuzzyQuery? It is one part, which is terribly slow, when 
> using WildcardQuery (With wildcard at the end!), my query is  m u c h  
> faster.
> Any experience with FuzzyQuery?
>
>> -----Urspr√ľngliche Nachricht-----
>> Von: tate@pisoftware.com [mailto:tate@pisoftware.com]
>> Gesendet: Dienstag, 10. September 2002 11:09
>> An: Mader, Volker
>> Cc: lucene-user@jakarta.apache.org; tate@pisoftware.com
>> Betreff: Re: AW: Performance with 5 Millions indexed items
>>
>>
>> We used the standard demo supplied with lucene.  Check out the
>> lucene-demos-1.2.war source for fields and indexing classes.
>> MergeFactor=10.
>> Dual CPU 1.2MGhz running Linux with jdk1.4
>>
>> Performed boolean searches, phrases, wildcards and word proximity.
>>
>> Can you send a fragment of your indexing and searching source?
>>
>>> It's a completely local installation.
>>> We used the standard mergeFactor.
>>>
>>> Could you please describe your scenario? What classes/methods do you
>>> use for indexing/searching? How big are your indexed fields? What is
>>> your hardware?
>>>
>>> Sorry for this questions, but perhaps the answers help me understand
>>> lucene a little more.
>>>
>>> Thanks
>>>
>>>
>>>> -----Urspr√ľngliche Nachricht-----
>>>> Von: tate@pisoftware.com [mailto:tate@pisoftware.com]
>>>> Gesendet: Dienstag, 10. September 2002 10:01
>>>> An: lucene-user@jakarta.apache.org
>>>> Cc: Mader, Volker
>>>> Betreff: Re: Performance with 5 Millions indexed items
>>>>
>>>>
>>>> We have been successful at loading 10 million documents with
>>>> 3 fields and
>>>> performing acceptable search response times (1-2secs).
>>>> Using a network / mounted drive?
>>>>
>>>> What was your mergeFactor?
>>>>
>>>>
>>>>> Hi,
>>>>>
>>>>> I've got a question about performance with "bigger"
>> indexes. We used
>>>>> IndexWriter with GermanAnalyzer to index data with the following
>>>>> fields:
>>>>>
>>>>> Field1: ID (a long value)
>>>>> Field2: Description (a free text)
>>>>> Field3: Groups (a list of up to 10 long values encoded
>> in a single
>>>>> string) Field4: Classes (a list of up to 10 long values
>> encoded in a
>>>>> single string)
>>>>>
>>>>> Documents are created with the 4 fields and then added to the
>>>>> Indexwriter. After all the index is optimized.
>>>>>
>>>>> Searching now for a word in field "Description" using
>>>>> IndexSearcher(GermanAnalyzer) with FuzzyQuery leads to
>>>> search times up
>>>>> to 30 seconds on a Pentium 4 1,4GHz. Also the retrieval with
>>>>> hits.doc(..) is very slow.
>>>>>
>>>>> Any ideas?
>>>>>
>>>>> Volker
>>>>>
>>>>> --
>>>>> To unsubscribe, e-mail:
>>>>> <mailto:lucene-user-unsubscribe@jakarta.apache.org> For
>> additional
>>>>> commands, e-mail: <mailto:lucene-user-help@jakarta.apache.org>
>>>>
>>>>
>>>>
>>>>
>>
>>
>>
>>
>
> --
> To unsubscribe, e-mail:   
> <mailto:lucene-user-unsubscribe@jakarta.apache.org>
> For additional commands, e-mail: 
> <mailto:lucene-user-help@jakarta.apache.org>
>
>


--
To unsubscribe, e-mail:   <mailto:lucene-user-unsubscribe@jakarta.apache.org>
For additional commands, e-mail: <mailto:lucene-user-help@jakarta.apache.org>


Mime
View raw message