lucene-java-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Fang...@emc.com
Subject RE: Lucene index performance
Date Wed, 20 Jun 2007 03:05:21 GMT
Hi Andreas,
	I am very interested in the multiple index file index/search.
Can you kindly help me on following questions?
1) Why you use multi index files? How much is the performance gain for
both indexing and searching? Someone reported that there no big
performance difference except the number if indices is huge, like 1000.
2) Are these index files located in a single machine or distributed into
multiple machines? 
3) How do you distribute the document into several index files?

Thanks a lot,
Li

-----Original Message-----
From: Andreas Guther [mailto:andreas.guther@gmail.com] 
Sent: Monday, June 18, 2007 4:00 AM
To: java-user@lucene.apache.org
Subject: Re: Lucene index performance

Searching on multiple index files is incredible fast.  We have 10
different
index folders with different sizes.  All folders together have a size of
7
GB.  Results come back usual within less than 50 ms.  Getting results
out of
the index i.e. reading documents is expensive and you will have to spent
time here to get a good performance.  You will need to look into
- Topdocs
- Extracting results in an ordered way, i.e. sort by index and within an
index by document id.  This will help to minimize disk head jumps and
gave
me a tremendous boost.
- Extracting only what you need (using a special read filter I do not
recall
the name right now and I do not have access to my sources at the moment
of
writing this)

Andreas


On 6/17/07, Mark Miller <markrmiller@gmail.com> wrote:
>
>
>
> Lee Li Bin wrote:
> > Hi,
> >
> > I would like to know how's the performance during indexing and
searching
> of
> > results on a large index files would be like.
> >
> Fast.
> > And is it possible to create multiple index files and search across
> multiple
> > index files?
> Yes.
> >  If possible, may I know how could it be done?
> >
> Check out MultiSearcher.
>
>
http://lucene.zones.apache.org:8080/hudson/job/Lucene-Nightly/javadoc/or
g/apache/lucene/search/MultiSearcher.html
> > Thanks a lot.
> >
> >
> >
> >
> >
> >
---------------------------------------------------------------------
> > To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
> > For additional commands, e-mail: java-user-help@lucene.apache.org
> >
> >
> >
>
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
> For additional commands, e-mail: java-user-help@lucene.apache.org
>
>

---------------------------------------------------------------------
To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-user-help@lucene.apache.org


Mime
View raw message