lucene-java-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Eric Louvard <eric.louv...@hauk-sasko.de>
Subject Re: Optimizing index takes too long
Date Mon, 12 Nov 2007 15:37:17 GMT
You could have a look at this thread.
http://www.gossamer-threads.com/lists/lucene/java-user/29354

regards.

Barry Forrest schrieb:
> Hi,
> 
> Optimizing my index of 1.5 million documents takes days and days.
> 
> I have a collection of 10 million documents that I am trying to index
> with Lucene.  I've divided the collection into chunks of about 1.5 - 2
> million documents each.  Indexing 1.5 documents is fast enough (about
> 12 hours), but this results in an index directory containing about
> 35000 files.  Optimizing this index takes several days, which is a bit
> too long for my purposes.  Each sub-index is about 150G.
> 
> What can I do to make this process faster?
> 
> Thanks for your help,
> Barry
> 
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
> For additional commands, e-mail: java-user-help@lucene.apache.org
> 
> 
> 
> 


-- 
Mit freundlichen Grüßen

i. A. Éric Louvard
HAUK & SASKO Ingenieurgesellschaft mbH
Zettachring 2
D-70567 Stuttgart

Phone: +49 7 11 7 25 89 - 19
Fax: +49 7 11 7 25 89 - 50
E-Mail: eric.louvard@hauk-sasko.de
www: www.hauk-sasko.de
Niederlassung Stuttgart
Firmensitz: Markstr. 77, 44801 Bochum
Registergericht: Amtsgericht Bochum, HRB 2532
Geschäftsführer: Dr.-Ing. Pavol Sasko




---------------------------------------------------------------------
To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-user-help@lucene.apache.org


Mime
View raw message