lucene-java-user mailing list archives

From "Niraj Alok" <>
Subject indexing size
Date Tue, 31 Aug 2004 05:47:14 GMT

I am indexing plain XML files with a total size of around 100 MB. I am
creating two indexes for different modules, and they are stored in different
directories because I am not merging them. The problem is that the combined
size of these indexes is about 300 MB (3 times the data size), in contrast to
the roughly 35% of the data size that I have read an index should take.
The two indexes have different fields and store different data, so no
duplication is occurring.
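
For anyone who wants to reproduce the comparison, the figures above can be checked with a small measurement like the following. This is only a minimal sketch using the standard library, not part of my indexing code: the class name IndexSizeRatio and the directory arguments are hypothetical, and it simply sums file sizes on disk and divides.

```java
import java.io.IOException;
import java.io.UncheckedIOException;
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.Paths;
import java.util.stream.Stream;

// Hypothetical helper: compares on-disk index size with source data size.
final class IndexSizeRatio {

    /** Sums the sizes, in bytes, of all regular files under the given directory. */
    static long directorySize(Path dir) throws IOException {
        try (Stream<Path> paths = Files.walk(dir)) {
            return paths.filter(Files::isRegularFile)
                        .mapToLong(p -> {
                            try {
                                return Files.size(p);
                            } catch (IOException e) {
                                throw new UncheckedIOException(e);
                            }
                        })
                        .sum();
        }
    }

    /** Returns index size divided by source size (0.35 would mean 35%). */
    static double ratio(long indexBytes, long sourceBytes) {
        if (sourceBytes <= 0) {
            throw new IllegalArgumentException("source size must be positive");
        }
        return (double) indexBytes / sourceBytes;
    }

    public static void main(String[] args) throws IOException {
        if (args.length < 2) {
            System.err.println("usage: IndexSizeRatio <sourceDir> <indexDir> [<indexDir> ...]");
            return;
        }
        long sourceBytes = directorySize(Paths.get(args[0]));
        long indexBytes = 0;
        // Add up every index directory, since the indexes are kept separate.
        for (int i = 1; i < args.length; i++) {
            indexBytes += directorySize(Paths.get(args[i]));
        }
        System.out.printf("source=%d bytes, index=%d bytes, ratio=%.2f%n",
                sourceBytes, indexBytes, ratio(indexBytes, sourceBytes));
    }
}
```

With my numbers, the ratio comes out at about 3.0 (300 MB of index for 100 MB of data), where I expected something near 0.35.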

I have one IndexWriter for each index. After both indexes have been
created, I simply call optimize() on the two writers and close them.

Is there something I am doing wrong? I am using writer.addDocument(doc).

