lucene-java-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Niraj Alok" <ni...@emacmillan.com>
Subject Re: indexing size
Date Tue, 31 Aug 2004 09:45:01 GMT
Hi Guys,

If you have any ideas, please help me out. I have looked into most of the
lucene archives and they are suggesting what I am currently doing. So the
only possible solution for me right now would be to reduce the no. of fields
which could severely change the logic used for searching.


Regards,
Niraj
----- Original Message -----
From: "Niraj Alok" <niraj@emacmillan.com>
To: "Lucene Users List" <lucene-user@jakarta.apache.org>
Sent: Tuesday, August 31, 2004 11:17 AM
Subject: indexing size


> Hi,
>
> I am indexing plain xml files , total size of which is around 100 MB. I am
> creating two indexes for different modules, and they are stored in
different
> directories as I am not merging them. The problem is that the combined
size
> of these indexes is about 300 MB, ( 3 times the data size), which is in
> contrast to the 35% I have read it should create.
> Both these indexes have different fields and different data is stored in
> them and hence there is no duplication occuring.
>
> I have one indexwriter for each index. After both the indexes have been
> created, I am simply calling optimize on these two writers and closing
them.
>
> Is there something I am doing wrong? I am using writer.addDocument(doc).
>
> Regards,
> Niraj
>

Mime
  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message