lucene-java-user mailing list archives

From "Rob Jose" <>
Subject Re: Index Size
Date Thu, 19 Aug 2004 13:37:42 GMT
Thanks for responding.  Yes, I optimize right before I close the index
writer.  I added this a little while ago to try and get the size down.

----- Original Message ----- 
From: "Karthik N S" <>
To: "Lucene Users List" <>
Sent: Thursday, August 19, 2004 12:59 AM
Subject: RE: Index Size


   Are you optimizing the index before closing it?

  If not, try it...  :}


-----Original Message-----
From: Honey George []
Sent: Thursday, August 19, 2004 1:00 PM
To: Lucene Users List
Subject: Re: Index Size

 Please check for hidden files in the index folder. If
you are using Linux, do something like

ls -al <index folder>

I am also facing a similar problem where the index
size is greater than the data size. In my case there
were some hidden temporary files which the Lucene
That was taking half of the total size.
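
For anyone who would rather check this from Java than from the shell, here is
a minimal sketch of the same idea: it adds up the sizes of the files in the
index folder and, separately, the sizes of the dot-files only. The class name
IndexDirSize and the method sumBytes are made up for illustration and are not
part of Lucene. It also assumes that "hidden" means a file name starting with
a dot, as on Linux, and it only looks at the top level of the folder.

```java
import java.io.IOException;
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.Paths;
import java.util.stream.Stream;

// Hypothetical helper, not part of Lucene: reports how much of an index
// folder is taken up by hidden (dot-prefixed) files.
public class IndexDirSize {

    // Sums the sizes of regular files directly inside dir.
    // If hiddenOnly is true, only files whose name starts with "." are counted.
    public static long sumBytes(Path dir, boolean hiddenOnly) throws IOException {
        long total = 0;
        try (Stream<Path> entries = Files.list(dir)) {
            for (Path p : (Iterable<Path>) entries::iterator) {
                if (!Files.isRegularFile(p)) {
                    continue; // skip subdirectories and special files
                }
                boolean dotFile = p.getFileName().toString().startsWith(".");
                if (hiddenOnly && !dotFile) {
                    continue;
                }
                total += Files.size(p);
            }
        }
        return total;
    }

    public static void main(String[] args) throws IOException {
        // The index folder is passed as the first argument; defaults to ".".
        Path indexDir = Paths.get(args.length > 0 ? args[0] : ".");
        System.out.println("total bytes:  " + sumBytes(indexDir, false));
        System.out.println("hidden bytes: " + sumBytes(indexDir, true));
    }
}
```

Run it as "java IndexDirSize <index folder>". If the hidden figure is a large
share of the total, those files are worth a closer look before blaming the
index itself.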

My problem is that after deleting the temporary files,
the index size is same as that of the data size. That
again seems to be a problem. I am yet to find out the


 --- Rob Jose <> wrote:
> Hello
> I have indexed several thousand (52 to be exact)
> text files and I keep running out of disk space to
> store the indexes.  The size of the documents I have
> indexed is around 2.5 GB.  The size of the Lucene
> indexes is around 287 GB.  Does this seem correct?
> I am not storing the contents of the file, just
> indexing and tokenizing.  I am using Lucene 1.3
> final.  Can you guys let me know what you are
> experiencing?  I don't want to go into production
> with something that I should be configuring better.
> I am not sure if this helps, but I have a temp index
> and a real index.  I index the file into the temp
> index, and then merge the temp index into the real
> index using the addIndexes method on the
> IndexWriter.  I have also set the production writer
> setUseCompoundFile to true.  I did not set this on
> the temp index.  The last thing that I do before
> closing the production writer is to call the
> optimize method.
> I would really appreciate any ideas to get the index
> size smaller if it is at all possible.
> Thanks
> Rob


To unsubscribe, e-mail:
For additional commands, e-mail:
