lucene-java-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Rob Staveley (Tom)" <rstave...@seseit.com>
Subject RE: Seeing what's occupying all the space in the index
Date Fri, 26 May 2006 20:17:37 GMT
Indexing 55648 documents in a new clean directory, I see only .cfs files (+
deletable  + segments). Disk usage is 65K for all of these, which means that
each message takes ~1K of index space rather than > 10K as it does in my
99GB index.

Bearing in mind that the large index has > 5 million Lucene documents
indexed in it now, do you reckon I can merge the .fdt, .prx and .frq into a
compound index?

-----Original Message-----
From: Grant Ingersoll [mailto:gsingers@syr.edu]
Sent: 26 May 2006 18:38
To: java-user@lucene.apache.org
Subject: Re: Seeing what's occupying all the space in the index

> Can you try a smaller sample in a clean directory and see what size it is
(so that it doesn't take as long to index)?

Mime
View raw message