lucene-java-user mailing list archives

From Robert Muir <rcm...@gmail.com>
Subject Re: New codecs keep Freq skip/omit Pos
Date Fri, 22 Apr 2011 16:32:28 GMT
On Fri, Apr 22, 2011 at 12:24 PM, Alex vB <mail@avomberg.de> wrote:
> I also indexed one time with Lucene 3.0. Are those sizes really completely
> the same?
>
> Standard 4.0 W Freq W Pos       28.1 GB
> Standard 4.0 W/O Freq W/O Pos    6.2 GB
> Standard 3.0 W Freq W Pos       28.1 GB
> Standard 3.0 W/O Freq W/O Pos    6.2 GB
>

They shouldn't be *completely* the same, but for your test (where the
terms dictionary etc. is relatively small) they should be very close.

Standard 4.0 still uses the same underlying vByte compression etc. as
the 3.0 index format, though it has some major changes in other
places (e.g. the terms dict).
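
For readers unfamiliar with vByte, here is a minimal sketch of the idea,
not Lucene's actual postings code: each integer is written 7 bits at a
time, low-order group first, with the high bit of a byte set when more
bytes follow, and doc IDs are stored as gaps from the previous ID so most
values fit in one byte. The class and method names (VByteSketch,
writeVInt, encodeDeltas, decodeDeltas) are hypothetical, and the sketch
assumes sorted, non-negative doc IDs and ignores how freqs and positions
are interleaved in the real files.

```java
import java.io.ByteArrayOutputStream;

public class VByteSketch {
    // Write a non-negative int as 7-bit groups, low-order group first.
    // The high bit (0x80) of a byte means "another byte follows".
    static void writeVInt(ByteArrayOutputStream out, int value) {
        while ((value & ~0x7F) != 0) {
            out.write((value & 0x7F) | 0x80);
            value >>>= 7;
        }
        out.write(value);
    }

    // Encode sorted doc IDs as vByte-coded gaps from the previous ID.
    static byte[] encodeDeltas(int[] sortedDocIds) {
        ByteArrayOutputStream out = new ByteArrayOutputStream();
        int prev = 0;
        for (int id : sortedDocIds) {
            writeVInt(out, id - prev);
            prev = id;
        }
        return out.toByteArray();
    }

    // Decode `count` gaps and rebuild the original doc IDs.
    static int[] decodeDeltas(byte[] bytes, int count) {
        int[] ids = new int[count];
        int pos = 0;
        int prev = 0;
        for (int i = 0; i < count; i++) {
            int b = bytes[pos++];
            int value = b & 0x7F;
            for (int shift = 7; (b & 0x80) != 0; shift += 7) {
                b = bytes[pos++];
                value |= (b & 0x7F) << shift;
            }
            prev += value;
            ids[i] = prev;
        }
        return ids;
    }
}
```

Since both formats encode the postings this way, an index whose size is
dominated by postings rather than by the terms dictionary would be
expected to come out at nearly the same size in 3.0 and 4.0.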
