lucene-java-user mailing list archives

From Robert Muir <rcm...@gmail.com>
Subject Re: New codecs keep Freq skip/omit Pos
Date Fri, 22 Apr 2011 02:46:24 GMT
On Thu, Apr 21, 2011 at 9:52 PM, Alex vB <mail@avomberg.de> wrote:
>
> PforDelta     W Freq W Pos          20.6 GB
> PforDelta     W/O Freq W/O Pos       1.6 GB
> Standard 4.0  W Freq W Pos          28.1 GB
> Standard 4.0  W/O Freq W/O Pos       6.2 GB
> Pfor          W Freq W Pos          22   GB
> Pfor          W/O Freq W/O Pos       3.1 GB
>

Hi, can you provide some more details on these index size numbers?
* Which one is PforDelta versus Pfor? We have two PFOR-delta impls,
PatchedFrameOfRef and PatchedFrameOfRef2, that are slightly
different. I'm pretty curious about the huge size differential
between the two, though (e.g. 1.6 GB versus 3.1 GB). Can you give more
info, such as a breakdown of file sizes?
* Are you using a StopFilter at index time, or are you indexing all
terms including stopwords?
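
One way to produce a file-size breakdown is to sum the on-disk size of each file in the index directory, grouped by file extension. The following is only a minimal sketch using plain java.nio and no Lucene APIs; the class name IndexSizeBreakdown and the method sizeByExtension are hypothetical names chosen for illustration, and which extensions appear depends on the codec in use.

```java
import java.io.IOException;
import java.nio.file.DirectoryStream;
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.Paths;
import java.util.Map;
import java.util.TreeMap;

// Hypothetical helper (not part of Lucene): totals file sizes per extension.
class IndexSizeBreakdown {

    // Returns total bytes per file extension for the regular files directly
    // inside indexDir. Files without a dot are keyed by their full name.
    static Map<String, Long> sizeByExtension(Path indexDir) throws IOException {
        Map<String, Long> totals = new TreeMap<>();
        try (DirectoryStream<Path> files = Files.newDirectoryStream(indexDir)) {
            for (Path file : files) {
                if (!Files.isRegularFile(file)) {
                    continue; // skip subdirectories and other non-regular entries
                }
                String name = file.getFileName().toString();
                int dot = name.lastIndexOf('.');
                String ext = dot >= 0 ? name.substring(dot + 1) : name;
                totals.merge(ext, Files.size(file), Long::sum);
            }
        }
        return totals;
    }

    public static void main(String[] args) throws IOException {
        // Assumption: the index directory is passed as the first argument.
        Path dir = Paths.get(args.length > 0 ? args[0] : ".");
        for (Map.Entry<String, Long> e : sizeByExtension(dir).entrySet()) {
            System.out.printf("%-12s %,15d bytes%n", e.getKey(), e.getValue());
        }
    }
}
```

Running this against each of the six indexes and comparing the per-extension totals should show which files account for the difference between the two PFOR variants.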
