lucene-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Robert Muir (JIRA)" <j...@apache.org>
Subject [jira] Commented: (LUCENE-1410) PFOR implementation
Date Tue, 21 Dec 2010 17:50:04 GMT

    [ https://issues.apache.org/jira/browse/LUCENE-1410?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12973790#action_12973790
] 

Robert Muir commented on LUCENE-1410:
-------------------------------------

On LUCENE-2723, I uploaded a "bulk vint" codec that shares most of the same codepath as FOR/PFOR,
except it writes blocks of 128 vint-encoded integers.

There are performance numbers there compared to our Standard vint-based codec, as you can
see
it differs dramatically due to other reasons.

So I thought it would be useful to then compare FOR to this, since its a good measure of just
the compression
algorithm, but everything else is the same (comparing two 128-block size FixedIntBlock codecs
with the same 
index layout, etc etc). This way we compare apples to apples.
 
||Query||QPS BulkVInt||QPS FOR||Pct diff||||
|united~1.0|9.43|9.39|{color:red}-0.5%{color}|
|united~2.0|2.02|2.02|{color:red}-0.3%{color}|
|unit~1.0|6.37|6.36|{color:red}-0.1%{color}|
|unit~2.0|6.13|6.21|{color:green}1.2%{color}|
|"unit state"~3|3.45|3.51|{color:green}2.0%{color}|
|spanNear([unit, state], 10, true)|2.89|2.99|{color:green}3.3%{color}|
|unit*|30.04|31.42|{color:green}4.6%{color}|
|unit state|8.00|8.40|{color:green}5.0%{color}|
|"unit state"|5.97|6.37|{color:green}6.7%{color}|
|spanFirst(unit, 5)|11.29|12.10|{color:green}7.2%{color}|
|uni*|17.36|18.69|{color:green}7.6%{color}|
|+unit +state|10.99|12.18|{color:green}10.8%{color}|
|+nebraska +state|65.74|73.06|{color:green}11.1%{color}|
|state|28.90|32.37|{color:green}12.0%{color}|
|u*d|10.54|12.45|{color:green}18.1%{color}|
|un*d|40.06|47.61|{color:green}18.9%{color}|


> PFOR implementation
> -------------------
>
>                 Key: LUCENE-1410
>                 URL: https://issues.apache.org/jira/browse/LUCENE-1410
>             Project: Lucene - Java
>          Issue Type: New Feature
>          Components: Index
>            Reporter: Paul Elschot
>            Priority: Minor
>             Fix For: Bulk Postings branch
>
>         Attachments: autogen.tgz, for-summary.txt, LUCENE-1410-codecs.tar.bz2, LUCENE-1410.patch,
LUCENE-1410.patch, LUCENE-1410.patch, LUCENE-1410.patch, LUCENE-1410b.patch, LUCENE-1410c.patch,
LUCENE-1410d.patch, LUCENE-1410e.patch, TermQueryTests.tgz, TestPFor2.java, TestPFor2.java,
TestPFor2.java
>
>   Original Estimate: 21840h
>  Remaining Estimate: 21840h
>
> Implementation of Patched Frame of Reference.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: dev-help@lucene.apache.org


Mime
View raw message