lucene-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Jason Rutherglen (JIRA)" <>
Subject [jira] Commented: (LUCENE-1476) BitVector implement DocIdSet, IndexReader returns DocIdSet deleted docs
Date Tue, 27 Jan 2009 16:24:59 GMT


Jason Rutherglen commented on LUCENE-1476:

The percentage performance decrease in the previous 
results is 99%.  

Jason can you format those results using a Jira table? 

Perhaps this should be an option in the benchmark output?

M.M. LUCENE-1516 comment: "I think the larger number of
[harder-for-cpu-to-predict] if statements may be the cause of the
slowdown once %tg deletes gets high enough?" 

I have been looking at the performance with YourKit and don't have
any conclusions yet. The main difference between using skipto and 
BV.get is the if statements and some added method calls, which even 
if they are inlined I suspect will not make up the difference.

Next steps: 
1. Deletes as a NOT boolean query which probably should
be it's own patch 
2.  Pluggable alternative representations such as
OpenBitSet and int array, part of this patch? 

> BitVector implement DocIdSet, IndexReader returns DocIdSet deleted docs
> -----------------------------------------------------------------------
>                 Key: LUCENE-1476
>                 URL:
>             Project: Lucene - Java
>          Issue Type: Improvement
>          Components: Index
>    Affects Versions: 2.4
>            Reporter: Jason Rutherglen
>            Priority: Trivial
>         Attachments: LUCENE-1476.patch, LUCENE-1476.patch, LUCENE-1476.patch, quasi_iterator_deletions.diff,
quasi_iterator_deletions_r2.diff, searchdeletes.alg
>   Original Estimate: 12h
>  Remaining Estimate: 12h
> Update BitVector to implement DocIdSet.  Expose deleted docs DocIdSet from IndexReader.

This message is automatically generated by JIRA.
You can reply to this email to add a comment to the issue online.

To unsubscribe, e-mail:
For additional commands, e-mail:

View raw message