lucene-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Yonik Seeley (JIRA)" <j...@apache.org>
Subject [jira] [Updated] (SOLR-9599) DocValues performance regression with new iterator API
Date Tue, 18 Oct 2016 02:57:58 GMT

     [ https://issues.apache.org/jira/browse/SOLR-9599?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]

Yonik Seeley updated SOLR-9599:
-------------------------------
    Description: 
I did a quick performance comparison of faceting indexed fields (i.e. docvalues are not stored)
using method=dv before and after the new docvalues iterator went in (LUCENE-7407).

5M document index, 21 segments, single valued string fields w/ no missing values.

|| field cardinality || new_time / old_time ||
|10|2.01|
|1000|2.02|
|10000|1.85|
|100000|1.56|
|1000000|1.31|

So unfortunately, often twice as slow.

See followup messages for tests using real docvalues as well.

  was:
I did a quick performance comparison of faceting indexed fields (i.e. docvalues are not stored)
using method=dv before and after the new docvalues iterator went in (LUCENE-7407).

5M document index, 21 segments, single valued string fields w/ no missing values.

|| field cardinality || new_time / old_time ||
|10|2.01|
|1000|2.02|
|10000|1.85|
|100000|1.56|
|1000000|1.31|

So unfortunately, often twice as slow.


> DocValues performance regression with new iterator API
> ------------------------------------------------------
>
>                 Key: SOLR-9599
>                 URL: https://issues.apache.org/jira/browse/SOLR-9599
>             Project: Solr
>          Issue Type: Bug
>      Security Level: Public(Default Security Level. Issues are Public) 
>    Affects Versions: master (7.0)
>            Reporter: Yonik Seeley
>             Fix For: master (7.0)
>
>
> I did a quick performance comparison of faceting indexed fields (i.e. docvalues are not
stored) using method=dv before and after the new docvalues iterator went in (LUCENE-7407).
> 5M document index, 21 segments, single valued string fields w/ no missing values.
> || field cardinality || new_time / old_time ||
> |10|2.01|
> |1000|2.02|
> |10000|1.85|
> |100000|1.56|
> |1000000|1.31|
> So unfortunately, often twice as slow.
> See followup messages for tests using real docvalues as well.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: dev-help@lucene.apache.org


Mime
View raw message