lucene-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Jim Musil (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (SOLR-7036) Faster method for group.facet
Date Wed, 11 Nov 2015 20:23:11 GMT

    [ https://issues.apache.org/jira/browse/SOLR-7036?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15001020#comment-15001020
] 

Jim Musil commented on SOLR-7036:
---------------------------------

I'm still working on this. Quite a bit changed under the hood regarding UninvertedFields,
so it has not been a quick translation from 4.x to 5.x for this patch.



> Faster method for group.facet
> -----------------------------
>
>                 Key: SOLR-7036
>                 URL: https://issues.apache.org/jira/browse/SOLR-7036
>             Project: Solr
>          Issue Type: Improvement
>          Components: faceting
>    Affects Versions: 4.10.3
>            Reporter: Jim Musil
>            Assignee: Erick Erickson
>             Fix For: 5.4, Trunk
>
>         Attachments: SOLR-7036.patch, SOLR-7036.patch
>
>
> This is a patch that speeds up the performance of requests made with group.facet=true.
The original code that collects and counts unique facet values for each group does not use
the same improved field cache methods that have been added for normal faceting in recent versions.
> Specifically, this approach leverages the UninvertedField class which provides a much
faster way to look up docs that contain a term. I've also added a simple grouping map so that
when a term is found for a doc, it can quickly look up the group to which it belongs.
> Group faceting was very slow for our data set and when the number of docs or terms was
high, the latency spiked to multiple second requests. This solution provides better overall
performance -- from an average of 54ms to 32ms. It also dropped our slowest performing queries
way down -- from 6012ms to 991ms.
> I also added a few tests.
> I added an additional parameter so that you can choose to use this method or the original.
Add group.facet.method=fc to use the improved method or group.facet.method=original which
is the default if not specified.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: dev-help@lucene.apache.org


Mime
View raw message