lucene-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Martijn van Groningen (JIRA)" <>
Subject [jira] Updated: (LUCENE-1421) Ability to group search results by field
Date Tue, 22 Jun 2010 21:00:58 GMT


Martijn van Groningen updated LUCENE-1421:

    Attachment: lucene-grouping.patch

This is an initial patch that allows result grouping with Lucene via a Collector and an attempt
to integrate result grouping into Lucene / Solr. The collector can be used just like any other
collector and returns TopDocs. The TopDocs contains GroupDoc instances, which is a subclass
of ScoreDoc. I think this way it is easier to integrate grouping into existing code that uses
Lucene (like Solr).

I think that grouping code should be part of Lucene instead of Solr. I put the result grouping
into a new contrib that I named grouping. Putting it in a contib seemed the right place for
me.  The patch doesn't contain any Solr code and I think a new issue in Solr should be opened
for that.

This patch is 'inspired by' by SOLR-236, but only contains its core functionality. Nonadjacent
grouping based on field value with group counts. Also in the code i don't use the verb collapsing
but grouping. This patch is also faster then the Solr variants. This because the grouping
occurs whilst the documents are collected and thus saves multiple searches.  Also the grouping
algorithm itself is improved. 

Although this is work in progress any thought about this would be appriciated.

> Ability to group search results by field
> ----------------------------------------
>                 Key: LUCENE-1421
>                 URL:
>             Project: Lucene - Java
>          Issue Type: New Feature
>          Components: Search
>            Reporter: Artyom Sokolov
>            Priority: Minor
>         Attachments: lucene-grouping.patch
> It would be awesome to group search results by specified field. Some functionality was
provided for Apache Solr but I think it should be done in Core Lucene. There could be some
useful information like total hits about collapsed data like total count and so on.
> Thanks,
> Artyom

This message is automatically generated by JIRA.
You can reply to this email to add a comment to the issue online.

To unsubscribe, e-mail:
For additional commands, e-mail:

View raw message