Delivered-To: apmail-lucene-dev-archive@www.apache.org
Received: (qmail 57686 invoked from network); 22 Jun 2010 21:01:21 -0000
Received: from unknown (HELO mail.apache.org) (140.211.11.3)
    by 140.211.11.9 with SMTP; 22 Jun 2010 21:01:21 -0000
Received: (qmail 44392 invoked by uid 500); 22 Jun 2010 21:01:20 -0000
Delivered-To: apmail-lucene-dev-archive@lucene.apache.org
Received: (qmail 44142 invoked by uid 500); 22 Jun 2010 21:01:19 -0000
Mailing-List: contact dev-help@lucene.apache.org; run by ezmlm
Precedence: bulk
Reply-To: dev@lucene.apache.org
Delivered-To: mailing list dev@lucene.apache.org
Received: (qmail 44134 invoked by uid 99); 22 Jun 2010 21:01:19 -0000
Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136)
    by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 22 Jun 2010 21:01:19 +0000
X-ASF-Spam-Status: No, hits=-1537.5 required=10.0 tests=ALL_TRUSTED,AWL
X-Spam-Check-By: apache.org
Received: from [140.211.11.22] (HELO thor.apache.org) (140.211.11.22)
    by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 22 Jun 2010 21:01:18 +0000
Received: from thor (localhost [127.0.0.1])
    by thor.apache.org (8.13.8+Sun/8.13.8) with ESMTP id o5ML0wiX021616;
    Tue, 22 Jun 2010 21:00:58 GMT
Message-ID: <16261347.21711277240458238.JavaMail.jira@thor>
Date: Tue, 22 Jun 2010 17:00:58 -0400 (EDT)
From: "Martijn van Groningen (JIRA)"
To: dev@lucene.apache.org
Subject: [jira] Updated: (LUCENE-1421) Ability to group search results by field
MIME-Version: 1.0
Content-Type: text/plain; charset=utf-8
Content-Transfer-Encoding: 7bit
X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394

     [ https://issues.apache.org/jira/browse/LUCENE-1421?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Martijn van Groningen updated LUCENE-1421:
------------------------------------------

    Attachment: lucene-grouping.patch

This is an initial patch that allows result grouping with Lucene via a Collector and an
attempt to integrate result grouping into Lucene / Solr. The collector can be used just like any other collector and returns TopDocs. The TopDocs contains GroupDoc instances; GroupDoc is a subclass of ScoreDoc. I think this makes it easier to integrate grouping into existing code that uses Lucene (like Solr).

I think the grouping code should be part of Lucene instead of Solr. I put the result grouping into a new contrib that I named grouping; a contrib seemed the right place to me. The patch doesn't contain any Solr code, and I think a new issue in Solr should be opened for that.

This patch is 'inspired by' SOLR-236, but only contains its core functionality: nonadjacent grouping based on field value, with group counts. Also, in the code I don't use the verb "collapsing" but "grouping".

This patch is also faster than the Solr variants, because the grouping occurs while the documents are collected, which avoids multiple searches. The grouping algorithm itself is also improved.

Although this is a work in progress, any thoughts about this would be appreciated.

> Ability to group search results by field
> ----------------------------------------
>
>                 Key: LUCENE-1421
>                 URL: https://issues.apache.org/jira/browse/LUCENE-1421
>             Project: Lucene - Java
>          Issue Type: New Feature
>          Components: Search
>            Reporter: Artyom Sokolov
>            Priority: Minor
>         Attachments: lucene-grouping.patch
>
>
> It would be awesome to group search results by specified field. Some functionality was provided for Apache Solr but I think it should be done in Core Lucene. There could be some useful information like total hits about collapsed data like total count and so on.
> Thanks,
> Artyom

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: dev-help@lucene.apache.org
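
For readers of the archive, the idea in the comment above (group hits while they are collected, in a single pass, keeping a count per group) might look roughly like the sketch below. It is not code from lucene-grouping.patch and deliberately avoids the Lucene API: `GroupingSketch`, `Hit`, and `Group` are hypothetical names, and keeping only the best-scoring hit per group is an assumption made for illustration.

```java
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

/** Hypothetical illustration only; not code from lucene-grouping.patch. */
final class GroupingSketch {

    /** One collected hit: document id, value of the grouping field, and score. */
    static final class Hit {
        final int doc;
        final String groupValue;
        final float score;

        Hit(int doc, String groupValue, float score) {
            this.doc = doc;
            this.groupValue = groupValue;
            this.score = score;
        }
    }

    /** Best hit seen so far for one group value, plus the group's hit count. */
    static final class Group {
        Hit top;
        int count;

        Group(Hit first) {
            this.top = first;
            this.count = 1;
        }
    }

    // Keyed by field value, so hits of one group need not arrive adjacently.
    private final Map<String, Group> groups = new LinkedHashMap<>();

    /** Called once per matching document; grouping happens during collection. */
    void collect(Hit hit) {
        Group g = groups.get(hit.groupValue);
        if (g == null) {
            groups.put(hit.groupValue, new Group(hit));
            return;
        }
        g.count++;
        if (hit.score > g.top.score) {
            g.top = hit; // keep only the best-scoring hit as the group head
        }
    }

    /** Groups ordered by the score of their best hit, highest first. */
    List<Group> topGroups() {
        List<Group> result = new ArrayList<>(groups.values());
        result.sort((a, b) -> Float.compare(b.top.score, a.top.score));
        return result;
    }
}
```

Because each hit is folded into its group as it arrives, no second search is needed to compute group heads or counts, which is the source of the speedup the comment claims over running multiple searches.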