Return-Path: Delivered-To: apmail-lucene-java-user-archive@www.apache.org Received: (qmail 76429 invoked from network); 16 Jun 2005 15:56:36 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (209.237.227.199) by minotaur.apache.org with SMTP; 16 Jun 2005 15:56:36 -0000 Received: (qmail 62416 invoked by uid 500); 16 Jun 2005 15:56:29 -0000 Delivered-To: apmail-lucene-java-user-archive@lucene.apache.org Received: (qmail 62385 invoked by uid 500); 16 Jun 2005 15:56:28 -0000 Mailing-List: contact java-user-help@lucene.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: java-user@lucene.apache.org Delivered-To: mailing list java-user@lucene.apache.org Received: (qmail 62369 invoked by uid 99); 16 Jun 2005 15:56:28 -0000 X-ASF-Spam-Status: No, hits=0.0 required=10.0 tests= X-Spam-Check-By: apache.org Received-SPF: pass (hermes.apache.org: local policy) Received: from plesk2.axarnet.com (HELO plesk1.axarnet.com) (213.200.101.2) by apache.org (qpsmtpd/0.28) with ESMTP; Thu, 16 Jun 2005 08:56:27 -0700 Received: (qmail 8750 invoked from network); 16 Jun 2005 15:55:22 -0000 Received: from 172.red-217-127-83.pooles.rima-tde.net (HELO ?192.168.1.5?) (217.127.83.172) by iil.es with SMTP; 16 Jun 2005 15:55:22 -0000 Message-ID: <42B1A106.90202@xeridia.com> Date: Thu, 16 Jun 2005 17:55:50 +0200 From: =?ISO-8859-1?Q?Diego_Manilla_Su=E1rez?= User-Agent: Mozilla Thunderbird 0.9 (Windows/20041103) X-Accept-Language: en-us, en MIME-Version: 1.0 To: java-user@lucene.apache.org Subject: Grouping search results Content-Type: text/plain; charset=ISO-8859-1; format=flowed Content-Transfer-Encoding: 7bit X-Virus-Checked: Checked X-Spam-Rating: minotaur.apache.org 1.6.2 0/1000/N Hi! I'm trying to group the search results, just like when Google shows sub-results within the same domain than the main result. In my case, I need to index contents and their attached files. The ideal behaviour would be that, if there is a match in one of the associated files, the main result would be the content that file is attached to, and below it, the link to the file. If there are matches in several documents attached to the same content and/or the content itself, they all would have to be shown under their parent content. Example: the content "MyContent" has 3 pdf's attached to it. The user queries Lucene and there are matches in MyContent and 2 of its attached files. The result would be: - "My Content title" - link to file 1 - link to file 3 (file 2 didn't have any matches) - Subsequent matches... Now, I don't know what's the best way to implement this using Lucene. I suppose that some kind of post-processing is needed, so I would appreciate any tips about how to minimize the overhead. Thanks in advance. --------------------------------------------------------------------- To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org For additional commands, e-mail: java-user-help@lucene.apache.org