From lucene-user-return-9015-apmail-jakarta-lucene-user-archive=jakarta.apache.org@jakarta.apache.org Thu Jul 01 14:16:06 2004 Return-Path: Delivered-To: apmail-jakarta-lucene-user-archive@www.apache.org Received: (qmail 36810 invoked from network); 1 Jul 2004 14:16:06 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (209.237.227.199) by minotaur-2.apache.org with SMTP; 1 Jul 2004 14:16:06 -0000 Received: (qmail 85378 invoked by uid 500); 1 Jul 2004 14:16:00 -0000 Delivered-To: apmail-jakarta-lucene-user-archive@jakarta.apache.org Received: (qmail 85235 invoked by uid 500); 1 Jul 2004 14:15:59 -0000 Mailing-List: contact lucene-user-help@jakarta.apache.org; run by ezmlm Precedence: bulk List-Unsubscribe: List-Subscribe: List-Help: List-Post: List-Id: "Lucene Users List" Reply-To: "Lucene Users List" Delivered-To: mailing list lucene-user@jakarta.apache.org Received: (qmail 85196 invoked by uid 99); 1 Jul 2004 14:15:58 -0000 X-ASF-Spam-Status: No, hits=1.2 required=10.0 tests=HTML_40_50,HTML_MESSAGE X-Spam-Check-By: apache.org Received: from [64.65.202.3] (HELO contextmedia.com) (64.65.202.3) by apache.org (qpsmtpd/0.27.1) with ESMTP; Thu, 01 Jul 2004 07:15:51 -0700 Received: from ppeddi ([192.168.1.44]) by contextmedia.com ; Thu, 01 Jul 2004 10:14:02 -0400 Message-ID: <037501c45f75$e679f0a0$aa87a8c0@ppeddi> From: "Praveen Peddi" To: "lucenelist" Subject: Sorting and tokenization Date: Thu, 1 Jul 2004 10:15:45 -0400 MIME-Version: 1.0 Content-Type: multipart/alternative; boundary="----=_NextPart_000_0372_01C45F54.5F59F8C0" X-Priority: 3 X-MSMail-Priority: Normal X-Mailer: Microsoft Outlook Express 6.00.2800.1409 X-MimeOLE: Produced By Microsoft MimeOLE V6.00.2800.1409 X-Rcpt-To: X-Virus-Checked: Checked X-Spam-Rating: minotaur-2.apache.org 1.6.2 0/1000/N ------=_NextPart_000_0372_01C45F54.5F59F8C0 Content-Type: text/plain; charset="iso-8859-1" Content-Transfer-Encoding: quoted-printable Hello all, Now that lucene 1.4 rc3 has sorting functionality built in, I am adding = sorting functionality to our searching. Before posting any question to = this mailing list, I have been going thru most of the email responses in = this mailing list related to sorting. I have found that I cannot = tokenize the fields that I want to sort on. Lets take the example I have. I use lucene 1.3 final for searching. Sorting is in fact a very = important feature in our application. But we found that lucene does not = support out of box, we had to implement sorting by score and doc id = programatically which is kind of useless for us. So I thought lucene's = new sorting feature will best suit now. But unfortunately, the field = called "title" is tokenized currently. And this is done purposefully = because users would want to search partial matches (or rather search on = multiple words of the title). So if we make it un tokenized we may lose = an improtant functionality. My question is, is there any way I can achieve sorting the objects by = title and keeping title as tokenized? Thanks in advance. Praveen **************************************************************=20 Praveen Peddi Sr Software Engg, Context Media, Inc.=20 email:ppeddi@contextmedia.com=20 Tel: 401.854.3475=20 Fax: 401.861.3596=20 web: http://www.contextmedia.com=20 **************************************************************=20 Context Media- "The Leader in Enterprise Content Integration"=20 ------=_NextPart_000_0372_01C45F54.5F59F8C0--