Date: Fri, 3 Nov 2006 13:56:53 -0500
From: "Peter Keegan"
To: java-user@lucene.apache.org
Subject: Re: Announcement: Lucene powering Monster job search index (Beta)
In-Reply-To: <1c36b0de0611030801u320dff57k1a470eb93c239441@mail.gmail.com>

Daniel,

Yes, this is correct if you happen to be doing a radius search and sorting
by mileage.

Peter

On 11/3/06, Daniel Rosher wrote:
>
> Hi Peter,
>
> Does this mean you are calculating the Euclidean distance twice ... once
> for the HitCollector to filter 'out of range' documents, and then again
> for the custom Comparator to sort the returned documents, especially
> since the filtering is done outside Lucene?
>
> Regards,
> Dan
>
> >Joe,
> >
> >Fields with numeric values are stored in a separate file as binary
> >values in an internal format. Lucene is unaware of this file and unaware
> >of the range expression in the query. The range expression is parsed
> >outside of Lucene and used in a custom HitCollector to filter out
> >documents that aren't in the requested range(s). A goal was to do this
> >without having to modify Lucene. Our scheme is pretty efficient, but not
> >very general purpose in its current form.
> >
> >Peter
> >
> >
> >On 10/30/06, Joe Shaw wrote:
> >>
> >> Hi Peter,
> >>
> >> On Fri, 2006-10-27 at 15:29 -0400, Peter Keegan wrote:
> >> > Numeric range search is one of Lucene's weak points
> >> > (performance-wise) so we have implemented this with a custom
> >> > HitCollector and an extension to the Lucene index files that stores
> >> > the numeric field values for all documents.
> >> >
> >> > It is important to point out that this has all been implemented
> >> > with the stock Lucene 2.0 library. No code changes were made to the
> >> > Lucene core.
> >>
> >> Can you give some technical details on the extension to the Lucene
> >> index files? How did you do it without making any changes to the
> >> Lucene core?
> >>
> >> Thanks,
> >> Joe
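
The exchange above describes a radius search in which the distance is computed once to filter hits and again to sort them. A minimal sketch of that two-step pattern follows, in plain Java with no Lucene dependency. Everything in it is an assumption made for illustration: the class name RadiusSearchSketch, the in-memory coordinate arrays standing in for the separate numeric file, and the collect(doc, score) method, which only imitates the shape of a HitCollector callback. It is not the code discussed in the thread.

```java
import java.util.ArrayList;
import java.util.Comparator;
import java.util.List;

/**
 * Hypothetical sketch: filter hits by radius in a collector-style callback,
 * then sort the survivors by distance. Not Lucene API and not the thread's code.
 */
final class RadiusSearchSketch {
    // Assumed stand-in for the external numeric store: one (x, y) pair per doc id.
    private final double[] xs;
    private final double[] ys;
    private final double centerX;
    private final double centerY;
    private final double radius;
    private final List<Integer> inRange = new ArrayList<>();

    RadiusSearchSketch(double[] xs, double[] ys,
                       double centerX, double centerY, double radius) {
        this.xs = xs;
        this.ys = ys;
        this.centerX = centerX;
        this.centerY = centerY;
        this.radius = radius;
    }

    /** Euclidean distance from the search center to the given document. */
    double distance(int doc) {
        return Math.hypot(xs[doc] - centerX, ys[doc] - centerY);
    }

    /** First distance computation: drop documents outside the radius. */
    void collect(int doc, float score) {
        if (distance(doc) <= radius) {
            inRange.add(doc);
        }
    }

    /** Second distance computation: the comparator recomputes it while sorting. */
    List<Integer> sortedByDistance() {
        List<Integer> sorted = new ArrayList<>(inRange);
        sorted.sort(Comparator.comparingDouble(doc -> distance(doc)));
        return sorted;
    }
}
```

Keeping the distance next to the doc id at collect time would let the sort reuse it instead of recomputing; the thread does not say whether that was considered.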