Return-Path: Delivered-To: apmail-lucene-java-user-archive@www.apache.org Received: (qmail 17012 invoked from network); 5 Apr 2005 19:59:55 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (209.237.227.199) by minotaur.apache.org with SMTP; 5 Apr 2005 19:59:55 -0000 Received: (qmail 32316 invoked by uid 500); 5 Apr 2005 19:59:46 -0000 Delivered-To: apmail-lucene-java-user-archive@lucene.apache.org Received: (qmail 32288 invoked by uid 500); 5 Apr 2005 19:59:46 -0000 Mailing-List: contact java-user-help@lucene.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: java-user@lucene.apache.org Delivered-To: mailing list java-user@lucene.apache.org Received: (qmail 32275 invoked by uid 99); 5 Apr 2005 19:59:46 -0000 X-ASF-Spam-Status: No, hits=0.4 required=10.0 tests=DNS_FROM_RFC_ABUSE,RCVD_BY_IP,SPF_HELO_PASS,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (hermes.apache.org: domain of yseeley@gmail.com designates 64.233.170.195 as permitted sender) Received: from rproxy.gmail.com (HELO rproxy.gmail.com) (64.233.170.195) by apache.org (qpsmtpd/0.28) with ESMTP; Tue, 05 Apr 2005 12:59:44 -0700 Received: by rproxy.gmail.com with SMTP id b11so1620902rne for ; Tue, 05 Apr 2005 12:59:40 -0700 (PDT) DomainKey-Signature: a=rsa-sha1; q=dns; c=nofws; s=beta; d=gmail.com; h=received:message-id:date:from:reply-to:to:subject:in-reply-to:mime-version:content-type:content-transfer-encoding:references; b=flzQxPN28wXhVdNWFMFKXSSHikFNwLbBF+aF92vkTLXeF0zyAcVELgzm3nqE0vvNyptZiCunbWw3ntXTIaNDEdumikfvnP2rpVvalItzVNG5YXaTnQVh8V/ccgWN4ZUN9MTV89uSPOIeE0x4tdAvrbfJCIlX64OlkE3BFb08lsk= Received: by 10.38.67.50 with SMTP id p50mr6769180rna; Tue, 05 Apr 2005 12:59:39 -0700 (PDT) Received: by 10.38.12.53 with HTTP; Tue, 5 Apr 2005 12:59:39 -0700 (PDT) Message-ID: Date: Tue, 5 Apr 2005 15:59:39 -0400 From: Yonik Seeley Reply-To: Yonik Seeley To: java-user@lucene.apache.org Subject: Re: QueryParser: open ended range queries In-Reply-To: <029071497e041081e3097248b0c49cc9@ehatchersolutions.com> Mime-Version: 1.0 Content-Type: text/plain; charset=ISO-8859-1 Content-Transfer-Encoding: quoted-printable References: <029071497e041081e3097248b0c49cc9@ehatchersolutions.com> X-Virus-Checked: Checked X-Spam-Rating: minotaur.apache.org 1.6.2 0/1000/N For numeric fields, this will never happen. For text fields, I could either 1) just use the first token generated (yuck) 2) don't run it through the analyzer (v1.0) 3) run it through an analyzer specific to range and prefix queries (post = v1.0) Since I know the schema, I can pick and choose different methods for different field types. Generic lucene isn't as lucky and has to guess (hence the ugly try-to-parse-as-a-date code). An example of why option3 may be needed: consider the recently posted ISOLatinFilter that stripps accents. If one indexes text:appl=E9, and it gets indexed as text:apple, then a range query of text:[appl=E9 TO orange] won't find that document. Of course you just can't run it through the normal analyzer either since then text:[a to z] probably won't work (a will get stopped out, etc). Also, the normal analyzer may expand things into synonyms, etc. -Yonik On Apr 5, 2005 3:43 PM, Erik Hatcher wrote: >=20 > On Apr 5, 2005, at 2:49 PM, Yonik Seeley wrote: > > Just curious. I plan on overriding the current getRangeQuery() anyway > > since it currently doesn't run the endpoints through the analyzer. >=20 > What will you do when multiple tokens are returned from the analyzer? >=20 > Erik --------------------------------------------------------------------- To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org For additional commands, e-mail: java-user-help@lucene.apache.org