Mailing-List: contact java-user-help@lucene.apache.org; run by ezmlm
Precedence: bulk
Reply-To: java-user@lucene.apache.org
Message-ID: <c68e391705040512593880b3e2@mail.gmail.com>
Date: Tue, 5 Apr 2005 15:59:39 -0400
From: Yonik Seeley <yseeley@gmail.com>
Reply-To: Yonik Seeley <yseeley@gmail.com>
To: java-user@lucene.apache.org
Subject: Re: QueryParser: open ended range queries
In-Reply-To: <029071497e041081e3097248b0c49cc9@ehatchersolutions.com>
References: <c68e391705040511493f63b4a5@mail.gmail.com>
	 <029071497e041081e3097248b0c49cc9@ehatchersolutions.com>

For numeric fields, this will never happen.
For text fields, I could either
  1) just use the first token generated (yuck),
  2) not run it through the analyzer (v1.0), or
  3) run it through an analyzer specific to range and prefix queries (post v1.0).

Since I know the schema, I can pick and choose different methods for
different field types.  Generic Lucene isn't as lucky and has to guess
(hence the ugly try-to-parse-as-a-date code).

An example of why option 3 may be needed: consider the recently posted
ISOLatinFilter that strips accents.  If one indexes text:applé, and
it gets indexed as text:apple, then a range query of text:[applé TO
orange] won't find that document, since "apple" sorts before the
unanalyzed lower bound "applé".
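
To make the accent problem concrete, here is a rough sketch in plain Java rather than Lucene. It assumes terms compare the way Java strings do (String.compareTo), and it uses java.text.Normalizer as a hypothetical stand-in for the accent-stripping filter; the class and method names are made up for illustration.

```java
import java.text.Normalizer;

final class AccentRangeDemo {
    // Hypothetical stand-in for an accent-stripping filter:
    // decompose to NFD, then drop the combining marks.
    static String stripAccents(String s) {
        String decomposed = Normalizer.normalize(s, Normalizer.Form.NFD);
        return decomposed.replaceAll("\\p{M}", "");
    }

    // Inclusive range check, comparing terms with String.compareTo.
    static boolean inRange(String term, String lower, String upper) {
        return term.compareTo(lower) >= 0 && term.compareTo(upper) <= 0;
    }

    public static void main(String[] args) {
        // "appl\u00e9" is "apple" with an acute accent on the last letter.
        String indexed = stripAccents("appl\u00e9");

        // Raw lower bound: the indexed term sorts before it, so no match.
        System.out.println(inRange(indexed, "appl\u00e9", "orange"));

        // Lower bound run through the same accent stripping: match.
        System.out.println(inRange(indexed, stripAccents("appl\u00e9"), "orange"));
    }
}
```

The second call is roughly what a range-specific analyzer would buy you: the endpoint gets the same character-level normalization as the indexed terms.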

Of course you can't just run it through the normal analyzer either,
since then text:[a TO z] probably won't work ("a" will get stopped out,
etc).  Also, the normal analyzer may expand things into synonyms, etc.
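
A toy illustration of the stop word problem, again in plain Java and not tied to any Lucene API. The stop list, the "normal" analyzer, and the range-endpoint analyzer below are all hypothetical; the point is only that an analyzer which drops tokens can leave a range endpoint with nothing at all.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.HashSet;
import java.util.List;
import java.util.Locale;
import java.util.Set;

final class StopWordEndpointDemo {
    // Hypothetical stop list; a real analyzer would bring its own.
    static final Set<String> STOP_WORDS =
            new HashSet<>(Arrays.asList("a", "an", "the", "to"));

    // Toy "normal" analyzer: lowercase, split on whitespace, drop stop words.
    static List<String> normalAnalyze(String text) {
        List<String> tokens = new ArrayList<>();
        for (String t : text.toLowerCase(Locale.ROOT).trim().split("\\s+")) {
            if (!t.isEmpty() && !STOP_WORDS.contains(t)) {
                tokens.add(t);
            }
        }
        return tokens;
    }

    // Toy range-endpoint analyzer: same lowercasing, but it never
    // removes or expands tokens, so one endpoint in means one term out.
    static String rangeAnalyze(String endpoint) {
        return endpoint.toLowerCase(Locale.ROOT).trim();
    }

    public static void main(String[] args) {
        // The lower bound of [a TO z] disappears under the normal analyzer...
        System.out.println(normalAnalyze("a"));
        // ...but survives the range-specific one.
        System.out.println(rangeAnalyze("a"));
    }
}
```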

-Yonik


On Apr 5, 2005 3:43 PM, Erik Hatcher <erik@ehatchersolutions.com> wrote:
>
> On Apr 5, 2005, at 2:49 PM, Yonik Seeley wrote:
> > Just curious.  I plan on overriding the current getRangeQuery() anyway
> > since it currently doesn't run the endpoints through the analyzer.
>
> What will you do when multiple tokens are returned from the analyzer?
>
>         Erik
