lucene-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Brian Goetz <br...@quiotix.com>
Subject Re: searching words starting with accent characters using UTF-8
Date Sat, 29 Dec 2001 22:22:15 GMT
> I find it is also necessary to adapt the QueryParser for accented
> characters. My approach is
> simply to add é,è,ç,à,... into the the #_TERM_CHAR and #_TERM_START_CHAR
> character
> sets. My question is: what is the purpose of adding in all the characters:
> "\u0080"-"\uFFFE" which
> I find in the current source?

So that we didn't have to add them one by one, when we invariably missed
one.

--
To unsubscribe, e-mail:   <mailto:lucene-dev-unsubscribe@jakarta.apache.org>
For additional commands, e-mail: <mailto:lucene-dev-help@jakarta.apache.org>


Mime
View raw message