lucene-java-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Bill Janssen <>
Subject Re: Lucene does NOT use UTF-8.
Date Sat, 27 Aug 2005 18:56:16 GMT
Thanks for pointing this out, Marvin.  I wish Sun (or someone) would
document and register this particular character set encoding with
IANA, so that it could be used outside of Java.  As it stands now,
it's essentially a bastard encoding, good for nothing, and one of the
warts of Java.

Lucene probably shouldn't be using it in its file formats.


To unsubscribe, e-mail:
For additional commands, e-mail:

View raw message