lucene-java-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Dave Kor <s0454...@sms.ed.ac.uk>
Subject Re: Lucene does NOT use UTF-8.
Date Sun, 28 Aug 2005 22:23:52 GMT

http://java.sun.com/docs/books/tutorial/i18n/text/stream.html

Yes, its confusing. Sun calls its own encoding format as "Unicode" and the above
webpage talks about how to convert between Java's Unicode format and the UTF-8
format.

Its just a matter of specifying "UTF-8" when creating output streams. I may have
remembered wrongly, but I do seem to recall old Lucene code that was designed to
run on JDK 1.1 actually did this "UTF-8" conversion.

---------------------------------------------------------------------
To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-user-help@lucene.apache.org


Mime
View raw message