lucene-dev mailing list archives

From Daniel Naber <lucenel...@danielnaber.de>
Subject Re: Lucene does NOT use UTF-8
Date Mon, 29 Aug 2005 21:56:01 GMT
On Monday 29 August 2005 19:56, Ken Krugler wrote:

> "Lucene writes strings as a VInt representing the length of the
> string in Java chars (UTF-16 code units), followed by the character
> data."

But wouldn't UTF-16 mean 2 bytes per character? That doesn't seem to be the 
case.
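
One possible reading that would reconcile the two: the VInt length prefix
counts Java chars, while the character data itself is written in a
variable-width form. Below is a minimal sketch of that idea, assuming the
character data uses a 1 to 3 byte per char encoding along the lines of Java's
"modified UTF-8". The class and method names are hypothetical and are not
taken from the Lucene source.

```java
import java.io.ByteArrayOutputStream;

public class VIntStringSketch {
    // Variable-length int: 7 payload bits per byte, high bit set when more bytes follow.
    static void writeVInt(ByteArrayOutputStream out, int value) {
        while ((value & ~0x7F) != 0) {
            out.write((value & 0x7F) | 0x80);
            value >>>= 7;
        }
        out.write(value);
    }

    // Assumed layout: length in Java chars (UTF-16 code units) as a VInt,
    // then each char encoded on its own in 1, 2, or 3 bytes.
    static byte[] writeString(String s) {
        ByteArrayOutputStream out = new ByteArrayOutputStream();
        writeVInt(out, s.length());
        for (int i = 0; i < s.length(); i++) {
            int c = s.charAt(i);
            if (c >= 0x01 && c <= 0x7F) {
                out.write(c);                          // 1 byte
            } else if (c <= 0x7FF) {
                out.write(0xC0 | (c >> 6));            // 2 bytes (also used for char 0)
                out.write(0x80 | (c & 0x3F));
            } else {
                out.write(0xE0 | (c >> 12));           // 3 bytes, surrogates included
                out.write(0x80 | ((c >> 6) & 0x3F));
                out.write(0x80 | (c & 0x3F));
            }
        }
        return out.toByteArray();
    }

    public static void main(String[] args) {
        System.out.println(writeString("abc").length);          // 4: 1 length byte + 3 data bytes
        System.out.println(writeString("\u00e9").length);       // 3: 1 length byte + 2 data bytes
        System.out.println(writeString("\uD834\uDD1E").length); // 7: length is 2 chars, 3 bytes each
    }
}
```

Under these assumptions the prefix is a count of UTF-16 code units, but the
bytes that follow are not UTF-16, which would explain why the data is not 2
bytes per character. The last case also shows why the result would differ
from standard UTF-8: a character outside the BMP is written as two 3-byte
surrogates instead of one 4-byte sequence.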

Regards
 Daniel

-- 
http://www.danielnaber.de
