accumulo-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Michael Flester <fles...@gmail.com>
Subject Re: Setting Charset in getBytes() call.
Date Mon, 29 Oct 2012 19:14:28 GMT
> UTF-8 should always be present (according to the JLS), and as a multi-byte
> format should be able to encode any character that you would need to.
>

UTF-8 cannot encode arbitrary data. All data that we store in accumulo
is not characters. A safe encoding to use as a pass through when you
don't know if you are dealing with characters is ISO-8859-1 since we know
that we can make the round trip from bytes to string to bytes without loss.

Mime
  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message