orc-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Owen O'Malley" <omal...@apache.org>
Subject Re: "For dictionary encodings the dictionary is sorted"
Date Sat, 24 Dec 2016 21:46:56 GMT
On Mon, Dec 12, 2016 at 4:48 PM, Dain Sundstrom <dain@iq80.com> wrote:


> I meant that the sorting of the dictionary seems to be UTF-16 BE.  Is that
> not correct?


I believe the sorting of the dictionary is UTF-8, because the red-black
tree takes the binary representation from Hadoop's Text instead of Java's
String.

.. Owen

Mime
  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message