hive-user mailing list archives

From bc Wong
Subject Hive support for latin1
Date Mon, 02 Aug 2010 03:31:07 GMT
Hi all,

I'm trying to figure out how to query latin1-encoded data in Hive.

I created a file with 256 characters, Unicode code points 0-255,
encoded in latin1, and made a table out of it. But when I do a
"select *", Hive returns the rows for values 128-255 as
'\xef\xbf\xbd', which is the replacement character '\ufffd' encoded
in UTF-8.
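
To illustrate what I think is going on, here is a minimal sketch in
plain Python (not Hive). It assumes the bytes are being decoded as
UTF-8 somewhere along the way, which I have not confirmed; the helper
name decode_byte is just made up for the example.

```python
def decode_byte(value: int, encoding: str) -> str:
    """Decode a single byte, substituting U+FFFD for undecodable input."""
    return bytes([value]).decode(encoding, errors="replace")


# In latin1, every byte 0x00-0xFF maps to the code point of the same value.
latin1_ok = all(decode_byte(v, "latin-1") == chr(v) for v in range(256))

# Assumption: the reader treats the data as UTF-8. A lone byte >= 0x80 is
# never a valid UTF-8 sequence, so each one becomes the replacement character.
utf8_upper = [decode_byte(v, "utf-8") for v in range(128, 256)]

print(latin1_ok)
print(all(ch == "\ufffd" for ch in utf8_upper))
print("\ufffd".encode("utf-8"))
```

The lower half (0-127) is plain ASCII and decodes the same either way,
which would match only the upper rows being affected.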

Does anyone know how to work with non-UTF8 data?

bc Wong
Cloudera Software Engineer
