hadoop-pig-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Olga Natkovich (JIRA)" <j...@apache.org>
Subject [jira] Created: (PIG-63) PigStorage does not properly handle UTF8 data
Date Tue, 15 Jan 2008 00:36:34 GMT
PigStorage does not properly handle UTF8 data

                 Key: PIG-63
                 URL: https://issues.apache.org/jira/browse/PIG-63
             Project: Pig
          Issue Type: Bug
            Reporter: Olga Natkovich

>From Ben:

I just checked the code and the problem seems to be PigStorage. getNext() uses
readLine() which does not handle UTF8 correctly. putNext() also uses default encoder rather
than UTF8 explicitly.

Internally and in BinStorage UTF8 appears to be handled correctly.

This message is automatically generated by JIRA.
You can reply to this email to add a comment to the issue online.

View raw message