hadoop-hive-dev mailing list archives

From "Zheng Shao (JIRA)" <j...@apache.org>
Subject [jira] Commented: (HIVE-136) SerDe should escape some special characters
Date Thu, 05 Feb 2009 20:35:59 GMT

    [ https://issues.apache.org/jira/browse/HIVE-136?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12670894#action_12670894 ]

Zheng Shao commented on HIVE-136:
---------------------------------

In HIVE-270 I put a link to how MySQL does escaping. That seems to be a clean solution, although
it won't work with string.split, or even with splitting lines on "\n".
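
To illustrate the trade-off, here is a minimal sketch of prefix-style escaping, where the escape character is placed in front of each special character. It is not code from Hive or from HIVE-270. The names `escape_field` and `split_escaped` are hypothetical, and the choice of backslash as the escape character and "\005" as the column separator is an assumption made for the example. It shows that a plain split still cuts at escaped separators and newlines, while an escape-aware scan recovers the original columns.

```python
ESC = "\\"    # escape character (assumed for this sketch)
SEP = "\x05"  # column separator; "\005" is the separator discussed in this thread
EOL = "\n"    # row terminator


def escape_field(value):
    """Prefix every escape, separator, and newline character with ESC."""
    out = []
    for ch in value:
        if ch in (ESC, SEP, EOL):
            out.append(ESC)
        out.append(ch)
    return "".join(out)


def split_escaped(row):
    """Split a row on unescaped SEP and drop the escape prefixes."""
    fields, cur, i = [], [], 0
    while i < len(row):
        ch = row[i]
        if ch == ESC and i + 1 < len(row):
            cur.append(row[i + 1])  # keep the escaped character literally
            i += 2
        elif ch == SEP:
            fields.append("".join(cur))
            cur = []
            i += 1
        else:
            cur.append(ch)
            i += 1
    fields.append("".join(cur))
    return fields


columns = ["a\x05b", "line1\nline2", "back\\slash"]
row = SEP.join(escape_field(c) for c in columns)

naive = row.split(SEP)      # plain split also cuts at the escaped separator
lines = row.split(EOL)      # plain line splitting cuts at the escaped newline
aware = split_escaped(row)  # escape-aware scan recovers the original columns
```

Here `naive` has four pieces instead of three and `lines` has two rows instead of one, while `aware` equals `columns`. Any reader of such data has to scan for the escape character rather than rely on a plain split.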

If we replace "\005" with "\\" in your case, it won't work with the symmetric escaping/unescaping
logic either.

What about this? We explicitly allow users to enable/disable escaping. If the user disables
it, then we just do nothing (and the data may look corrupt to them, but that's the only thing
we can do). If the user enables it, then we do the escaping/unescaping logic as I mentioned
(not symmetric, but try to guess what the user really means).


> SerDe should escape some special characters
> -------------------------------------------
>
>                 Key: HIVE-136
>                 URL: https://issues.apache.org/jira/browse/HIVE-136
>             Project: Hadoop Hive
>          Issue Type: Bug
>          Components: Serializers/Deserializers
>            Reporter: Zheng Shao
>            Priority: Critical
>
> MetadataTypedColumnsetSerDe and DynamicSerDe should escape some special characters like
> '\n' or the column/item/key separator.
> Otherwise the data will look corrupted.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

