hadoop-hive-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Raghotham Murthy (JIRA)" <j...@apache.org>
Subject [jira] Commented: (HIVE-337) LazySimpleSerDe should support array and map types
Date Wed, 11 Mar 2009 21:48:50 GMT

    [ https://issues.apache.org/jira/browse/HIVE-337?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12681050#action_12681050
] 

Raghotham Murthy commented on HIVE-337:
---------------------------------------

I am not sure what you mean by null array.

Given an array column, ideally, we should distinguish between the following cases (I am repeating
them for clarity):

1. NULL - array column is null (is this what you mean by null array?)
2. [NULL] - array containing one element (NULL)
3. [''] - array containing one element (empty string)
4. [] - array containing no elements

Is there are plan for LazySimpleSerDe to support nested arrays? If so, we cant really have
a single delimiter for arrays and maps. We should introduce array begin and end markers in
the serialization format. Alternatively, we could store the number of bytes in the array before
the array column value itself.

> LazySimpleSerDe should support array and map types
> --------------------------------------------------
>
>                 Key: HIVE-337
>                 URL: https://issues.apache.org/jira/browse/HIVE-337
>             Project: Hadoop Hive
>          Issue Type: Bug
>          Components: Serializers/Deserializers
>    Affects Versions: 0.2.0
>            Reporter: Zheng Shao
>            Assignee: Zheng Shao
>            Priority: Blocker
>
> Once we do that, we can completely deprecate DynamicSerDe/TCTLSeparatedProtocol, and
close any bugs that DynamicSerDe/TCTLSeparatedProtocol has.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


Mime
View raw message