hive-issues mailing list archives

From "David Mollitor (Jira)" <j...@apache.org>
Subject [jira] [Commented] (HIVE-20827) Inconsistent results for empty arrays
Date Fri, 12 Jun 2020 16:35:00 GMT

    [ https://issues.apache.org/jira/browse/HIVE-20827?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17134363#comment-17134363 ]

David Mollitor commented on HIVE-20827:
---------------------------------------

[~teddy.choi] Would it be possible to backport this to branch-3 / branch-2?

> Inconsistent results for empty arrays
> -------------------------------------
>
>                 Key: HIVE-20827
>                 URL: https://issues.apache.org/jira/browse/HIVE-20827
>             Project: Hive
>          Issue Type: Bug
>            Reporter: Teddy Choi
>            Assignee: Teddy Choi
>            Priority: Major
>              Labels: pull-request-available
>             Fix For: 4.0.0
>
>         Attachments: HIVE-20827.1.patch
>
>          Time Spent: 20m
>  Remaining Estimate: 0h
>
> LazySimpleDeserializeRead parses empty arrays incorrectly. For example, given a text-file
> table with delimiter ',' and schema 'array<int>, array<array<string>>', the line ',' is read
> as [null], [[""]] instead of [], [] when running on the MapReduce engine with vectorized
> execution enabled. LazySimpleDeserializeRead contains the following code:
> {code:java}
> switch (complexField.complexCategory) {
> case LIST:
>   {
>     // Allow for empty string, etc.
>     final boolean isNext = (fieldPosition <= complexFieldEnd);
> {code}
> Reading an empty string as a value should apply only to the string family of types, not to
> other data types.
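
The effect of the `<=` comparison quoted above can be illustrated with a small standalone sketch. This is not Hive code: `EmptyArraySketch`, `parseIntList`, the `':'` element separator, and the `allowEmptyElement` flag are all hypothetical names chosen for illustration, and the sketch assumes an empty field is one whose start and end positions coincide. Under that assumption, `pos <= end` reports a first element even though nothing is there, which yields a single null for an int list.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical illustration only; none of these names exist in Hive.
class EmptyArraySketch {

    // Parses a separator-delimited field into a list of Integers.
    // allowEmptyElement = true mimics the quoted "<=" check on the first element;
    // allowEmptyElement = false uses a strict "<" so an empty field gives an empty list.
    static List<Integer> parseIntList(String field, char sep, boolean allowEmptyElement) {
        List<Integer> out = new ArrayList<>();
        int pos = 0;
        int end = field.length();
        boolean isNext = allowEmptyElement ? (pos <= end) : (pos < end);
        while (isNext) {
            int sepAt = field.indexOf(sep, pos);
            int elemEnd = (sepAt < 0) ? end : sepAt;
            String raw = field.substring(pos, elemEnd);
            // An empty element cannot be parsed as an int, so it becomes null.
            out.add(raw.isEmpty() ? null : Integer.valueOf(raw));
            pos = elemEnd + 1;
            // After a separator, a trailing empty element is still a real element.
            isNext = pos <= end;
        }
        return out;
    }

    public static void main(String[] args) {
        System.out.println(parseIntList("", ':', true));   // prints [null]
        System.out.println(parseIntList("", ':', false));  // prints []
    }
}
```

For a string element type the empty token would be kept as "" rather than null, which is consistent with the [[""]] result reported for the nested array column. Whether the actual fix takes this exact shape is not stated in the message above.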



--
This message was sent by Atlassian Jira
(v8.3.4#803005)
