hive-dev mailing list archives

From "Hive QA (JIRA)" <>
Subject [jira] [Commented] (HIVE-6711) ORC maps uses getMapSize() from MapOI which is unreliable
Date Sun, 23 Mar 2014 18:34:45 GMT


Hive QA commented on HIVE-6711:

{color:red}Overall{color}: -1, at least one test failed

Here are the results of testing the latest attachment:

{color:red}ERROR:{color} -1 due to 1 failed/errored test(s), 5440 tests executed
*Failed tests:*

Test results:
Console output:

Executing org.apache.hive.ptest.execution.PrepPhase
Executing org.apache.hive.ptest.execution.ExecutionPhase
Executing org.apache.hive.ptest.execution.ReportingPhase
Tests exited with: TestsFailedException: 1 tests failed

This message is automatically generated.


> ORC maps uses getMapSize() from MapOI which is unreliable
> ---------------------------------------------------------
>                 Key: HIVE-6711
>                 URL:
>             Project: Hive
>          Issue Type: Bug
>    Affects Versions: 0.11.0, 0.12.0, 0.13.0
>            Reporter: Prasanth J
>            Assignee: Prasanth J
>              Labels: orcfile
>             Fix For: 0.13.0, 0.14.0
>         Attachments: HIVE-6711.1.patch
> HIVE-6707 exposed a problem with map sizes. getMapSize() of LazyMap and LazyBinaryMap does not
deserialize the keys, so it does not count the number of unique keys. Because getMapSize() may
therefore count duplicate keys, the map length stored by ORC's map tree writer can be out of sync
with the actual map size. As a result, the RLE reader tries to read beyond the disk range, expecting
more map entries, and throws an exception.
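
The mismatch described above can be illustrated with a minimal, self-contained sketch. It does not use Hive or ORC classes: MapLengthSketch, rawEntryCount() and materialize() are hypothetical names, and keeping the first value for a duplicate key is an assumption made only for this illustration. The point is that a size taken from the serialized entries can exceed the number of entries actually written, so the recorded length should come from the materialized map.

```java
import java.util.AbstractMap.SimpleEntry;
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

// Illustration only: not Hive's LazyMap or ORC's writer.
final class MapLengthSketch {

    // Stand-in for a size computed without deserializing keys:
    // every serialized key/value pair is counted, duplicates included.
    static int rawEntryCount(List<Map.Entry<String, String>> serialized) {
        return serialized.size();
    }

    // Build the real map. Duplicate keys collapse into one entry
    // (assumption for this sketch: the first occurrence is kept).
    static Map<String, String> materialize(List<Map.Entry<String, String>> serialized) {
        Map<String, String> map = new LinkedHashMap<>();
        for (Map.Entry<String, String> e : serialized) {
            map.putIfAbsent(e.getKey(), e.getValue());
        }
        return map;
    }

    public static void main(String[] args) {
        List<Map.Entry<String, String>> serialized = new ArrayList<>();
        serialized.add(new SimpleEntry<>("a", "1"));
        serialized.add(new SimpleEntry<>("b", "2"));
        serialized.add(new SimpleEntry<>("a", "3")); // duplicate key

        int unreliableLength = rawEntryCount(serialized);
        Map<String, String> entries = materialize(serialized);
        int writtenEntries = entries.size();

        // A writer that stores unreliableLength but emits writtenEntries
        // makes the reader expect entries that were never written.
        System.out.println("length from raw count: " + unreliableLength);
        System.out.println("entries written:       " + writtenEntries);
    }
}
```

Recording the size of the materialized map, rather than the raw count, keeps the stored length equal to the number of entries that follow it.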

This message was sent by Atlassian JIRA
