hive-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Hive QA (JIRA)" <>
Subject [jira] [Commented] (HIVE-6382) PATCHED_BLOB encoding in ORC will corrupt data in some cases
Date Thu, 20 Feb 2014 15:35:19 GMT


Hive QA commented on HIVE-6382:

{color:red}Overall{color}: -1 at least one tests failed

Here are the results of testing the latest attachment:

{color:red}ERROR:{color} -1 due to 1 failed/errored test(s), 5172 tests executed
*Failed tests:*

Test results:
Console output:

Executing org.apache.hive.ptest.execution.PrepPhase
Executing org.apache.hive.ptest.execution.ExecutionPhase
Executing org.apache.hive.ptest.execution.ReportingPhase
Tests exited with: TestsFailedException: 1 tests failed

This message is automatically generated.


> PATCHED_BLOB encoding in ORC will corrupt data in some cases
> ------------------------------------------------------------
>                 Key: HIVE-6382
>                 URL:
>             Project: Hive
>          Issue Type: Bug
>          Components: Serializers/Deserializers
>    Affects Versions: 0.13.0
>            Reporter: Prasanth J
>            Assignee: Prasanth J
>              Labels: orcfile
>         Attachments: HIVE-6382.1.patch, HIVE-6382.2.patch, HIVE-6382.3.patch, HIVE-6382.4.patch,
HIVE-6382.5.patch, HIVE-6382.6.patch
> In PATCHED_BLOB encoding (added in HIVE-4123), gapVsPatchList is an array of long that
stores gap (g) between the values that are patched and the patch value (p). The maximum distance
of gap can be 511 that require 8 bits to encode. And patch values can take more than 56 bits.
When patch values take more than 56 bits, p + g will become > 64 bits which cannot be packed
to a long. This will result in data corruption under the case where patch values are >
56 bits. 

This message was sent by Atlassian JIRA

View raw message