hive-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Prasanth J (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (HIVE-4123) The RLE encoding for ORC can be improved
Date Fri, 08 Aug 2014 06:06:12 GMT

    [ https://issues.apache.org/jira/browse/HIVE-4123?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14090350#comment-14090350
] 

Prasanth J commented on HIVE-4123:
----------------------------------

Please go ahead and update the original description. 
At this point the only possible valid values are 0.11 and 0.12. As you had mentioned if the
parameter is not defined or defined wrongly it will use the default 0.12 encoding. 

bq. Is that accurate? Can releases be specified as "0.12.0" or "0.13.1"?
Yes. Accurate. HIVE-6002 was trying to add patch number to the write version so that numbers
can be specified as 0.12.1. But I don't think it will be committed until next major change
to ORC writer.

> The RLE encoding for ORC can be improved
> ----------------------------------------
>
>                 Key: HIVE-4123
>                 URL: https://issues.apache.org/jira/browse/HIVE-4123
>             Project: Hive
>          Issue Type: New Feature
>          Components: File Formats
>    Affects Versions: 0.12.0
>            Reporter: Owen O'Malley
>            Assignee: Prasanth J
>              Labels: TODOC12, orcfile
>             Fix For: 0.12.0
>
>         Attachments: HIVE-4123-8.patch, HIVE-4123.1.git.patch.txt, HIVE-4123.2.git.patch.txt,
HIVE-4123.3.patch.txt, HIVE-4123.4.patch.txt, HIVE-4123.5.txt, HIVE-4123.6.txt, HIVE-4123.7.txt,
HIVE-4123.8.txt, HIVE-4123.8.txt, HIVE-4123.patch.txt, ORC-Compression-Ratio-Comparison.xlsx
>
>
> The run length encoding of integers can be improved:
> * tighter bit packing
> * allow delta encoding
> * allow longer runs



--
This message was sent by Atlassian JIRA
(v6.2#6252)

Mime
View raw message