hive-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Gopal V (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (HIVE-13291) ORC BI Split strategy should consider block size instead of file size
Date Wed, 16 Mar 2016 04:03:33 GMT

    [ https://issues.apache.org/jira/browse/HIVE-13291?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15196717#comment-15196717
] 

Gopal V commented on HIVE-13291:
--------------------------------

Left some minor comments about that loop.

Approach LGTM - +1, tests pending.

> ORC BI Split strategy should consider block size instead of file size
> ---------------------------------------------------------------------
>
>                 Key: HIVE-13291
>                 URL: https://issues.apache.org/jira/browse/HIVE-13291
>             Project: Hive
>          Issue Type: Bug
>          Components: ORC
>    Affects Versions: 2.1.0
>            Reporter: Gopal V
>            Assignee: Prasanth Jayachandran
>         Attachments: HIVE-13291.1.patch, HIVE-13291.2.patch
>
>
> When we force split strategy to use "BI" (using hive.exec.orc.split.strategy), entire
file is considered as single split. This might be inefficient when the files are large. Instead,
BI should consider splitting at block boundary. 



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Mime
View raw message