hive-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Prasanth Jayachandran (JIRA)" <j...@apache.org>
Subject [jira] [Updated] (HIVE-13291) ORC BI Split strategy should consider block size instead of file size
Date Wed, 23 Mar 2016 06:54:25 GMT

     [ https://issues.apache.org/jira/browse/HIVE-13291?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]

Prasanth Jayachandran updated HIVE-13291:
-----------------------------------------
    Attachment: HIVE-13291-branch-1.patch

Committed the attached patch to branch-1.

> ORC BI Split strategy should consider block size instead of file size
> ---------------------------------------------------------------------
>
>                 Key: HIVE-13291
>                 URL: https://issues.apache.org/jira/browse/HIVE-13291
>             Project: Hive
>          Issue Type: Bug
>          Components: ORC
>    Affects Versions: 2.1.0
>            Reporter: Gopal V
>            Assignee: Prasanth Jayachandran
>             Fix For: 1.3.0, 2.1.0
>
>         Attachments: HIVE-13291-branch-1.patch, HIVE-13291.1.patch, HIVE-13291.2.patch,
HIVE-13291.3.patch
>
>
> When we force split strategy to use "BI" (using hive.exec.orc.split.strategy), entire
file is considered as single split. This might be inefficient when the files are large. Instead,
BI should consider splitting at block boundary. 



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Mime
View raw message