hive-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Sergey Shelukhin (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (HIVE-11777) implement an option to have single ETL strategy for multiple directories
Date Thu, 12 Nov 2015 19:18:11 GMT

    [ https://issues.apache.org/jira/browse/HIVE-11777?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15002689#comment-15002689
] 

Sergey Shelukhin commented on HIVE-11777:
-----------------------------------------

Hmm. I'll see if it's easy to add unit tests.

> implement an option to have single ETL strategy for multiple directories
> ------------------------------------------------------------------------
>
>                 Key: HIVE-11777
>                 URL: https://issues.apache.org/jira/browse/HIVE-11777
>             Project: Hive
>          Issue Type: Bug
>            Reporter: Sergey Shelukhin
>            Assignee: Sergey Shelukhin
>         Attachments: HIVE-11777.01.patch, HIVE-11777.02.patch, HIVE-11777.03.patch, HIVE-11777.04.patch,
HIVE-11777.05.patch, HIVE-11777.patch
>
>
> In case of metastore footer PPD we don't want to call PPD call with all attendant SARG,
MS and HBase overhead for each directory. If we wait for some time (10ms? some fraction of
inputs?) we can do one call without losing overall perf. 
> For now make it time based.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Mime
View raw message