hive-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Rui Li (JIRA)" <>
Subject [jira] [Commented] (HIVE-9367) CombineFileInputFormatShim#getDirIndices is expensive
Date Wed, 14 Jan 2015 02:54:35 GMT


Rui Li commented on HIVE-9367:

Hi [~jxiang], could you elaborate a little how this will avoid the expensive calls? Seems
we still have to iterate all the file statuses to check if it's a directory?

> CombineFileInputFormatShim#getDirIndices is expensive
> -----------------------------------------------------
>                 Key: HIVE-9367
>                 URL:
>             Project: Hive
>          Issue Type: Improvement
>            Reporter: Jimmy Xiang
>            Assignee: Jimmy Xiang
>         Attachments: HIVE-9367.1.patch
> [~lirui] found out that we spent quite some time on CombineFileInputFormatShim#getDirIndices.
Looked into it and it seems to me we should be able to get rid of this method completely if
we can enhance CombineFileInputFormatShim a little.

This message was sent by Atlassian JIRA

View raw message