hive-dev mailing list archives

From "Joydeep Sen Sarma (JIRA)" <>
Subject [jira] Commented: (HIVE-2051) getInputSummary() to call FileSystem.getContentSummary() in parallel
Date Sun, 20 Mar 2011 14:32:29 GMT


Joydeep Sen Sarma commented on HIVE-2051:

based on:

it seems that the right thing to do here is to catch the InterruptedException and then call
Thread.currentThread().interrupt() to restore the thread's interrupt status (grep for 'swallow interrupt' in this article).

we could also rethrow it - but the problem then will merely be punted to the higher layer
(which probably will ignore it as well)
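
for illustration, a minimal sketch of the catch-and-restore pattern described above. the class name InterruptRestoreSketch and the helper awaitQuietly are hypothetical and are not taken from any of the HIVE-2051 patches; the sketch only assumes standard java.util.concurrent behaviour.

```java
import java.util.concurrent.CountDownLatch;

class InterruptRestoreSketch {

    // Hypothetical helper: blocks on the latch. If the wait is interrupted,
    // the interrupt flag is set again instead of being silently swallowed.
    static boolean awaitQuietly(CountDownLatch latch) {
        try {
            latch.await();
            return true;
        } catch (InterruptedException e) {
            // Throwing InterruptedException cleared the flag; restore it so
            // code further up the stack can still observe the interrupt.
            Thread.currentThread().interrupt();
            return false;
        }
    }

    public static void main(String[] args) {
        // Simulate an interrupt that arrives before the blocking call.
        Thread.currentThread().interrupt();
        boolean completed = awaitQuietly(new CountDownLatch(1));
        System.out.println("completed=" + completed
                + ", interrupted=" + Thread.currentThread().isInterrupted());
        // prints "completed=false, interrupted=true"
    }
}
```

the caller does not have to declare or handle InterruptedException, yet the interrupt is not lost: any later blocking call on the same thread will see the flag and fail fast.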

> getInputSummary() to call FileSystem.getContentSummary() in parallel
> --------------------------------------------------------------------
>                 Key: HIVE-2051
>                 URL:
>             Project: Hive
>          Issue Type: Improvement
>            Reporter: Siying Dong
>            Assignee: Siying Dong
>            Priority: Minor
>         Attachments: HIVE-2051.1.patch, HIVE-2051.2.patch, HIVE-2051.3.patch, HIVE-2051.4.patch
> getInputSummary() now calls FileSystem.getContentSummary() one by one, which can be extremely
slow when the number of input paths is huge. By calling those functions in parallel, we can
cut latency in most cases.
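
As a rough sketch of the idea in the description (not the code in the attached patches), the per-path lookups can be submitted to a thread pool and summed. ParallelSummarySketch, totalLength, and the lengthOf parameter are hypothetical; lengthOf stands in for a call such as FileSystem.getContentSummary(path).getLength() so the example runs without Hadoop on the classpath.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;
import java.util.concurrent.Callable;
import java.util.concurrent.ExecutionException;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.Future;
import java.util.function.ToLongFunction;

class ParallelSummarySketch {

    // Hypothetical helper: runs lengthOf for every path on a fixed-size pool
    // and returns the sum of the results.
    static long totalLength(List<String> paths, ToLongFunction<String> lengthOf, int threads) {
        ExecutorService pool = Executors.newFixedThreadPool(threads);
        try {
            List<Callable<Long>> tasks = new ArrayList<>();
            for (String p : paths) {
                tasks.add(() -> lengthOf.applyAsLong(p));
            }
            long total = 0;
            // invokeAll blocks until every task has completed.
            for (Future<Long> f : pool.invokeAll(tasks)) {
                total += f.get();
            }
            return total;
        } catch (InterruptedException e) {
            // Restore the interrupt flag rather than swallowing it.
            Thread.currentThread().interrupt();
            throw new IllegalStateException("interrupted while summarizing inputs", e);
        } catch (ExecutionException e) {
            throw new IllegalStateException("summary lookup failed", e.getCause());
        } finally {
            pool.shutdownNow();
        }
    }

    public static void main(String[] args) {
        // String::length is a stand-in for the real (slow) per-path lookup.
        List<String> paths = Arrays.asList("/a", "/bb", "/ccc");
        System.out.println(totalLength(paths, String::length, 2)); // prints 9
    }
}
```

The pool size is a tuning choice: more threads reduce wall-clock latency for many paths, at the cost of more concurrent requests to the file system.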

This message is automatically generated by JIRA.
For more information on JIRA, see:
