Date: Sun, 20 Mar 2011 14:32:29 +0000 (UTC)
From: "Joydeep Sen Sarma (JIRA)"
To: hive-dev@hadoop.apache.org
Reply-To: dev@hive.apache.org
Message-ID: <2141367347.14392.1300631549627.JavaMail.tomcat@hel.zones.apache.org>
In-Reply-To: <1816002944.14574.1299881039536.JavaMail.tomcat@hel.zones.apache.org>
Subject: [jira] Commented: (HIVE-2051) getInputSummary() to call FileSystem.getContentSummary() in parallel

    [ https://issues.apache.org/jira/browse/HIVE-2051?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13008903#comment-13008903 ]

Joydeep Sen Sarma commented on HIVE-2051:
-----------------------------------------

Based on:

http://www.ibm.com/developerworks/java/library/j-jtp05236.html

it seems that the right thing to do here is to catch the InterruptedException and then call Thread.currentThread().interrupt() (grep for 'swallow interrupt' in this article).

We could also rethrow it - but then the problem will merely be punted to the higher layer (which will probably ignore it as well).

> getInputSummary() to call FileSystem.getContentSummary() in parallel
> --------------------------------------------------------------------
>
>                 Key: HIVE-2051
>                 URL: https://issues.apache.org/jira/browse/HIVE-2051
>             Project: Hive
>          Issue Type: Improvement
>            Reporter: Siying Dong
>            Assignee: Siying Dong
>            Priority: Minor
>         Attachments: HIVE-2051.1.patch, HIVE-2051.2.patch, HIVE-2051.3.patch, HIVE-2051.4.patch
>
>
> getInputSummary() now calls FileSystem.getContentSummary() on the input paths one by one, which can be extremely slow when the number of input paths is huge. By calling those functions in parallel, we can cut latency in most cases.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira
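
For illustration, the interrupt handling suggested in the comment above might look roughly like the sketch below. This is not code from any of the attached patches. The class ParallelSummary, the method totalLength, and the sizeOf callback are hypothetical names; sizeOf stands in for a call such as FileSystem.getContentSummary(path).getLength(). Returning a partial total after an interrupt is an assumption made only to keep the example short.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;
import java.util.concurrent.Callable;
import java.util.concurrent.ExecutionException;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.Future;
import java.util.function.ToLongFunction;

// Hypothetical helper, not part of Hive: sums a per-path size in parallel.
final class ParallelSummary {

    // sizeOf is an assumed stand-in for a slow per-path lookup such as
    // FileSystem.getContentSummary(path).getLength().
    static long totalLength(List<String> paths,
                            ToLongFunction<String> sizeOf,
                            int threads) {
        ExecutorService pool = Executors.newFixedThreadPool(threads);
        try {
            // Submit one task per path so the lookups overlap.
            List<Future<Long>> futures = new ArrayList<>();
            for (String path : paths) {
                Callable<Long> task = () -> sizeOf.applyAsLong(path);
                futures.add(pool.submit(task));
            }

            long total = 0;
            for (Future<Long> future : futures) {
                try {
                    total += future.get();
                } catch (InterruptedException e) {
                    // Do not swallow the interrupt: restore the flag so that
                    // callers further up the stack can still observe it.
                    Thread.currentThread().interrupt();
                    break; // illustrative choice: stop and return what we have
                } catch (ExecutionException e) {
                    throw new RuntimeException(e.getCause());
                }
            }
            return total;
        } finally {
            // Cancel any outstanding work and release the worker threads.
            pool.shutdownNow();
        }
    }

    public static void main(String[] args) {
        // String::length plays the role of the size lookup here.
        long total = totalLength(Arrays.asList("a", "bb", "ccc"), String::length, 2);
        System.out.println(total);
    }
}
```

The alternative mentioned in the comment, rethrowing, would mean declaring InterruptedException on totalLength and letting the caller decide; restoring the flag keeps the method signature unchanged while still preserving the interrupt.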