Return-Path:
X-Original-To: apmail-hive-dev-archive@www.apache.org
Delivered-To: apmail-hive-dev-archive@www.apache.org
Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 1DB59932A for ; Thu, 28 Jun 2012 04:41:52 +0000 (UTC)
Received: (qmail 570 invoked by uid 500); 28 Jun 2012 04:41:51 -0000
Delivered-To: apmail-hive-dev-archive@hive.apache.org
Received: (qmail 99637 invoked by uid 500); 28 Jun 2012 04:41:49 -0000
Mailing-List: contact dev-help@hive.apache.org; run by ezmlm
Precedence: bulk
List-Help:
List-Unsubscribe:
List-Post:
List-Id:
Reply-To: dev@hive.apache.org
Delivered-To: mailing list dev@hive.apache.org
Received: (qmail 99340 invoked by uid 500); 28 Jun 2012 04:41:47 -0000
Delivered-To: apmail-hadoop-hive-dev@hadoop.apache.org
Received: (qmail 99274 invoked by uid 99); 28 Jun 2012 04:41:46 -0000
Received: from issues-vm.apache.org (HELO issues-vm) (140.211.11.160) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 28 Jun 2012 04:41:46 +0000
Received: from isssues-vm.apache.org (localhost [127.0.0.1]) by issues-vm (Postfix) with ESMTP id 188A814283D for ; Thu, 28 Jun 2012 04:41:46 +0000 (UTC)
Date: Thu, 28 Jun 2012 04:41:46 +0000 (UTC)
From: "Edward Capriolo (JIRA)"
To: hive-dev@hadoop.apache.org
Message-ID: <129841515.65427.1340858506105.JavaMail.jiratomcat@issues-vm>
In-Reply-To: <825257501.65259.1340854903997.JavaMail.jiratomcat@issues-vm>
Subject: [jira] [Commented] (HIVE-3206) Bucket mapjoin in trunk is not working
MIME-Version: 1.0
Content-Type: text/plain; charset=utf-8
Content-Transfer-Encoding: 7bit
X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394

    [ https://issues.apache.org/jira/browse/HIVE-3206?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13402840#comment-13402840 ]

Edward Capriolo commented on HIVE-3206:
---------------------------------------

@Navis, is there any way we can add a unit test for this? It slipped through the existing unit tests.

> Bucket mapjoin in trunk is not working
> ---------------------------------------
>
>                 Key: HIVE-3206
>                 URL: https://issues.apache.org/jira/browse/HIVE-3206
>             Project: Hive
>          Issue Type: Bug
>          Components: Query Processor
>    Affects Versions: 0.10.0
>            Reporter: Navis
>            Assignee: Navis
>
> Bucket mapjoin throws an exception while archiving stored hashtables.
> {noformat}
> hive> set hive.optimize.bucketmapjoin = true;
> hive> select /*+mapjoin(a)*/ a.key, a.value, b.value
>     > from srcbucket_mapjoin_part a join srcbucket_mapjoin_part_2 b
>     > on a.key=b.key;
> Total MapReduce jobs = 1
> 12/06/28 12:36:18 WARN conf.HiveConf: DEPRECATED: Ignoring hive-default.xml found on the CLASSPATH at /home/navis/hive/conf/hive-default.xml
> Execution log at: /tmp/navis/navis_20120628123636_5298a863-605c-4b98-bbb3-0a132c85c5a3.log
> 2012-06-28 12:36:18 Starting to launch local task to process map join; maximum memory = 932118528
> 2012-06-28 12:36:18 Processing rows: 153 Hashtable size: 153 Memory usage: 1771376 rate: 0.002
> 2012-06-28 12:36:18 Dump the hashtable into file: file:/tmp/navis/hive_2012-06-28_12-36-17_003_3016196240171705142/-local-10002/HashTable-Stage-1/MapJoin-a-00-srcbucket22.txt.hashtable
> 2012-06-28 12:36:18 Upload 1 File to: file:/tmp/navis/hive_2012-06-28_12-36-17_003_3016196240171705142/-local-10002/HashTable-Stage-1/MapJoin-a-00-srcbucket22.txt.hashtable File size: 9644
> 2012-06-28 12:36:19 Processing rows: 309 Hashtable size: 156 Memory usage: 1844568 rate: 0.002
> 2012-06-28 12:36:19 Dump the hashtable into file: file:/tmp/navis/hive_2012-06-28_12-36-17_003_3016196240171705142/-local-10002/HashTable-Stage-1/MapJoin-a-00-srcbucket23.txt.hashtable
> 2012-06-28 12:36:19 Upload 1 File to: file:/tmp/navis/hive_2012-06-28_12-36-17_003_3016196240171705142/-local-10002/HashTable-Stage-1/MapJoin-a-00-srcbucket23.txt.hashtable File size: 10023
> 2012-06-28 12:36:19 End of local task; Time Taken: 0.773 sec.
> Execution completed successfully
> Mapred Local Task Succeeded . Convert the Join into MapJoin
> Mapred Local Task Succeeded . Convert the Join into MapJoin
> Launching Job 1 out of 1
> Number of reduce tasks is set to 0 since there's no reduce operator
> java.io.IOException: This archives contains unclosed entries.
>     at org.apache.commons.compress.archivers.tar.TarArchiveOutputStream.finish(TarArchiveOutputStream.java:214)
>     at org.apache.hadoop.hive.common.FileUtils.tar(FileUtils.java:276)
>     at org.apache.hadoop.hive.ql.exec.ExecDriver.execute(ExecDriver.java:391)
>     at org.apache.hadoop.hive.ql.exec.MapRedTask.execute(MapRedTask.java:137)
>     at org.apache.hadoop.hive.ql.exec.Task.executeTask(Task.java:134)
>     at org.apache.hadoop.hive.ql.exec.TaskRunner.runSequential(TaskRunner.java:57)
>     at org.apache.hadoop.hive.ql.Driver.launchTask(Driver.java:1324)
>     at org.apache.hadoop.hive.ql.Driver.execute(Driver.java:1110)
>     at org.apache.hadoop.hive.ql.Driver.run(Driver.java:944)
>     at org.apache.hadoop.hive.cli.CliDriver.processLocalCmd(CliDriver.java:258)
>     at org.apache.hadoop.hive.cli.CliDriver.processCmd(CliDriver.java:215)
>     at org.apache.hadoop.hive.cli.CliDriver.processLine(CliDriver.java:406)
>     at org.apache.hadoop.hive.cli.CliDriver.run(CliDriver.java:744)
>     at org.apache.hadoop.hive.cli.CliDriver.main(CliDriver.java:607)
>     at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
>     at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39)
>     at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
>     at java.lang.reflect.Method.invoke(Method.java:597)
>     at org.apache.hadoop.util.RunJar.main(RunJar.java:186)
> Job Submission failed with exception 'java.io.IOException(This archives contains unclosed entries.)'
> java.lang.IllegalArgumentException: Can not create a Path from an empty string
>     at org.apache.hadoop.fs.Path.checkPathArg(Path.java:82)
>     at org.apache.hadoop.fs.Path.<init>(Path.java:90)
>     at org.apache.hadoop.hive.ql.exec.Utilities.getHiveJobID(Utilities.java:380)
>     at org.apache.hadoop.hive.ql.exec.Utilities.clearMapRedWork(Utilities.java:193)
>     at org.apache.hadoop.hive.ql.exec.ExecDriver.execute(ExecDriver.java:460)
>     at org.apache.hadoop.hive.ql.exec.MapRedTask.execute(MapRedTask.java:137)
>     at org.apache.hadoop.hive.ql.exec.Task.executeTask(Task.java:134)
>     at org.apache.hadoop.hive.ql.exec.TaskRunner.runSequential(TaskRunner.java:57)
>     at org.apache.hadoop.hive.ql.Driver.launchTask(Driver.java:1324)
>     at org.apache.hadoop.hive.ql.Driver.execute(Driver.java:1110)
>     at org.apache.hadoop.hive.ql.Driver.run(Driver.java:944)
>     at org.apache.hadoop.hive.cli.CliDriver.processLocalCmd(CliDriver.java:258)
>     at org.apache.hadoop.hive.cli.CliDriver.processCmd(CliDriver.java:215)
>     at org.apache.hadoop.hive.cli.CliDriver.processLine(CliDriver.java:406)
>     at org.apache.hadoop.hive.cli.CliDriver.run(CliDriver.java:744)
>     at org.apache.hadoop.hive.cli.CliDriver.main(CliDriver.java:607)
>     at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
>     at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39)
>     at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
>     at java.lang.reflect.Method.invoke(Method.java:597)
>     at org.apache.hadoop.util.RunJar.main(RunJar.java:186)
> FAILED: Execution Error, return code 1 from org.apache.hadoop.hive.ql.exec.MapRedTask
> {noformat}
> Seems to be a regression from HIVE-3128.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira
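
Note on the first exception in the quoted trace: the message "This archives contains unclosed entries." is raised from TarArchiveOutputStream.finish(), which suggests an entry was opened but not closed before the archive was finished. The sketch below only models that open/close/finish invariant with a toy class. ToyTarStream, its methods, and the file names are hypothetical stand-ins written for illustration; this is not the commons-compress implementation and not the code in FileUtils.tar, whose actual defect is not shown in this thread.

```java
import java.io.IOException;

/** Hypothetical stand-in for a tar output stream; NOT the commons-compress class. */
final class ToyTarStream {
    private boolean haveUnclosedEntry = false; // true between putEntry() and closeEntry()
    private boolean finished = false;
    private int closedEntries = 0;

    /** Starts a new entry; the previous one must already be closed. */
    void putEntry(String name) throws IOException {
        if (finished) {
            throw new IOException("Stream has already been finished");
        }
        if (haveUnclosedEntry) {
            throw new IOException("Previous entry not closed; cannot start " + name);
        }
        haveUnclosedEntry = true;
    }

    /** Closes the current entry. */
    void closeEntry() throws IOException {
        if (!haveUnclosedEntry) {
            throw new IOException("No current entry to close");
        }
        haveUnclosedEntry = false;
        closedEntries++;
    }

    /** Finishes the archive; fails if an entry is still open. */
    void finish() throws IOException {
        if (haveUnclosedEntry) {
            // Same wording as the message in the quoted stack trace.
            throw new IOException("This archives contains unclosed entries.");
        }
        finished = true;
    }

    int entryCount() {
        return closedEntries;
    }

    public static void main(String[] args) throws IOException {
        // Correct sequence: every entry is closed before finish().
        ToyTarStream ok = new ToyTarStream();
        ok.putEntry("a.hashtable"); // hypothetical file name
        ok.closeEntry();
        ok.finish();
        System.out.println("closed entries: " + ok.entryCount());

        // Faulty sequence: finish() is called while an entry is still open.
        ToyTarStream bad = new ToyTarStream();
        bad.putEntry("b.hashtable"); // hypothetical file name
        try {
            bad.finish();
        } catch (IOException e) {
            System.out.println("finish() failed: " + e.getMessage());
        }
    }
}
```

A regression test along the lines Edward asks about could, under the same assumption, archive more than one hashtable file and assert that finishing the archive does not throw.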