Return-Path: Delivered-To: apmail-hadoop-mapreduce-dev-archive@minotaur.apache.org Received: (qmail 62661 invoked from network); 2 Mar 2011 22:58:02 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (140.211.11.3) by minotaur.apache.org with SMTP; 2 Mar 2011 22:58:02 -0000 Received: (qmail 66603 invoked by uid 500); 2 Mar 2011 22:58:00 -0000 Delivered-To: apmail-hadoop-mapreduce-dev-archive@hadoop.apache.org Received: (qmail 66550 invoked by uid 500); 2 Mar 2011 22:58:00 -0000 Mailing-List: contact mapreduce-dev-help@hadoop.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: mapreduce-dev@hadoop.apache.org Delivered-To: mailing list mapreduce-dev@hadoop.apache.org Received: (qmail 66431 invoked by uid 99); 2 Mar 2011 22:57:59 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 02 Mar 2011 22:57:59 +0000 X-ASF-Spam-Status: No, hits=-2000.0 required=5.0 tests=ALL_TRUSTED,T_RP_MATCHES_RCVD X-Spam-Check-By: apache.org Received: from [140.211.11.116] (HELO hel.zones.apache.org) (140.211.11.116) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 02 Mar 2011 22:57:58 +0000 Received: from hel.zones.apache.org (hel.zones.apache.org [140.211.11.116]) by hel.zones.apache.org (Postfix) with ESMTP id 684854CF1C for ; Wed, 2 Mar 2011 22:57:37 +0000 (UTC) Date: Wed, 2 Mar 2011 22:57:37 +0000 (UTC) From: "Tsz Wo (Nicholas), SZE (JIRA)" To: mapreduce-dev@hadoop.apache.org Message-ID: <1104683189.9356.1299106657423.JavaMail.tomcat@hel.zones.apache.org> Subject: [jira] Resolved: (MAPREDUCE-1712) HAR sequence files throw errors in MR jobs MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 X-Virus-Checked: Checked by ClamAV on apache.org [ https://issues.apache.org/jira/browse/MAPREDUCE-1712?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tsz Wo (Nicholas), SZE resolved MAPREDUCE-1712. ----------------------------------------------- Resolution: Duplicate Seems that MAPREDUCE-1752 fixed this. Please feel free to reopen this if it is still a problem. > HAR sequence files throw errors in MR jobs > ------------------------------------------ > > Key: MAPREDUCE-1712 > URL: https://issues.apache.org/jira/browse/MAPREDUCE-1712 > Project: Hadoop Map/Reduce > Issue Type: Bug > Components: harchive > Affects Versions: 0.20.1 > Reporter: Paul Yang > Assignee: Mahadev konar > > When a HAR is specified as the input for a map reduce job and the file format is sequence file, an error similar to the following is thrown (this one is from Hive). > {code} > java.lang.IllegalArgumentException: Offset 0 is outside of file (0..-1) > at org.apache.hadoop.mapred.FileInputFormat.getBlockIndex(FileInputFormat.java:299) > at org.apache.hadoop.mapred.FileInputFormat.getSplitHosts(FileInputFormat.java:455) > at org.apache.hadoop.mapred.FileInputFormat.getSplits(FileInputFormat.java:260) > at org.apache.hadoop.hive.ql.io.HiveInputFormat.getSplits(HiveInputFormat.java:261) > at org.apache.hadoop.mapred.JobClient.writeOldSplits(JobClient.java:827) > at org.apache.hadoop.mapred.JobClient.submitJobInternal(JobClient.java:798) > at org.apache.hadoop.mapred.JobClient.submitJob(JobClient.java:747) > at org.apache.hadoop.hive.ql.exec.ExecDriver.execute(ExecDriver.java:663) > at org.apache.hadoop.hive.ql.exec.Task.executeTask(Task.java:107) > at org.apache.hadoop.hive.ql.exec.TaskRunner.runSequential(TaskRunner.java:55) > at org.apache.hadoop.hive.ql.Driver.launchTask(Driver.java:631) > at org.apache.hadoop.hive.ql.Driver.execute(Driver.java:504) > at org.apache.hadoop.hive.ql.Driver.run(Driver.java:382) > at org.apache.hadoop.hive.cli.CliDriver.processCmd(CliDriver.java:138) > at org.apache.hadoop.hive.cli.CliDriver.processLine(CliDriver.java:197) > at org.apache.hadoop.hive.cli.CliDriver.main(CliDriver.java:303) > at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method) > at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39) > at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25) > at java.lang.reflect.Method.invoke(Method.java:597) > at org.apache.hadoop.util.RunJar.main(RunJar.java:156) > {code} > This is caused by the dummy block location returned by HarFileSystem.getFileBlockLocations(). -- This message is automatically generated by JIRA. - For more information on JIRA, see: http://www.atlassian.com/software/jira