hadoop-hive-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Zheng Shao (JIRA)" <j...@apache.org>
Subject [jira] Created: (HIVE-1001) CombinedHiveInputFormat should parse the inputpath correctly
Date Mon, 21 Dec 2009 07:44:18 GMT
CombinedHiveInputFormat should parse the inputpath correctly

                 Key: HIVE-1001
                 URL: https://issues.apache.org/jira/browse/HIVE-1001
             Project: Hadoop Hive
          Issue Type: Bug
    Affects Versions: 0.5.0
            Reporter: Zheng Shao

>From David Lerman:
I'm running into errors where CombinedHiveInputFormat is combining data from
two different tables which is causing problems because the tables have
different input formats.

It looks like the problem is in
org.apache.hadoop.hive.shims.Hadoop20Shims.getInputPathsShim.  It calls
CombineFileInputFormat.getInputPaths which returns the list of input paths
and then chops off the first 5 characters to remove file: from the
beginning, but the return value I'm getting from getInputPaths is actually
hdfs://domain/path.  So then when it creates the pools using these paths,
none of the input paths match the pools (since they're just the file path
which protocol or domain).

We should use Path.getPath() to get the path part of an URI instead of just chopping off 5

This message is automatically generated by JIRA.
You can reply to this email to add a comment to the issue online.

View raw message