hive-user mailing list archives

From Dave <drive...@gmail.com>
Subject Ignore subdirectories when querying external table
Date Thu, 18 Aug 2011 22:53:28 GMT
Hi,

I have a partitioned external table in Hive, and the partition
directories contain other subdirectories that are not related to the table
itself. Hive appears to scan those subdirectories, because a SELECT on the
table fails with this error:

Failed with exception java.io.IOException:java.io.IOException: Not a file:
hdfs://path/to/partition/path/to/subdir

Hive does seem to ignore directories whose names start with an underscore
(e.g. _directory).
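
To make the symptoms concrete, here is a rough Python model of what I am
observing. It is only a sketch of the behaviour described above, not Hive's
or Hadoop's actual code; the names Entry and list_inputs are made up for
illustration.

```python
from dataclasses import dataclass


@dataclass
class Entry:
    """Hypothetical stand-in for one item in a partition directory."""
    name: str
    is_dir: bool


def list_inputs(entries):
    """Model of the observed behaviour (an assumption, not real Hive code):
    names starting with '_' are skipped, and any other directory
    aborts the scan with a 'Not a file' error."""
    files = []
    for entry in entries:
        if entry.name.startswith("_"):
            # Observed: underscore-prefixed entries are silently ignored.
            continue
        if entry.is_dir:
            # Observed: any other subdirectory makes the query fail.
            raise IOError(f"Not a file: {entry.name}")
        files.append(entry.name)
    return files
```

What I would like is for the second check to skip the directory instead of
raising an error.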

I am using Hive 0.7.1 on Hadoop 0.20.2.

Is there a way to force Hive to ignore all subdirectories in external tables
and only look at files?

Thanks in advance,
-Dave
