hive-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Edward Capriolo <>
Subject Re: Query HDFS files without using LOAD (move)
Date Wed, 26 May 2010 18:09:48 GMT
On Wed, May 26, 2010 at 2:03 PM, Ashish Thusoo <> wrote:

> You could probably use external tables?? CREATE EXTERNAL TABLE allows you
> to create tables on hdfs files but I do not think that it takes file
> patterns / regex. If all the files are created within a directory then you
> could point the external table to the directory location and then querying
> on that table would automatically query all the files in that directory. Are
> your files in a single directory or are they spread out?
> Ashish
> -----Original Message-----
> From: Karthik []
> Sent: Wednesday, May 26, 2010 10:45 AM
> To:
> Subject: Query HDFS files without using LOAD (move)
> Is there a way where I can specify a list of files (or file pattern /
> regex) from a HDFS location other than the Hive Warehouse as a parameter to
> a Hive Query?  I have a bunch of files that are used by other applications
> as well and I need to perform queries on those as well using Hive and so I
> do not want to use LOAD and move those files on to Hive warehouse from the
> original location.
> My query is on incremental data (new files) that are added on a daily basis
> and need not use the full list of files on a folder and so I need to specify
> a list of file / pattern, something like a filter of files to the query.
> Please suggest.
> - KK.
> Also in trunk there is a feature that uses a file with a list of files as
the input. I do not know the Jira #

View raw message