hive-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From zuohua zhang <>
Subject best way to load millions of gzip files in hdfs to one table in hive?
Date Tue, 02 Oct 2012 19:53:28 GMT
I have millions of gzip files in hdfs (with the same fields), would like to
load them into one table in hive with a specified schema.
What is the most efficient ways to do that?
Given that my data is only in hdfs, and also gzipped, does that mean I
could just simply set up the table somehow bypassing some unnecessary
overhead of the typical approach?


View raw message