hive-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Biju Kaimal <b...@kaimal.net>
Subject Performance between Hive queries vs. Hive over HBase queries
Date Tue, 08 Mar 2011 05:59:56 GMT
Hi,

I loaded a data set which has 1 million rows into both Hive and HBase
tables. For the HBase table, I created a corresponding Hive table so that
the data in HBase can be queried from Hive QL. Both tables have a key column
and a value column

For the same query (select value, count(*) from table group by value), the
Hive only query runs much faster (~ 30 seconds) as compared to Hive over
HBase (~ 150 seconds).

Is this expected?

Regards,
Biju

Mime
View raw message