hbase-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Shushant Arora <shushantaror...@gmail.com>
Subject when to use hive vs hbase
Date Wed, 30 Apr 2014 08:34:44 GMT
I have a requirement of processing huge weblogs on daily basis.

1. data will come incremental to datastore on daily basis and I  need
cumulative and daily
distinct user count from logs and after that aggregated data will be loaded
in RDBMS like mydql.

2.data will be loaded in hdfs datawarehouse on daily basis and same will be
fetched from Hdfs warehouse after some filtering in RDMS like mysql and
will be processed there.

Which datawarehouse is suitable for approach 1 and 2 and why?.


  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message