hive-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Marcello de Sales <>
Subject Best approach for data mashup... Import external Data from MySQL to a Hive table or use Custom Map using JDBC?
Date Tue, 16 Nov 2010 20:11:16 GMT
Hello all,

I'm using Hive to do mashups of data that is currently on HDFS and data is
that is on a MySQL database... I'm considering this scenario instead of
bringing the data from MySQL to HDFS because it is a matter verifying the
relating data from the query to the data in the external data storage
(mysql). This external data is has a fixed size measured in Megabytes and
provide indexes for calculations...

What's the best approach? Should I import the external MySQL tables into
Hive (probably using sqoop) and provide the needed joins/passes or should I
just use a Map and access the needed data using JDBC from a custom Map?


View raw message