hadoop-mapreduce-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Jeff Hammerbacher <ham...@cloudera.com>
Subject Re: Hadoop mapReduce clarification
Date Sun, 30 Aug 2009 22:04:22 GMT
(moved to mapreduce-user)
Hey Deepak,

There's a slightly dated blog post from Aaron Kimball about how to use
DBInputFormat to run a Hadoop MapReduce job over records stored in a

If you'd rather move the data from your RDBMS into HDFS and perform the
MapReduce processing there, check out Sqoop:


On Sat, Aug 29, 2009 at 2:04 PM, Deepak Halale <deepak.halale@gmail.com>wrote:

> Hi
> I am new to Hadoop and trying to write mapReduce job which gets data from
> database
> i.e x number of fields and then processes it .
> can you point me in direction where to get this information.
> I was able to run the example mapReduce wordcount sample on single machine
> ,
> but not getting the gist how the data with mulitpe columns in the flat file
> or otherwise is processed
> Thank you
> Deepak H

View raw message