hadoop-common-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Harsh J <ha...@cloudera.com>
Subject Re: Understanding execution flow of Hadoop code
Date Thu, 08 Sep 2011 15:19:01 GMT
Which parts of Hadoop 0.20 specifically?

For MapReduce, try reading from job-submission parts upwards to
execution, and onto completions of a job. Rest should be as clear as
water then.

Similarly, for HDFS, read from file creations/opening upwards to
writes and reads. There's much more to HDFS than just that, so reading
design documents and other such public documentation helps too.

Or you could go the faster way and get yourself a copy of Tom White's
Hadoop: The Definitive Guide book, which covers all the major things
you need to know in a crisp, neat way. The rest, in 'high-definition'
as they say, lies in code.

On Thu, Sep 8, 2011 at 7:53 PM, ArunKumar <arunk786@gmail.com> wrote:
> Hi !
>
> I have checkout out Hadoop-0.20.2 code into eclipse.
> I have built Hadoop project and could run some programs also.
> 1. I am interested in understanding the execution flow of Hadoop codes.
> 2. I want to make changes to some parts of the code and see the effect of
> them.
> How do i go about it ?
>
> Regards,
> Arun
>
>
> --
> View this message in context: http://lucene.472066.n3.nabble.com/Understanding-execution-flow-of-Hadoop-code-tp3319880p3319880.html
> Sent from the Hadoop lucene-dev mailing list archive at Nabble.com.
>



-- 
Harsh J

Mime
View raw message