hadoop-common-commits mailing list archives

From Apache Wiki <wikidi...@apache.org>
Subject [Hadoop Wiki] Update of "GitAndHadoop" by SteveLoughran
Date Fri, 27 Nov 2009 15:26:13 GMT
Dear Wiki user,

You have subscribed to a wiki page or wiki category on "Hadoop Wiki" for change notification.

The "GitAndHadoop" page has been changed by SteveLoughran.
The comment on this change is: Page on Git.


New page:
= Git And Hadoop =

Many people use Git with Hadoop because they maintain their own patches to Hadoop,
and Git helps them manage those patches.

 * GitHub provides some good lessons on Git at [[http://learn.github.com]]
 * Apache serves up read-only Git versions of its source at [[http://git.apache.org/]].
People cannot commit changes with Git; for that, the patches need to be applied to the SVN repository.

== Before you begin ==

You need a copy of Git on your system. Some IDEs ship with Git support; this page assumes
you are using the command line.
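
A quick way to confirm the prerequisite from the command line is sketched below. The helper name require_tool is hypothetical and not part of Git or Hadoop; it only checks that a command can be found on the PATH.

```shell
#!/bin/sh
# Hypothetical helper (not part of Git or Hadoop): report whether a
# command is available on the PATH.
require_tool() {
  if command -v "$1" >/dev/null 2>&1; then
    echo "found: $1"
  else
    echo "missing: $1" >&2
    return 1
  fi
}

# Print the installed Git version only if Git is present.
if require_tool git; then
  git --version
fi
```

If the helper reports that git is missing, install Git through your platform's package manager before continuing.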

== Checking out the source ==

The first step is to create your own Git repository from the Apache repositories. There are
separate repositories for all the different Hadoop sub-projects; this page looks at the core
filesystem and MapReduce engine.

Make your base hadoop directory:
  mkdir hadoop
Change into this directory:
  cd hadoop
Create "clones" of the Apache Git repositories:
  git clone git://git.apache.org/hadoop-common.git
  git clone git://git.apache.org/hadoop-hdfs.git
  git clone git://git.apache.org/hadoop-mapreduce.git
The total download is well over 100MB, so the initial checkout process works best when the
network is fast. Once downloaded, Git works offline.
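
The checkout steps above could be wrapped in a small script so they can be re-run safely, for example after an interrupted download. This is only a sketch: the function name clone_all and the DRY_RUN switch are hypothetical, and the repository URLs are the ones given on this page, which may no longer be served at that address.

```shell
#!/bin/sh
# Sketch only: clone the three repositories listed on this page into a
# base directory, skipping any that are already present.
# BASE_URL and the repository names come from the page; clone_all and
# DRY_RUN are illustrative names, not part of Git or Hadoop.
BASE_URL="git://git.apache.org"
REPOS="hadoop-common hadoop-hdfs hadoop-mapreduce"

clone_all() {
  target_dir="$1"
  mkdir -p "$target_dir" || return 1
  for repo in $REPOS; do
    if [ -d "$target_dir/$repo/.git" ]; then
      # An earlier run already produced this clone.
      echo "skip $repo (already cloned)"
    elif [ "${DRY_RUN:-0}" = "1" ]; then
      # Show the command without touching the network.
      echo "git clone $BASE_URL/$repo.git $target_dir/$repo"
    else
      git clone "$BASE_URL/$repo.git" "$target_dir/$repo" || return 1
    fi
  done
}
```

After loading the function into your shell, `clone_all hadoop` performs the three clones; setting `DRY_RUN=1` first prints the commands instead of running them, which is useful for checking the URLs before starting a large download.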
