hadoop-common-commits mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Apache Wiki <wikidi...@apache.org>
Subject [Hadoop Wiki] Update of "Hadoop2OnWindows" by ArpitAgarwal
Date Fri, 17 Jan 2014 19:42:21 GMT
Dear Wiki user,

You have subscribed to a wiki page or wiki category on "Hadoop Wiki" for change notification.

The "Hadoop2OnWindows" page has been changed by ArpitAgarwal:
https://wiki.apache.org/hadoop/Hadoop2OnWindows

New page:
= Installing Hadoop 2.2 or newer on Windows =

== Introduction ==
Hadoop version 2.2 onwards includes native support for Windows. You can build and install
a Hadoop distribution natively on Windows without using Cygwin. The official Apache Hadoop
releases do not include Windows binaries (yet, as of January 2014). However the good news
building a Windows package from source is well documented.

Hadoop is a complex system with many components so some familiarity with it at least at a
high level is helpful before attempting to build or install it for the first time.

----

== Building Hadoop for Windows ==
=== Choose target OS version ===
The Hadoop developers have used ''Windows 2008 Server'' and ''Windows 2008 Server R2'' during
development and testing. ''Windows Vista'' and ''Windows 7'' are also likely to work because
of their similarities with the respective server SKUs. We have '''not''' tested on ''Windows
XP'' or any earlier versions of Windows and these are not likely to work. Any issues reported
on ''Windows XP'' or earlier will be closed as ''Invalid''.

=== Getting Hadoop sources ===
The current stable release as of January 2014 is 2.2. The source distribution can be retrieved
from Apache either using subversion or git.
 * Subversion URL: https://svn.apache.org/repos/asf/hadoop/common/branches/branch-2.2
 * Git repository URL: git://git.apache.org/hadoop-common.git. After downloading the sources
via git, switch to the stable 2.2 using '''''git checkout branch-2.2''''', or use the appropriate
branch name if you are targeting a newer version.

=== Installing Dependencies and Setting up Environment for Building ===
The [[https://svn.apache.org/viewvc/hadoop/common/branches/branch-2.2/BUILDING.txt?revision=1547143&view=markup|BUILDING.txt]]
file in the root of the source tree has detailed information on the list of requirements and
how to install them. It also includes information on setting up the environment and a few
quirks. It is strongly recommended that you read [[https://svn.apache.org/viewvc/hadoop/common/branches/branch-2.2/BUILDING.txt?revision=1547143&view=markup|BUILDING.txt]]
thoroughly.

=== Building the Package ===
To build a binary distribution run the following command '''''mvn package -Pdist,native,docs
-DskipTests -Dtar'''''

=== A few words on Native IO support ===
Hadoop includes optional Native IO support for Linux for performance. However Native IO is
mandatory on Windows and without it you will not be able to get your installation working.
It is mandatory that you follow the instructions from BUILDING.txt to download dependencies
to build native libraries.

''TODO: Document installation in more detail''

== Conclusion ==
=== Caveats ===
The following features are yet to be implemented for Windows.
 * Hadoop Security
 * Short-circuit reads
=== Questions? ===
If you have any questions you can request help from the [[https://hadoop.apache.org/mailing_lists.html|Hadoop
mailing lists]]. For help with building Hadoop on Windows, send mail to '''common-dev@hadoop.apache.org'''.
For all other questions send email to '''user@hadoop.apache.org'''. Subscribe/unsubscribe
information is included on the linked webpage. Please note that the mailing lists are monitored
by volunteers.

Mime
View raw message