Return-Path: X-Original-To: apmail-hadoop-common-user-archive@www.apache.org Delivered-To: apmail-hadoop-common-user-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 0965164DD for ; Mon, 13 Jun 2011 12:51:15 +0000 (UTC) Received: (qmail 47181 invoked by uid 500); 13 Jun 2011 12:51:11 -0000 Delivered-To: apmail-hadoop-common-user-archive@hadoop.apache.org Received: (qmail 47033 invoked by uid 500); 13 Jun 2011 12:51:11 -0000 Mailing-List: contact common-user-help@hadoop.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: common-user@hadoop.apache.org Delivered-To: mailing list common-user@hadoop.apache.org Received: (qmail 46444 invoked by uid 99); 13 Jun 2011 12:51:10 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 13 Jun 2011 12:51:10 +0000 X-ASF-Spam-Status: No, hits=-1.6 required=5.0 tests=RCVD_IN_DNSWL_MED,SPF_NEUTRAL X-Spam-Check-By: apache.org Received-SPF: neutral (athena.apache.org: local policy) Received: from [192.6.10.60] (HELO tobor.hpl.hp.com) (192.6.10.60) by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 13 Jun 2011 12:51:02 +0000 Received: from localhost (localhost [127.0.0.1]) by tobor.hpl.hp.com (Postfix) with ESMTP id 51600B7D23 for ; Mon, 13 Jun 2011 13:50:40 +0100 (BST) X-Virus-Scanned: amavisd-new at hplb.hpl.hp.com Received: from tobor.hpl.hp.com ([127.0.0.1]) by localhost (tobor.hpl.hp.com [127.0.0.1]) (amavisd-new, port 10024) with LMTP id Oo4+v5NtxU7B for ; Mon, 13 Jun 2011 13:50:39 +0100 (BST) Received: from 0-imap-br1.hpl.hp.com (0-imap-br1.hpl.hp.com [16.25.144.60]) by tobor.hpl.hp.com (Postfix) with ESMTP id 81825B7D22 for ; Mon, 13 Jun 2011 13:50:39 +0100 (BST) MailScanner-NULL-Check: 1308574225.61095@TZaj28qcuv6p2kGk/zBlDA Received: from [16.25.175.86] (wildhaus.hpl.hp.com [16.25.175.86]) by 0-imap-br1.hpl.hp.com (8.14.1/8.13.4) with ESMTP id p5DCoPeL022933 for ; Mon, 13 Jun 2011 13:50:25 +0100 (BST) Message-ID: <4DF60791.4070803@apache.org> Date: Mon, 13 Jun 2011 13:50:25 +0100 From: Steve Loughran User-Agent: Mozilla/5.0 (X11; U; Linux x86_64; en-US; rv:1.9.2.17) Gecko/20110419 Red Hat/3.1.10-1.el6_0 Thunderbird/3.1.10 MIME-Version: 1.0 To: common-user@hadoop.apache.org Subject: Re: Hadoop on windows with bat and ant scripts References: <31815353.post@talk.nabble.com> In-Reply-To: Content-Type: text/plain; charset=ISO-8859-1; format=flowed Content-Transfer-Encoding: 7bit X-HPL-MailScanner-Information: Please contact the ISP for more information X-MailScanner-ID: p5DCoPeL022933 X-HPL-MailScanner: Found to be clean X-HPL-MailScanner-From: stevel@apache.org On 06/10/2011 03:23 PM, Bible, Landy wrote: > Hi Raja, > > I'm currently running HDFS on Windows 7 desktops. I had to create a hadoop.bat that provided the same functionality of the shell scripts, and some Java Service Wrapper configs to run the DataNodes and NameNode as windows services. Once I get my system more functional I plan to do a write up about how I did it, but it wasn't too difficult. I'd also like to see Hadoop become less platform dependent. why? Do you plan to bring up a real Windows server datacenter to test it on? >Java is supposed to be Write Once - Run Anywhere, but a lot of java projects seem to forget that. Java can be x-platform, but you have to consider the problems of testing on hundreds of machines, the fact that even System.execute() behaves differently on different systems, the networking setup and behaviour of windows is very different from Unix, etc. Whether you like it or not, all the big Hadoop clusters run on Linux, not just for the licensing costs, but because it is what Hadoop is tested on at those scales, so it becomes self-reinforcing. Same for the JVM: Sun's standard JVM, not JRockit or anything else. Again, in a large datacenter you will find all the corner cases where that "runs anywhere" claim changes to "crashes one task tracker every hour". OS/X and Windows support is very much there for development, though even there I'd recommend switching to a Linux laptop to reduce the surprises when you go to the real cluster. Allen W will note that Solaris works too, but even then differences between Linux and SunOS caused problems. By having a de-facto agreement to focus on Linux as the back end, it lets the developers * have a single platform to dev and test on * worry about RPM and deb installers, not windows install/uninstall quirks. * share ready-to-use Linux VM images (as Cloudera do) for people to play with. * use the large cluster management tooling that exists for managing big Linux clusters (Kickstart, etc). I think it's important is for the client-side code to work on windows, for job submission to be x-platform, but getting server-side code to work well on windows is a lot harder than people expect. The OS wasn't really written for it, the Java Service Wrappers have their own issues (both the Apache one, which is derived from Tomcat, and the other one), and it's not something I'd recommend to go near unless you really have no choice in the matter. I speak from experience. Sorry. > > So far, I've been unable to make MapReduce work correctly. The services run, but things don't work, however I suspect that this is due to DNS not working correctly in my environment. yes, that's part of the anywhere you have to fix. Edit the host tables so that DNS and reverse DNS appears to work. That's c:\windows\system32\drivers\etc\hosts, unless on a win64 box it moves.