Return-Path: X-Original-To: apmail-hadoop-common-dev-archive@www.apache.org Delivered-To: apmail-hadoop-common-dev-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id C68A9E4D3 for ; Fri, 8 Feb 2013 02:42:45 +0000 (UTC) Received: (qmail 55461 invoked by uid 500); 8 Feb 2013 02:42:43 -0000 Delivered-To: apmail-hadoop-common-dev-archive@hadoop.apache.org Received: (qmail 55384 invoked by uid 500); 8 Feb 2013 02:42:43 -0000 Mailing-List: contact common-dev-help@hadoop.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: common-dev@hadoop.apache.org Delivered-To: mailing list common-dev@hadoop.apache.org Received: (qmail 55340 invoked by uid 99); 8 Feb 2013 02:42:43 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 08 Feb 2013 02:42:43 +0000 X-ASF-Spam-Status: No, hits=0.7 required=5.0 tests=RCVD_IN_DNSWL_NONE,SPF_PASS,UNRESOLVED_TEMPLATE X-Spam-Check-By: apache.org Received-SPF: pass (athena.apache.org: domain of mahadv@microsoft.com designates 207.46.100.30 as permitted sender) Received: from [207.46.100.30] (HELO na01-by2-obe.outbound.protection.outlook.com) (207.46.100.30) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 08 Feb 2013 02:42:33 +0000 Received: from BY2FFO11FD012.protection.gbl (10.1.15.200) by BY2FFO11HUB010.protection.gbl (10.1.14.80) with Microsoft SMTP Server (TLS) id 15.0.609.9; Fri, 8 Feb 2013 02:42:10 +0000 Received: from TK5EX14HUBC103.redmond.corp.microsoft.com (131.107.125.37) by BY2FFO11FD012.mail.protection.outlook.com (10.1.14.130) with Microsoft SMTP Server (TLS) id 15.0.609.9 via Frontend Transport; Fri, 8 Feb 2013 02:42:10 +0000 Received: from tx2outboundpool.messaging.microsoft.com (157.54.51.81) by mail.microsoft.com (157.54.86.9) with Microsoft SMTP Server (TLS) id 14.2.318.3; Fri, 8 Feb 2013 02:41:55 +0000 Received: from mail165-tx2-R.bigfish.com (10.9.14.247) by TX2EHSOBE013.bigfish.com (10.9.40.33) with Microsoft SMTP Server id 14.1.225.23; Fri, 8 Feb 2013 02:41:54 +0000 Received: from mail165-tx2 (localhost [127.0.0.1]) by mail165-tx2-R.bigfish.com (Postfix) with ESMTP id 5AE34220373; Fri, 8 Feb 2013 02:41:54 +0000 (UTC) X-Forefront-Antispam-Report-Untrusted: CIP:157.56.240.21;KIP:(null);UIP:(null);(null);H:BL2PRD0310HT004.namprd03.prod.outlook.com;R:internal;EFV:INT X-SpamScore: 0 X-BigFish: PS0(zz9371I542Izz1f42h1ee6h1de0h1202h1e76h1d1ah1d2ahzz17326ah8275dhz31h2a8h668h839h944hd24hf0ah1220h1288h12a5h12a9h12bdh137ah13b6h1441h1504h1537h153bh162dh1631h1758h18e1h1946h19b5h9a9j1155h) Received-SPF: softfail (mail165-tx2: transitioning domain of microsoft.com does not designate 157.56.240.21 as permitted sender) client-ip=157.56.240.21; envelope-from=mahadv@microsoft.com; helo=BL2PRD0310HT004.namprd03.prod.outlook.com ;.outlook.com ; X-Forefront-Antispam-Report-Untrusted: SFV:SKI;SFS:;DIR:OUT;SFP:;SCL:-1;SRVR:BLUPR03MB019;H:BLUPR03MB018.namprd03.prod.outlook.com;LANG:en; Received: from mail165-tx2 (localhost.localdomain [127.0.0.1]) by mail165-tx2 (MessageSwitch) id 1360291311159887_16574; Fri, 8 Feb 2013 02:41:51 +0000 (UTC) Received: from TX2EHSMHS034.bigfish.com (unknown [10.9.14.247]) by mail165-tx2.bigfish.com (Postfix) with ESMTP id 2113840085; Fri, 8 Feb 2013 02:41:51 +0000 (UTC) Received: from BL2PRD0310HT004.namprd03.prod.outlook.com (157.56.240.21) by TX2EHSMHS034.bigfish.com (10.9.99.134) with Microsoft SMTP Server (TLS) id 14.1.225.23; Fri, 8 Feb 2013 02:41:50 +0000 Received: from BLUPR03MB019.namprd03.prod.outlook.com (10.255.208.41) by BL2PRD0310HT004.namprd03.prod.outlook.com (10.255.97.39) with Microsoft SMTP Server (TLS) id 14.16.263.1; Fri, 8 Feb 2013 02:41:49 +0000 Received: from BLUPR03MB018.namprd03.prod.outlook.com (10.255.208.40) by BLUPR03MB019.namprd03.prod.outlook.com (10.255.208.41) with Microsoft SMTP Server (TLS) id 15.0.620.10; Fri, 8 Feb 2013 02:41:48 +0000 Received: from BLUPR03MB018.namprd03.prod.outlook.com ([169.254.4.236]) by BLUPR03MB018.namprd03.prod.outlook.com ([169.254.4.236]) with mapi id 15.00.0620.005; Fri, 8 Feb 2013 02:41:48 +0000 From: Mahadevan Venkatraman To: "common-dev@hadoop.apache.org" , "yarn-dev@hadoop.apache.org" , "hdfs-dev@hadoop.apache.org" , "mapreduce-dev@hadoop.apache.org" Subject: RE: Heads up - merge branch-trunk-win to trunk Thread-Topic: Heads up - merge branch-trunk-win to trunk Thread-Index: AQHOBZ2xonAQdumCH0K6Q7DFAFBONZhvOF8g Date: Fri, 8 Feb 2013 02:41:47 +0000 Message-ID: References: In-Reply-To: Accept-Language: en-US Content-Language: en-US X-MS-Has-Attach: X-MS-TNEF-Correlator: x-originating-ip: [10.255.156.132] Content-Type: text/plain; charset="us-ascii" Content-Transfer-Encoding: quoted-printable MIME-Version: 1.0 X-OrganizationHeadersPreserved: BLUPR03MB019.namprd03.prod.outlook.com X-FOPE-CONNECTOR: Id%0$Dn%*$RO%0$TLS%0$FQDN%$TlsDn% X-FOPE-CONNECTOR: Id%59$Dn%HADOOP.APACHE.ORG$RO%2$TLS%6$FQDN%corpf5vips-237160.customer.frontbridge.com$TlsDn% X-CrossPremisesHeadersPromoted: TK5EX14HUBC103.redmond.corp.microsoft.com X-CrossPremisesHeadersFiltered: TK5EX14HUBC103.redmond.corp.microsoft.com X-Forefront-Antispam-Report: CIP:131.107.125.37;CTRY:US;IPV:CAL;IPV:NLI;EFV:NLI;SFV:NSPM;SFS:(199002)(13464002)(189002)(377454001)(47736001)(50466001)(23726001)(4396001)(33646001)(49866001)(47776003)(54316002)(20776003)(65816001)(54356001)(50986001)(46406002)(63696002)(79102001)(76482001)(47976001)(59766001)(16676001)(47446002)(56816002)(44976002)(31966008)(46102001)(74502001)(77982001)(56776001)(53806001)(5343655001)(51856001)(6806001)(74662001)(80022001)(24736002);DIR:OUT;SFP:;SCL:1;SRVR:BY2FFO11HUB010;H:TK5EX14HUBC103.redmond.corp.microsoft.com;RD:InfoDomainNonexistent;A:1;MX:1;LANG:en; X-OriginatorOrg: microsoft.onmicrosoft.com X-Forefront-PRVS: 0751474A44 X-Virus-Checked: Checked by ClamAV on apache.org It is super exciting to look at the prospect of these changes being merged = to trunk. Having Windows as one of the supported Hadoop platforms is a fant= astic opportunity both for the Hadoop project and Microsoft customers. This work began around a year back when a few of us started with a basic po= rt of Hadoop on Windows. Ever since, the Hadoop team in Microsoft have made= significant progress in the following areas: (PS: Some of these items are already included in Suresh's email, but includ= ing again for completeness) - Command-line scripts for the Hadoop surface area - Mapping the HDFS permissions model to Windows - Abstracted and reconciled mismatches around differences in Path semantics= in Java and Windows - Native Task Controller for Windows=20 - Implementation of a Block Placement Policy to support cloud environments,= more specifically Azure. - Implementation of Hadoop native libraries for Windows (compression codecs= , native I/O) - Several reliability issues, including race-conditions, inte= rmittent test failures, resource leaks. - Several new unit test cases written for the above changes In the process, we have closely engaged with the Apache open source communi= ty and have got great support and assistance from the community in terms of= contributing fixes, code review comments and commits.=20 In addition, the Hadoop team at Microsoft has also made good progress in ot= her projects including Hive, Pig, Sqoop, Oozie, HCat and HBase. Many of the= se changes have already been committed to the respective trunks with help f= rom various committers and contributors. It is great to see the commitment = of the community to support multiple platforms, and we look forward to the = day when a developer/customer is able to successfully deploy a complete sol= ution stack based on Apache Hadoop releases. Next Steps: All of the above changes are part of the Windows Azure HDInsight and HDInsi= ght Server products from Microsoft. We have successfully on-boarded several= internal customers and have been running production workloads on Windows A= zure HDInsight. Our vision is to create a big data platform based on Hadoop= , and we are committed to helping make Hadoop a world-class solution that a= nyone can use to solve their biggest data challenges.=20 As an immediate next step, we would like to have a discussion around how we= can ensure that the quality of the mainline Hadoop branches on Windows is = maintained. To this end, we would like to get to the state where we have pr= e-checkin validation gates and nightly test runs enabled on Windows. If you= have any suggestions around this, please do send an email. We are committ= ed to helping sustain the long-term quality of Hadoop on both Linux and Win= dows. We sincerely thank the community for their contribution and support so far.= And hope to continue having a close engagement in the future. -Microsoft HDInsight Team -----Original Message----- From: Suresh Srinivas [mailto:suresh@hortonworks.com]=20 Sent: Thursday, February 7, 2013 5:42 PM To: common-dev@hadoop.apache.org; yarn-dev@hadoop.apache.org; hdfs-dev@hado= op.apache.org; mapreduce-dev@hadoop.apache.org Subject: Heads up - merge branch-trunk-win to trunk The support for Hadoop on Windows was proposed in HADOOP-8079 almost a year ago. The goal was to ma= ke Hadoop natively integrated, full-featured, and performance and scalabili= ty tuned on Windows Server or Windows Azure. We are happy to announce that a lot of progress has been made in this regar= d. Initial work started in a feature branch, branch-1-win, based on branch-1. The details related to the work done in the branch can be seen in CHANGES.t= xt. This work has been ported to a branch, branch-trunk-win, based on trunk. Merge patch for this is available on HADOOP-8562 . Highlights of the work done so far: 1. Necessary changes in Hadoop to run natively on Windows. These changes ha= ndle differences in platforms related to path names, process/task managemen= t etc. 2. Addition of winutils tools for managing file permissions and ownership, = user group mapping, hardlinks, symbolic links, chmod, disk utilization, and= process/task management. 3. Added cmd scripts equivalent to existing shell scripts hadoop-daemon.sh,= start and stop scripts. 4. Addition of block placement policy implemnation to support cloud envirom= ent, more specifically Azure. We are very close to wrapping up the work in branch-trunk-win and getting r= eady for a merge. Currently the merge patch is passing close to 100% of uni= t tests on Linux. Soon I will call for a vote to merge this branch into tru= nk. Next steps: 1. Call for vote to merge branch-trunk-win to trunk, when the work complete= s and precommit build is clean. 2. Start a discussion on adding Jenkins precommit builds on windows and how= to integrate that with the existing commit process. Let me know if you have any questions. Regards, Suresh