Date: Thu, 17 Mar 2011 01:22:32 +0000 (UTC)
From: "Arun C Murthy (JIRA)"
To: mapreduce-issues@hadoop.apache.org
Reply-To: mapreduce-issues@hadoop.apache.org
Message-ID: <77334313.8058.1300324952201.JavaMail.tomcat@hel.zones.apache.org>
Subject: [jira] Updated: (MAPREDUCE-279) Map-Reduce 2.0

[ https://issues.apache.org/jira/browse/MAPREDUCE-279?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Arun C Murthy updated
MAPREDUCE-279:
------------------------------------

    Attachment: MR-279_MR_files_to_move.txt
                MR-279.sh
                MR-279.patch

Folks, we are happy to put out a first cut of MRv2.

A brief overview:

A global ResourceManager (RM) tracks machine availability and scheduling invariants, while a per-application ApplicationMaster (AM) runs inside the cluster and tracks the program semantics for a given job. An application can be a single MapReduce job, as the JobTracker supports today, a directed acyclic graph (DAG) of MapReduce jobs, or a new framework. Each machine in the cluster runs a per-node daemon, the NodeManager (NM), which is responsible for enforcing and reporting the resource allocations made by the RM and for monitoring the lifecycle of processes spawned on behalf of an application. Each process started by the NM is conceptually a container, that is, a bundle of resources allocated by the RM.

We call the new framework (RM/NM) YARN (Yet Another Resource Negotiator)... ;-)

Source layout:
# A new yarn source folder contains the RM and NM.
# A new mr-client folder contains all of the MapReduce runtime. This includes the MapReduce ApplicationMaster and all of the classes for running MapReduce applications. Please note that the MR runtime has not changed at all, including the user APIs - we continue to support both the old 'mapred' API and the new 'mapreduce' API (context-objects). We are moving some classes from src/java/mapred/* to mr-client to achieve this.
# We have kept the old JobTracker/TaskTracker based MapReduce framework in src/java.

Build:
# We decided to embrace maven for MRv2, hence yarn and mr-client are built via maven.
# For now the old JT/TT based MR framework continues to use ant/ivy. Hopefully we can change this soon - I know Giri is working on this for common, hdfs and mapreduce in one go.

There is an INSTALL file which describes how to build and deploy MRv2, and also how to run MR applications.
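As a rough illustration of the RM/NM/AM split described in the overview, here is a toy model in Java. It is only a sketch: every class and method name below is hypothetical and none of them are the classes in the patch. The first-fit placement and the memory-only notion of a container are assumptions made to keep the example short.

```java
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

// Toy model of the RM / NM / AM split. Every name here is hypothetical;
// none of these are the classes shipped in the MR-279 patch.
public class YarnSketch {

    /** A container: a bundle of resources the RM granted on one node. */
    public static final class Container {
        public final String nodeId;
        public final int memoryMb;

        public Container(String nodeId, int memoryMb) {
            this.nodeId = nodeId;
            this.memoryMb = memoryMb;
        }
    }

    /** Per-node daemon: enforces its capacity and tracks what it launched. */
    public static final class NodeManager {
        public final String nodeId;
        public final int capacityMb;
        private final List<Container> running = new ArrayList<>();

        public NodeManager(String nodeId, int capacityMb) {
            this.nodeId = nodeId;
            this.capacityMb = capacityMb;
        }

        public int usedMb() {
            int used = 0;
            for (Container c : running) {
                used += c.memoryMb;
            }
            return used;
        }

        /** Starts a process for the container, refusing grants it cannot honor. */
        public void launch(Container c) {
            if (!c.nodeId.equals(nodeId) || usedMb() + c.memoryMb > capacityMb) {
                throw new IllegalStateException("cannot launch on " + nodeId);
            }
            running.add(c);
        }
    }

    /** Global RM: tracks nodes and grants containers; knows nothing about jobs. */
    public static final class ResourceManager {
        private final Map<String, NodeManager> nodes = new LinkedHashMap<>();
        private final Map<String, Integer> grantedMb = new LinkedHashMap<>();

        public void register(NodeManager nm) {
            nodes.put(nm.nodeId, nm);
            grantedMb.put(nm.nodeId, 0);
        }

        public NodeManager node(String nodeId) {
            return nodes.get(nodeId);
        }

        /** First-fit grant (an assumed policy); returns null when nothing fits. */
        public Container allocate(int memoryMb) {
            for (NodeManager nm : nodes.values()) {
                int granted = grantedMb.get(nm.nodeId);
                if (nm.capacityMb - granted >= memoryMb) {
                    grantedMb.put(nm.nodeId, granted + memoryMb);
                    return new Container(nm.nodeId, memoryMb);
                }
            }
            return null;
        }
    }

    /** Per-application AM: owns the job semantics, here just "run N tasks". */
    public static final class ApplicationMaster {
        private final int tasks;
        private final int memoryPerTaskMb;

        public ApplicationMaster(int tasks, int memoryPerTaskMb) {
            this.tasks = tasks;
            this.memoryPerTaskMb = memoryPerTaskMb;
        }

        /** Requests one container per task; returns how many tasks were launched. */
        public int runTasks(ResourceManager rm) {
            int launched = 0;
            for (int i = 0; i < tasks; i++) {
                Container c = rm.allocate(memoryPerTaskMb);
                if (c == null) {
                    break; // toy cluster is full; a fuller model would wait and retry
                }
                rm.node(c.nodeId).launch(c);
                launched++;
            }
            return launched;
        }
    }
}
```

In this model, registering two nodes with 2048 MB and 1024 MB and running an ApplicationMaster that wants four 1024 MB tasks launches three of them; the fourth request comes back null because nothing is left to grant. The point of the split is that only the AM knows what a "task" is, while the RM and NM deal purely in containers.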
----

I'm planning on committing this patch to a development branch (named MAPREDUCE-279) soon so that we can continue all our work via Apache in the open.

We *really* look forward to feedback and to working with the community henceforth. We have many, many miles to go and promises to keep! ;-)

PS: I have attached a script (MR-279.sh) to show the files being moved to mr-client for the MR runtime, a list of the files being moved, and the actual patch to apply afterwards. Also, please note that the patch is significantly bigger than it should be since it includes binary images (via git diff --text).

> Map-Reduce 2.0
> --------------
>
>                 Key: MAPREDUCE-279
>                 URL: https://issues.apache.org/jira/browse/MAPREDUCE-279
>             Project: Hadoop Map/Reduce
>          Issue Type: Improvement
>          Components: jobtracker, tasktracker
>            Reporter: Arun C Murthy
>            Assignee: Arun C Murthy
>             Fix For: 0.23.0
>
>         Attachments: MR-279.patch, MR-279.sh, MR-279_MR_files_to_move.txt
>
>
> Re-factor MapReduce into a generic resource scheduler and a per-job, user-defined component that manages the application execution.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira