Return-Path: X-Original-To: apmail-hadoop-mapreduce-issues-archive@minotaur.apache.org Delivered-To: apmail-hadoop-mapreduce-issues-archive@minotaur.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 540CBC2E7 for ; Tue, 8 May 2012 15:32:16 +0000 (UTC) Received: (qmail 88701 invoked by uid 500); 8 May 2012 15:32:16 -0000 Delivered-To: apmail-hadoop-mapreduce-issues-archive@hadoop.apache.org Received: (qmail 88647 invoked by uid 500); 8 May 2012 15:32:16 -0000 Mailing-List: contact mapreduce-issues-help@hadoop.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: mapreduce-issues@hadoop.apache.org Delivered-To: mailing list mapreduce-issues@hadoop.apache.org Received: (qmail 88530 invoked by uid 99); 8 May 2012 15:32:15 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 08 May 2012 15:32:15 +0000 X-ASF-Spam-Status: No, hits=-2000.0 required=5.0 tests=ALL_TRUSTED,T_RP_MATCHES_RCVD X-Spam-Check-By: apache.org Received: from [140.211.11.116] (HELO hel.zones.apache.org) (140.211.11.116) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 08 May 2012 15:32:10 +0000 Received: from hel.zones.apache.org (hel.zones.apache.org [140.211.11.116]) by hel.zones.apache.org (Postfix) with ESMTP id CA1C843A1A4 for ; Tue, 8 May 2012 15:31:50 +0000 (UTC) Date: Tue, 8 May 2012 15:31:50 +0000 (UTC) From: "Avner BenHanoch (JIRA)" To: mapreduce-issues@hadoop.apache.org Message-ID: <1794487580.39222.1336491110829.JavaMail.tomcat@hel.zones.apache.org> In-Reply-To: <1070495047.41402.1332332379648.JavaMail.tomcat@hel.zones.apache.org> Subject: [jira] [Commented] (MAPREDUCE-4049) plugin for generic shuffle service MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 X-Virus-Checked: Checked by ClamAV on apache.org [ https://issues.apache.org/jira/browse/MAPREDUCE-4049?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13270541#comment-13270541 ] Avner BenHanoch commented on MAPREDUCE-4049: -------------------------------------------- Hi Asokan, Perhaps I was not clear enough. Your patch for the trunk is good enough for my needs. I can write my RDMA shuffle plugin based on either your patch or based on my patch. Hence, I am not planning to submit additional patch for the trunk on top of your patch. (I will only submit patch for 1.x) *I welcome the watchers of this issue to commit your patch!* (BTW, regarding my request to make 4 classes public, I don't see any InterfaceStability annotation on these classes. Anyhow, this request is not mandatory for me. Still, if possible, I will be glad to have these classes public. It will enable my plugin to reuse these classes instead of duplicating parts of them. Also, minor correction: I mistakenly wrote MapOutput instead of MapHost) > plugin for generic shuffle service > ---------------------------------- > > Key: MAPREDUCE-4049 > URL: https://issues.apache.org/jira/browse/MAPREDUCE-4049 > Project: Hadoop Map/Reduce > Issue Type: Improvement > Components: performance, task, tasktracker > Affects Versions: 1.1.0, 1.0.3, 2.0.0, 3.0.0 > Reporter: Avner BenHanoch > Labels: merge, plugin, rdma, shuffle > Attachments: HADOOP-1.0.2.patch, HADOOP-1.0.x.patch, HADOOP-1.0.x.patch, Hadoop Shuffle Consumer Plugin TLD.rtf, Hadoop Shuffle Provider Plugin TLD.rtf, MAPREDUCE-4049-branch-1.0.2.patch, mapred-site.xml, mapred.diff, src.tgz, test.diff > > > Support generic shuffle service as set of two plugins: ShuffleProvider & ShuffleConsumer. > This will satisfy the following needs: > # Better shuffle and merge performance. For example: we are working on shuffle plugin that performs shuffle over RDMA in fast networks (10gE, 40gE, or Infiniband) instead of using the current HTTP shuffle. Based on the fast RDMA shuffle, the plugin can also utilize a suitable merge approach during the intermediate merges. Hence, getting much better performance. > # Satisfy MAPREDUCE-3060 - generic shuffle service for avoiding hidden dependency of NodeManager with a specific version of mapreduce shuffle (currently targeted to 0.24.0). > References: > # Hadoop Acceleration through Network Levitated Merging, by Prof. Weikuan Yu from Auburn University with others, [http://pasl.eng.auburn.edu/pubs/sc11-netlev.pdf] > # I am attaching 2 documents with suggested Top Level Design for both plugins (currently, based on 1.0 branch) -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira