Return-Path: Delivered-To: apmail-hadoop-mapreduce-issues-archive@minotaur.apache.org Received: (qmail 63340 invoked from network); 16 Feb 2010 21:37:51 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (140.211.11.3) by minotaur.apache.org with SMTP; 16 Feb 2010 21:37:51 -0000 Received: (qmail 94799 invoked by uid 500); 16 Feb 2010 21:37:51 -0000 Delivered-To: apmail-hadoop-mapreduce-issues-archive@hadoop.apache.org Received: (qmail 94729 invoked by uid 500); 16 Feb 2010 21:37:51 -0000 Mailing-List: contact mapreduce-issues-help@hadoop.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: mapreduce-issues@hadoop.apache.org Delivered-To: mailing list mapreduce-issues@hadoop.apache.org Received: (qmail 94719 invoked by uid 99); 16 Feb 2010 21:37:51 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 16 Feb 2010 21:37:51 +0000 X-ASF-Spam-Status: No, hits=-2000.0 required=10.0 tests=ALL_TRUSTED X-Spam-Check-By: apache.org Received: from [140.211.11.140] (HELO brutus.apache.org) (140.211.11.140) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 16 Feb 2010 21:37:49 +0000 Received: from brutus.apache.org (localhost [127.0.0.1]) by brutus.apache.org (Postfix) with ESMTP id 608D4234C4D3 for ; Tue, 16 Feb 2010 13:37:28 -0800 (PST) Message-ID: <1248310748.318021266356248394.JavaMail.jira@brutus.apache.org> Date: Tue, 16 Feb 2010 21:37:28 +0000 (UTC) From: "Owen O'Malley (JIRA)" To: mapreduce-issues@hadoop.apache.org Subject: [jira] Commented: (MAPREDUCE-326) The lowest level map-reduce APIs should be byte oriented MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 X-Virus-Checked: Checked by ClamAV on apache.org [ https://issues.apache.org/jira/browse/MAPREDUCE-326?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12834485#action_12834485 ] Owen O'Malley commented on MAPREDUCE-326: ----------------------------------------- {quote} One group should not block avenues of development because they are not their current priority when they might be a current priority for others. {quote} We haven't blocked anything. There are two proposals on the table: * add an entirely new abstraction layer and API * make a small incremental change to the current API You insist on option 1. You haven't provided any arguments against the much simpler option 2. Having implemented the data pipeline (sort and shuffle) multiple times, I don't think it is wise to expose the details of that implementation. They *will* change to address new issues as they arise. I think it is quite indicative of the relative merits that the people with the most experience in the data path for the sort and shuffle (Arun, Chris, and me) all have strong concerns about your proposal. We've spent a lot of time and effort helping to refine your proposal to get your goal accomplished and yet you continue to ignore our advice and experience. This is an Apache project where we need to reach consensus. > The lowest level map-reduce APIs should be byte oriented > -------------------------------------------------------- > > Key: MAPREDUCE-326 > URL: https://issues.apache.org/jira/browse/MAPREDUCE-326 > Project: Hadoop Map/Reduce > Issue Type: Improvement > Reporter: eric baldeschwieler > Attachments: MAPREDUCE-326-api.patch, MAPREDUCE-326.pdf > > > As discussed here: > https://issues.apache.org/jira/browse/HADOOP-1986#action_12551237 > The templates, serializers and other complexities that allow map-reduce to use arbitrary types complicate the design and lead to lots of object creates and other overhead that a byte oriented design would not suffer. I believe the lowest level implementation of hadoop map-reduce should have byte string oriented APIs (for keys and values). This API would be more performant, simpler and more easily cross language. > The existing API could be maintained as a thin layer on top of the leaner API. -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online.