Return-Path: Delivered-To: apmail-hadoop-core-dev-archive@www.apache.org Received: (qmail 17080 invoked from network); 10 Sep 2008 18:16:40 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (140.211.11.2) by minotaur.apache.org with SMTP; 10 Sep 2008 18:16:40 -0000 Received: (qmail 46324 invoked by uid 500); 10 Sep 2008 18:16:34 -0000 Delivered-To: apmail-hadoop-core-dev-archive@hadoop.apache.org Received: (qmail 46270 invoked by uid 500); 10 Sep 2008 18:16:33 -0000 Mailing-List: contact core-dev-help@hadoop.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: core-dev@hadoop.apache.org Delivered-To: mailing list core-dev@hadoop.apache.org Received: (qmail 46248 invoked by uid 99); 10 Sep 2008 18:16:33 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 10 Sep 2008 11:16:33 -0700 X-ASF-Spam-Status: No, hits=-1999.6 required=10.0 tests=ALL_TRUSTED,SUBJECT_FUZZY_TION X-Spam-Check-By: apache.org Received: from [140.211.11.140] (HELO brutus.apache.org) (140.211.11.140) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 10 Sep 2008 18:15:43 +0000 Received: from brutus (localhost [127.0.0.1]) by brutus.apache.org (Postfix) with ESMTP id 17430234C1DB for ; Wed, 10 Sep 2008 11:15:45 -0700 (PDT) Message-ID: <563945203.1221070545094.JavaMail.jira@brutus> Date: Wed, 10 Sep 2008 11:15:45 -0700 (PDT) From: "Doug Cutting (JIRA)" To: core-dev@hadoop.apache.org Subject: [jira] Commented: (HADOOP-4143) Support for a "raw" Partitioner that partitions based on the serialized key and not record objects In-Reply-To: <1335264166.1221011744181.JavaMail.jira@brutus> MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-Virus-Checked: Checked by ClamAV on apache.org [ https://issues.apache.org/jira/browse/HADOOP-4143?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12629895#action_12629895 ] Doug Cutting commented on HADOOP-4143: -------------------------------------- > Unfortunately, Partitioner is an interface [ ... ] This is a non-issue with HADOOP-1230. If this feature is needed soon, then we should push harder on that one. That said, I'm still not clear on the motivation. Is it performance? Comparators already provide both raw and cooked comparisons. If a partitioner is defined in terms of a comparator, it must currently used a cooked comparison, which might be slower. If this is a performance issue, then we should measure the potential performance improvement with a benchmark before we consider the API change. Are there non-performance reasons for this change? > Support for a "raw" Partitioner that partitions based on the serialized key and not record objects > -------------------------------------------------------------------------------------------------- > > Key: HADOOP-4143 > URL: https://issues.apache.org/jira/browse/HADOOP-4143 > Project: Hadoop Core > Issue Type: Improvement > Components: mapred > Reporter: Chris Douglas > Attachments: 4143-0.patch > > > For some partitioners (particularly those using comparators to classify keys), it would be helpful if one could specify a "raw" partitioner that would receive the serialized version of the key rather than the object emitted from the map. -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online.