Return-Path: Delivered-To: apmail-hadoop-core-dev-archive@www.apache.org Received: (qmail 85608 invoked from network); 11 Nov 2008 19:34:07 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (140.211.11.2) by minotaur.apache.org with SMTP; 11 Nov 2008 19:34:07 -0000 Received: (qmail 40288 invoked by uid 500); 11 Nov 2008 19:34:12 -0000 Delivered-To: apmail-hadoop-core-dev-archive@hadoop.apache.org Received: (qmail 40254 invoked by uid 500); 11 Nov 2008 19:34:12 -0000 Mailing-List: contact core-dev-help@hadoop.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: core-dev@hadoop.apache.org Delivered-To: mailing list core-dev@hadoop.apache.org Received: (qmail 40243 invoked by uid 99); 11 Nov 2008 19:34:12 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 11 Nov 2008 11:34:12 -0800 X-ASF-Spam-Status: No, hits=-2000.0 required=10.0 tests=ALL_TRUSTED X-Spam-Check-By: apache.org Received: from [140.211.11.140] (HELO brutus.apache.org) (140.211.11.140) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 11 Nov 2008 19:33:01 +0000 Received: from brutus (localhost [127.0.0.1]) by brutus.apache.org (Postfix) with ESMTP id 6CD3F234C295 for ; Tue, 11 Nov 2008 11:33:44 -0800 (PST) Message-ID: <140060679.1226432024444.JavaMail.jira@brutus> Date: Tue, 11 Nov 2008 11:33:44 -0800 (PST) From: "Ravi Gummadi (JIRA)" To: core-dev@hadoop.apache.org Subject: [jira] Updated: (HADOOP-2774) Add counters to show number of key/values that have been sorted and merged in the maps and reduces In-Reply-To: <2132140.1201939988789.JavaMail.jira@brutus> MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-Virus-Checked: Checked by ClamAV on apache.org [ https://issues.apache.org/jira/browse/HADOOP-2774?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ravi Gummadi updated HADOOP-2774: --------------------------------- Fix Version/s: 0.20.0 Status: Patch Available (was: Open) The following 2 new counters (1) Map First Level Spills(Number of first level spills in map task) and (2) Spilled Records(number of records spilled to disk) --- both in Maps and Reduces are added with this patch. > Add counters to show number of key/values that have been sorted and merged in the maps and reduces > -------------------------------------------------------------------------------------------------- > > Key: HADOOP-2774 > URL: https://issues.apache.org/jira/browse/HADOOP-2774 > Project: Hadoop Core > Issue Type: Bug > Reporter: Owen O'Malley > Assignee: Ravi Gummadi > Fix For: 0.20.0 > > > For each *pass* of the sort and merge, I would like a count of the number of records. So for example, if the map output 100 records and they were sorted once, the counter would be 100. If it spilled twice and was merged together, it would be 200. Clearly in a multi-level merge, it may not be a multiple of the number of map output records. This would let the users easily see if they have values like io.sort.mb or io.sort.factor set too low. -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online.