Return-Path: X-Original-To: apmail-crunch-dev-archive@www.apache.org Delivered-To: apmail-crunch-dev-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 1D0FBCEC4 for ; Fri, 1 Nov 2013 17:07:59 +0000 (UTC) Received: (qmail 59913 invoked by uid 500); 1 Nov 2013 17:07:32 -0000 Delivered-To: apmail-crunch-dev-archive@crunch.apache.org Received: (qmail 59790 invoked by uid 500); 1 Nov 2013 17:07:27 -0000 Mailing-List: contact dev-help@crunch.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: dev@crunch.apache.org Delivered-To: mailing list dev@crunch.apache.org Received: (qmail 59726 invoked by uid 500); 1 Nov 2013 17:07:25 -0000 Delivered-To: apmail-incubator-crunch-dev@incubator.apache.org Received: (qmail 59665 invoked by uid 99); 1 Nov 2013 17:07:21 -0000 Received: from arcas.apache.org (HELO arcas.apache.org) (140.211.11.28) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 01 Nov 2013 17:07:21 +0000 Date: Fri, 1 Nov 2013 17:07:21 +0000 (UTC) From: "Josh Wills (JIRA)" To: crunch-dev@incubator.apache.org Message-ID: In-Reply-To: References: Subject: [jira] [Updated] (CRUNCH-292) Hack around Hadoop2's job counter limits MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 [ https://issues.apache.org/jira/browse/CRUNCH-292?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Wills updated CRUNCH-292: ------------------------------ Attachment: CRUNCH-292.patch Here's a patch to fix it by dynamically creating new instances of Counters whenever we hit an exception creating a counter. The only hitch is that I can't really return the CounterGroup for a given group name using this approach, but I hope that doesn't cause too much of an issue for clients. > Hack around Hadoop2's job counter limits > ---------------------------------------- > > Key: CRUNCH-292 > URL: https://issues.apache.org/jira/browse/CRUNCH-292 > Project: Crunch > Issue Type: Bug > Components: Core > Affects Versions: 0.7.0 > Reporter: Josh Wills > Assignee: Josh Wills > Attachments: CRUNCH-292.patch > > > Hadoop2 introduces limits in the Counters library that set a maximum of 120 counters per job. These limits are really hard to hack around (for some good reasons); the only real way to override them is to update mapred-site.xml and restart the cluster. > This presents a challenge for Crunch's in-memory implementation, which uses the Counters library in local mode and can potentially generate well more than 120 counters when testing long pipelines. -- This message was sent by Atlassian JIRA (v6.1#6144)