Date: Fri, 31 May 2013 22:30:21 +0000 (UTC)
From: "Cheolsoo Park (JIRA)"
To: pig-dev@hadoop.apache.org
Subject: [jira] [Updated] (PIG-3288) Kill jobs if the number of output files is over a configurable limit

[ https://issues.apache.org/jira/browse/PIG-3288?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Cheolsoo Park updated PIG-3288:
-------------------------------
    Status: Patch Available (was: Open)

> Kill jobs if the number of output files is over a configurable limit
> --------------------------------------------------------------------
>
> Key: PIG-3288
> URL: https://issues.apache.org/jira/browse/PIG-3288
> Project: Pig
> Issue Type: Wish
> Reporter: Cheolsoo Park
> Assignee: Cheolsoo Park
> Fix For: 0.12
> Attachments: PIG-3288-2.patch, PIG-3288-3.patch, PIG-3288.patch
>
> I ran into a situation where a Pig job tried to create too many files on HDFS and overloaded the NameNode (NN). To prevent such events, it would be nice if we could set an upper limit on the number of files that a Pig job can create.
> In fact, Hive has a property called "hive.exec.max.created.files". The idea is that each mapper/reducer increments a counter every time it creates a file. Then, MRLauncher periodically checks whether the number of files created so far has exceeded the upper limit. If so, we kill the running jobs and exit.
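The counter-and-poll idea described in the issue can be sketched roughly as follows. This is only a toy, single-process simulation under stated assumptions, not the attached patch and not the Hadoop counters API: the names CreatedFilesCounter, Job, check_limit, run, and max_created_files are all hypothetical and chosen for illustration.

```python
import threading


class CreatedFilesCounter:
    """Thread-safe stand-in for a job-wide counter (hypothetical, not a Hadoop class)."""

    def __init__(self):
        self._lock = threading.Lock()
        self._value = 0

    def increment(self, n=1):
        with self._lock:
            self._value += n

    def value(self):
        with self._lock:
            return self._value


class Job:
    """Toy job whose tasks report every file they create."""

    def __init__(self, name, counter):
        self.name = name
        self.counter = counter
        self.killed = False

    def create_files(self, n):
        # A real mapper/reducer would bump the counter where it opens output files.
        if not self.killed:
            self.counter.increment(n)

    def kill(self):
        self.killed = True


def check_limit(jobs, counter, max_created_files):
    """One polling step of the launcher: kill every job if the limit is exceeded."""
    if counter.value() > max_created_files:
        for job in jobs:
            job.kill()
        return True
    return False


def run(files_per_round, max_created_files):
    """Advance the toy jobs round by round, polling the counter after each round."""
    counter = CreatedFilesCounter()
    jobs = [Job("job-%d" % i, counter) for i in range(len(files_per_round[0]))]
    for rnd, per_job in enumerate(files_per_round):
        for job, n in zip(jobs, per_job):
            job.create_files(n)
        if check_limit(jobs, counter, max_created_files):
            return {"killed": True, "round": rnd, "created": counter.value()}
    return {"killed": False, "round": len(files_per_round) - 1,
            "created": counter.value()}
```

For example, run([[3, 4], [5, 6], [7, 8]], 10) models two jobs over three polling intervals with a limit of 10 files. Because the check is periodic, the jobs can overshoot the limit between polls, so the limit acts as a soft cap rather than a hard one.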