Return-Path: X-Original-To: apmail-hadoop-mapreduce-dev-archive@minotaur.apache.org Delivered-To: apmail-hadoop-mapreduce-dev-archive@minotaur.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id DC78ED992 for ; Thu, 15 Nov 2012 20:05:15 +0000 (UTC) Received: (qmail 19118 invoked by uid 500); 15 Nov 2012 20:05:14 -0000 Delivered-To: apmail-hadoop-mapreduce-dev-archive@hadoop.apache.org Received: (qmail 18578 invoked by uid 500); 15 Nov 2012 20:05:13 -0000 Mailing-List: contact mapreduce-dev-help@hadoop.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: mapreduce-dev@hadoop.apache.org Delivered-To: mailing list mapreduce-dev@hadoop.apache.org Received: (qmail 18470 invoked by uid 99); 15 Nov 2012 20:05:13 -0000 Received: from arcas.apache.org (HELO arcas.apache.org) (140.211.11.28) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 15 Nov 2012 20:05:13 +0000 Date: Thu, 15 Nov 2012 20:05:13 +0000 (UTC) From: "Jason Lowe (JIRA)" To: mapreduce-dev@hadoop.apache.org Message-ID: <554150771.120811.1353009913447.JavaMail.jiratomcat@arcas> Subject: [jira] [Created] (MAPREDUCE-4801) ShuffleHandler can generate large logs due to prematurely closed channels MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 Jason Lowe created MAPREDUCE-4801: ------------------------------------- Summary: ShuffleHandler can generate large logs due to prematurely closed channels Key: MAPREDUCE-4801 URL: https://issues.apache.org/jira/browse/MAPREDUCE-4801 Project: Hadoop Map/Reduce Issue Type: Bug Affects Versions: 2.0.1-alpha, 0.23.3 Reporter: Jason Lowe Priority: Critical We ran into an instance where many nodes on a cluster ran out of disk space because the nodemanager logs were huge. Examining the logs showed many, many shuffle errors due to either ClosedChannelException or IOException from "Connection reset by peer" or "Broken pipe". -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira