Date: Tue, 6 Nov 2012 00:32:11 +0000 (UTC)
From: "Kihwal Lee (JIRA)"
To: yarn-issues@hadoop.apache.org
Message-ID: <922372241.71645.1352161932050.JavaMail.jiratomcat@arcas>
In-Reply-To: <162795736.71643.1352161812521.JavaMail.jiratomcat@arcas>
Subject: [jira] [Commented] (YARN-202) Log Aggregation generates a storm of fsync() for namenode

[ https://issues.apache.org/jira/browse/YARN-202?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13491087#comment-13491087 ]

Kihwal Lee commented on YARN-202:
---------------------------------

This problem will probably go away if we can leave out hflush() from LogWriter#append().
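To illustrate the suggestion, here is a minimal sketch of how flushing on every append multiplies flush calls compared with flushing once on close. It uses plain java.io only, not the Hadoop API: CountingStream, SketchLogWriter and countFlushes are hypothetical stand-ins, and flush() here merely stands in for hflush(). It is not the actual LogWriter code.

```java
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.io.OutputStream;
import java.nio.charset.StandardCharsets;

public class FlushStormSketch {

    /** Stand-in for a DFS output stream: counts flush() calls instead of talking to a namenode. */
    static final class CountingStream extends OutputStream {
        private final ByteArrayOutputStream sink = new ByteArrayOutputStream();
        int flushCalls = 0;

        @Override public void write(int b) { sink.write(b); }
        @Override public void write(byte[] b, int off, int len) { sink.write(b, off, len); }
        @Override public void flush() { flushCalls++; }
    }

    /** Hypothetical writer (NOT the real LogWriter): optionally flushes after each append. */
    static final class SketchLogWriter implements AutoCloseable {
        private final OutputStream out;
        private final boolean flushOnAppend;

        SketchLogWriter(OutputStream out, boolean flushOnAppend) {
            this.out = out;
            this.flushOnAppend = flushOnAppend;
        }

        void append(String containerLog) throws IOException {
            out.write(containerLog.getBytes(StandardCharsets.UTF_8));
            if (flushOnAppend) {
                out.flush(); // one flush per append: the pattern described in this issue
            }
        }

        @Override public void close() throws IOException {
            out.flush(); // a single flush when the writer is closed
            out.close();
        }
    }

    /** Writes the given number of log entries and returns how many flushes were issued. */
    static int countFlushes(int appends, boolean flushOnAppend) throws IOException {
        CountingStream stream = new CountingStream();
        try (SketchLogWriter writer = new SketchLogWriter(stream, flushOnAppend)) {
            for (int i = 0; i < appends; i++) {
                writer.append("container log " + i + "\n");
            }
        }
        return stream.flushCalls;
    }

    public static void main(String[] args) throws IOException {
        System.out.println("flush per append: " + countFlushes(1000, true));
        System.out.println("flush on close:   " + countFlushes(1000, false));
    }
}
```

In this toy model the flush count grows with the number of appends in the first mode and stays constant in the second. Whether dropping the per-append hflush() is acceptable for the real log writer depends on what its readers expect from partially written files, which this sketch does not address.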
> Log Aggregation generates a storm of fsync() for namenode
> ---------------------------------------------------------
>
> Key: YARN-202
> URL: https://issues.apache.org/jira/browse/YARN-202
> Project: Hadoop YARN
> Issue Type: Bug
> Affects Versions: 2.0.2-alpha, 0.23.4
> Reporter: Kihwal Lee
> Priority: Critical
>
> When log aggregation is on, each write to an aggregated container log causes hflush() to be called. On large clusters, this can create a lot of fsync() calls for the namenode.
> We have seen a 6-7x increase in the average number of fsync operations compared to 1.0.x on a large, busy cluster. Over 99% of the fsync ops were for log aggregation writing to tmp files.