Return-Path: X-Original-To: apmail-hadoop-mapreduce-issues-archive@minotaur.apache.org Delivered-To: apmail-hadoop-mapreduce-issues-archive@minotaur.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 8E7B479A8 for ; Wed, 7 Dec 2011 16:27:01 +0000 (UTC) Received: (qmail 2655 invoked by uid 500); 7 Dec 2011 16:27:01 -0000 Delivered-To: apmail-hadoop-mapreduce-issues-archive@hadoop.apache.org Received: (qmail 2586 invoked by uid 500); 7 Dec 2011 16:27:01 -0000 Mailing-List: contact mapreduce-issues-help@hadoop.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: mapreduce-issues@hadoop.apache.org Delivered-To: mailing list mapreduce-issues@hadoop.apache.org Received: (qmail 2578 invoked by uid 99); 7 Dec 2011 16:27:01 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 07 Dec 2011 16:27:01 +0000 X-ASF-Spam-Status: No, hits=-2001.2 required=5.0 tests=ALL_TRUSTED,RP_MATCHES_RCVD X-Spam-Check-By: apache.org Received: from [140.211.11.116] (HELO hel.zones.apache.org) (140.211.11.116) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 07 Dec 2011 16:27:00 +0000 Received: from hel.zones.apache.org (hel.zones.apache.org [140.211.11.116]) by hel.zones.apache.org (Postfix) with ESMTP id 0F806105946 for ; Wed, 7 Dec 2011 16:26:40 +0000 (UTC) Date: Wed, 7 Dec 2011 16:26:40 +0000 (UTC) From: "Robert Joseph Evans (Commented) (JIRA)" To: mapreduce-issues@hadoop.apache.org Message-ID: <505822707.49727.1323275200065.JavaMail.tomcat@hel.zones.apache.org> In-Reply-To: <1760760269.43518.1323131620282.JavaMail.tomcat@hel.zones.apache.org> Subject: [jira] [Commented] (MAPREDUCE-3512) Batch jobHistory disk flushes MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 [ https://issues.apache.org/jira/browse/MAPREDUCE-3512?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13164483#comment-13164483 ] Robert Joseph Evans commented on MAPREDUCE-3512: ------------------------------------------------ Unless we are some how stopping Tasks from doing any work until the event it written out to the history file, batching up the writes will reduce the number of tasks that have to rerun on AM Recovery. This is because we already have the events batched in the queue and if we crash while they are in the queue we cannot recover them. Perhaps what we want to do is to have a non-blocking check of the event queue so we can batch all events currently on the queue up to a given number of events in a single write. This way if there are not very many events we do more writes and the events are output quickly but if we start to fall behind in the writes then we start batching them up into bigger chunks which are more efficient. > Batch jobHistory disk flushes > ----------------------------- > > Key: MAPREDUCE-3512 > URL: https://issues.apache.org/jira/browse/MAPREDUCE-3512 > Project: Hadoop Map/Reduce > Issue Type: Improvement > Components: mr-am, mrv2 > Affects Versions: 0.23.0 > Reporter: Siddharth Seth > > The mr-am flushes each individual job history event to disk for AM recovery. The history even handler ends up with a significant backlog for tests like MAPREDUCE-3402. > History events could be batched up based on num records / time / TaskFinishedEvents to reduce the number of DFS writes - with the potential drawback of having to rerun some tasks during AM recovery. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira