Date: Thu, 30 May 2013 17:11:25 +0000 (UTC)
From: "Konstantin Boudnik (JIRA)"
To: mapreduce-issues@hadoop.apache.org
Reply-To: mapreduce-issues@hadoop.apache.org
Subject: [jira] [Updated] (MAPREDUCE-5211) Reducer intermediate files can collide during merge

     [ https://issues.apache.org/jira/browse/MAPREDUCE-5211?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Konstantin Boudnik updated MAPREDUCE-5211:
------------------------------------------
    Target Version/s: 2.0.5-beta, 0.23.8  (was: 0.23.8)

> Reducer intermediate files can collide during merge
> ---------------------------------------------------
>
>                 Key: MAPREDUCE-5211
>                 URL: https://issues.apache.org/jira/browse/MAPREDUCE-5211
>             Project: Hadoop Map/Reduce
>          Issue Type: Bug
>          Components: mrv2
>    Affects Versions: 0.23.7
>            Reporter: Jason Lowe
>            Assignee: Jason Lowe
>            Priority: Blocker
>              Labels: 2.0.4.2
>             Fix For: 0.23.8
>
>         Attachments: MAPREDUCE-5211.branch-0.23.patch
>
>
> The OnDiskMerger.merge method constructs an output path that is not unique to a reduce attempt, so the file can collide with output from other reducers of the same app that are running on the same node. In addition, the name of the output file is based on MapOutput.toString, which may not be unique across multi-pass merges on disk: the mapId will be null, so the basename ends up as "MapOutput(null, DISK)".
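
The collision described in the issue can be illustrated with a minimal sketch. This is not the Hadoop code or the attached patch: the class MergePathSketch, both methods, the directory, and the attempt ID strings are hypothetical and only model the naming problem. The first scheme builds the name from a shared directory plus a basename that every multi-pass merge produces; the second, one possible remedy, adds the reduce attempt ID and a per-pass number.

```java
import java.nio.file.Path;
import java.nio.file.Paths;

// Illustration only: models the naming problem, not the real OnDiskMerger.
final class MergePathSketch {

    // Colliding scheme: the name depends only on a per-app directory and a
    // basename that is identical for every multi-pass merge.
    static Path collidingPath(Path appDir, String mapOutputName) {
        return appDir.resolve(mapOutputName + ".merged");
    }

    // One possible remedy (an assumption, not the committed fix): include the
    // reduce attempt ID and a merge-pass number in the file name.
    static Path uniquePath(Path appDir, String reduceAttemptId, int mergePass) {
        return appDir.resolve(reduceAttemptId + ".merged." + mergePass);
    }

    public static void main(String[] args) {
        Path appDir = Paths.get("/tmp/app_0001");    // hypothetical shared directory
        String basename = "MapOutput(null, DISK)";   // basename quoted in the issue

        // Two reducers on the same node compute the same output path.
        Path reducerA = collidingPath(appDir, basename);
        Path reducerB = collidingPath(appDir, basename);
        System.out.println("colliding scheme, same path: " + reducerA.equals(reducerB));

        // With the attempt ID and pass number, the paths differ.
        Path fixedA = uniquePath(appDir, "attempt_r_000000_0", 0); // hypothetical IDs
        Path fixedB = uniquePath(appDir, "attempt_r_000001_0", 0);
        System.out.println("unique scheme, same path: " + fixedA.equals(fixedB));
    }
}
```

Passing the merge-pass number explicitly keeps the sketch deterministic; a real implementation would need its own source of per-attempt uniqueness.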