Date: Sat, 8 Sep 2012 06:50:07 +1100 (NCT)
From: "Ravi Prakash (JIRA)"
To: mapreduce-issues@hadoop.apache.org
Reply-To: mapreduce-issues@hadoop.apache.org
Message-ID: <1847679313.51962.1347047407577.JavaMail.jiratomcat@arcas>
In-Reply-To: <243036377.51906.1347046208261.JavaMail.jiratomcat@arcas>
Subject: [jira] [Updated] (MAPREDUCE-4645) Providing a random seed to Slive should make the sequence of filenames completely deterministic

     [ https://issues.apache.org/jira/browse/MAPREDUCE-4645?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Ravi Prakash updated MAPREDUCE-4645:
------------------------------------
    Attachment: MAPREDUCE-4645.branch-0.23.patch

This patch changes the dummy key for the SliveMapper to be a "splitID" and the Random number generator to be seeded with that splitID + user-specified seed.
Also, the PathFinder, which generates the path, is given its own separate instance of Random, so that if you run the same Slive command twice, all ops will succeed the first time and fail the second time (because the file would already have been created / deleted the first time).

> Providing a random seed to Slive should make the sequence of filenames completely deterministic
> -----------------------------------------------------------------------------------------------
>
>                 Key: MAPREDUCE-4645
>                 URL: https://issues.apache.org/jira/browse/MAPREDUCE-4645
>             Project: Hadoop Map/Reduce
>          Issue Type: Improvement
>          Components: performance, test
>    Affects Versions: 0.23.1, 2.0.0-alpha
>            Reporter: Ravi Prakash
>            Assignee: Ravi Prakash
>              Labels: performance, test
>         Attachments: MAPREDUCE-4645.branch-0.23.patch
>
>
> Using the -random seed option still doesn't produce a deterministic sequence of filenames. Hence there's no way to replicate the performance test. If I'm providing a seed, it's obvious that I want the test to be reproducible.
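
For readers who have not opened the patch, the seeding scheme described in the update can be sketched roughly as follows. This is a minimal illustration and not the code in MAPREDUCE-4645.branch-0.23.patch: the class SeededPaths, the methods splitSeed and fileNames, and the "file_" name prefix are all hypothetical, and the patch may combine the splitID and the seed differently. It only shows the idea that a path generator with its own Random, seeded from the user seed plus the split ID, repeats the same filename sequence on every run.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Random;

public class SeededPaths {

    // Hypothetical helper: derive a per-split seed from the
    // user-specified seed and the split ID (assumed here to be a plain sum).
    public static long splitSeed(long userSeed, int splitId) {
        return userSeed + splitId;
    }

    // Generate file names from a dedicated Random instance, so that random
    // draws made elsewhere (for example, when choosing operations) cannot
    // shift the sequence of names.
    public static List<String> fileNames(long userSeed, int splitId, int count) {
        Random pathRandom = new Random(splitSeed(userSeed, splitId));
        List<String> names = new ArrayList<>();
        for (int i = 0; i < count; i++) {
            // "file_" is an illustrative prefix, not Slive's naming scheme.
            names.add("file_" + pathRandom.nextInt(1000));
        }
        return names;
    }

    public static void main(String[] args) {
        // Same seed and same split: the two lists printed are identical.
        System.out.println(fileNames(42L, 0, 3));
        System.out.println(fileNames(42L, 0, 3));
        // A different split ID uses a different derived seed.
        System.out.println(fileNames(42L, 1, 3));
    }
}
```

Because the names depend only on the seed and the split ID, a second run with the same arguments targets exactly the same files, which is why the ops are expected to succeed on the first run and fail on the second.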