Date: Fri, 28 Apr 2017 04:52:04 +0000 (UTC)
From: "Duo Zhang (JIRA)"
To: issues@hbase.apache.org
Subject: [jira] [Commented] (HBASE-17970) Set yarn.app.mapreduce.am.staging-dir when starting MiniMRCluster

    [ https://issues.apache.org/jira/browse/HBASE-17970?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15988199#comment-15988199 ]

Duo Zhang commented on HBASE-17970:
-----------------------------------

I've also found a difference between master and branch-1. On master we start the MR cluster first, but on branch-1 we start the HBase cluster first. The difference is that if we start the MR cluster first, it uses LocalFileSystem rather than the mini DFS cluster as its working file system. So if we use the mini DFS cluster, is the problem gone? The mini DFS cluster is per instance, so different tests will not share the same directory. Let me prepare a patch to try it.

> Set yarn.app.mapreduce.am.staging-dir when starting MiniMRCluster
> -----------------------------------------------------------------
>
>                 Key: HBASE-17970
>                 URL: https://issues.apache.org/jira/browse/HBASE-17970
>             Project: HBase
>          Issue Type: Bug
>          Components: test
>            Reporter: Duo Zhang
>            Assignee: Duo Zhang
>         Attachments: HBASE-17970.patch
>
>
> As discussed on the mailing list
> https://builds.apache.org/job/HBASE-Flaky-Tests/15333/artifact/hbase-server/target/surefire-reports/org.apache.hadoop.hbase.snapshot.TestMobExportSnapshot-output.txt/*view*/
> Test*ExportSnapshot will fail if they are executed concurrently, because they share the same staging directory.
> {noformat}
> java.io.FileNotFoundException: File file:/tmp/hadoop-yarn/staging/jenkins/.staging/job_1492809330233_0001/job.split does not exist
> 	at org.apache.hadoop.fs.RawLocalFileSystem.deprecatedGetFileStatus(RawLocalFileSystem.java:606)
> 	at org.apache.hadoop.fs.RawLocalFileSystem.getFileLinkStatusInternal(RawLocalFileSystem.java:819)
> 	at org.apache.hadoop.fs.RawLocalFileSystem.getFileStatus(RawLocalFileSystem.java:596)
> 	at org.apache.hadoop.fs.FilterFileSystem.getFileStatus(FilterFileSystem.java:421)
> 	at org.apache.hadoop.yarn.util.FSDownload.copy(FSDownload.java:253)
> 	at org.apache.hadoop.yarn.util.FSDownload.access$000(FSDownload.java:63)
> 	at org.apache.hadoop.yarn.util.FSDownload$2.run(FSDownload.java:361)
> 	at org.apache.hadoop.yarn.util.FSDownload$2.run(FSDownload.java:359)
> 	at java.security.AccessController.doPrivileged(Native Method)
> 	at javax.security.auth.Subject.doAs(Subject.java:422)
> 	at org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1657)
> 	at org.apache.hadoop.yarn.util.FSDownload.call(FSDownload.java:359)
> 	at org.apache.hadoop.yarn.util.FSDownload.call(FSDownload.java:62)
> 	at java.util.concurrent.FutureTask.run(FutureTask.java:266)
> 	at java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:511)
> 	at java.util.concurrent.FutureTask.run(FutureTask.java:266)
> 	at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1142)
> 	at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:617)
> 	at java.lang.Thread.run(Thread.java:745)
> {noformat}
> We need to set yarn.app.mapreduce.am.staging-dir to let them use different directories.
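
A rough sketch of the "different directories" idea from the description, not the contents of HBASE-17970.patch. Each test instance gets its own freshly created staging directory, recorded under the yarn.app.mapreduce.am.staging-dir key. The class name StagingDirSketch and the helper newTestConf are hypothetical, and a plain Map stands in for the Hadoop Configuration so the snippet runs without Hadoop on the classpath. In a real test the value would presumably be passed to Configuration.set(String, String) before the MiniMRCluster is started.

```java
import java.io.IOException;
import java.io.UncheckedIOException;
import java.nio.file.Files;
import java.nio.file.Path;
import java.util.HashMap;
import java.util.Map;

final class StagingDirSketch {
    // Property name taken from the issue title.
    static final String STAGING_DIR_KEY = "yarn.app.mapreduce.am.staging-dir";

    // Hypothetical helper: creates a new, uniquely named directory for one
    // test instance and stores its path under the staging-dir key.
    // The Map is a stand-in for a Hadoop Configuration (assumption).
    static Map<String, String> newTestConf(String testName) {
        try {
            Path stagingDir = Files.createTempDirectory(testName + "-mr-staging-");
            Map<String, String> conf = new HashMap<>();
            conf.put(STAGING_DIR_KEY, stagingDir.toAbsolutePath().toString());
            return conf;
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }

    public static void main(String[] args) {
        // Two tests that may run concurrently on the same host.
        Map<String, String> a = newTestConf("TestExportSnapshot");
        Map<String, String> b = newTestConf("TestMobExportSnapshot");
        System.out.println(a.get(STAGING_DIR_KEY));
        System.out.println(b.get(STAGING_DIR_KEY));
    }
}
```

Because every call creates a new directory, two tests running at the same time no longer both resolve to /tmp/hadoop-yarn/staging, which is the shared path visible in the stack trace above.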