Return-Path: X-Original-To: apmail-hadoop-mapreduce-user-archive@minotaur.apache.org Delivered-To: apmail-hadoop-mapreduce-user-archive@minotaur.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 8FA9C18461 for ; Thu, 10 Dec 2015 18:35:57 +0000 (UTC) Received: (qmail 15815 invoked by uid 500); 10 Dec 2015 18:35:53 -0000 Delivered-To: apmail-hadoop-mapreduce-user-archive@hadoop.apache.org Received: (qmail 15699 invoked by uid 500); 10 Dec 2015 18:35:53 -0000 Mailing-List: contact user-help@hadoop.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Delivered-To: mailing list user@hadoop.apache.org Received: (qmail 15689 invoked by uid 99); 10 Dec 2015 18:35:53 -0000 Received: from Unknown (HELO spamd1-us-west.apache.org) (209.188.14.142) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 10 Dec 2015 18:35:53 +0000 Received: from localhost (localhost [127.0.0.1]) by spamd1-us-west.apache.org (ASF Mail Server at spamd1-us-west.apache.org) with ESMTP id CDF65C759A for ; Thu, 10 Dec 2015 18:35:52 +0000 (UTC) X-Virus-Scanned: Debian amavisd-new at spamd1-us-west.apache.org X-Spam-Flag: NO X-Spam-Score: 1.001 X-Spam-Level: * X-Spam-Status: No, score=1.001 tagged_above=-999 required=6.31 tests=[FSL_HELO_BARE_IP_2=1, URIBL_BLOCKED=0.001] autolearn=disabled Received: from mx1-us-east.apache.org ([10.40.0.8]) by localhost (spamd1-us-west.apache.org [10.40.0.7]) (amavisd-new, port 10024) with ESMTP id j8YrHr62_8rd for ; Thu, 10 Dec 2015 18:35:43 +0000 (UTC) Received: from relayvx11b.securemail.intermedia.net (relayvx11b.securemail.intermedia.net [64.78.52.184]) by mx1-us-east.apache.org (ASF Mail Server at mx1-us-east.apache.org) with ESMTPS id F0FA2429C7 for ; Thu, 10 Dec 2015 18:35:42 +0000 (UTC) Received: from securemail.intermedia.net (localhost [127.0.0.1]) (using TLSv1 with cipher AES256-SHA (256/256 bits)) (No client certificate requested) by emg-ca-1-1.localdomain (Postfix) with ESMTPS id 467D953EE7; Thu, 10 Dec 2015 10:35:42 -0800 (PST) Subject: Re: How do I run ZKFC on a different node that is not the name node? MIME-Version: 1.0 x-echoworx-msg-id: b44c7534-91a6-4b69-ba3a-28a1b6e4799c x-echoworx-emg-received: Thu, 10 Dec 2015 10:35:42.177 -0800 x-echoworx-action: delivered Received: from 10.254.155.14 ([10.254.155.14]) by emg-ca-1-1 (JAMES SMTP Server 2.3.2) with SMTP ID 25; Thu, 10 Dec 2015 10:35:42 -0800 (PST) Received: from MBX080-W4-CO-1.exch080.serverpod.net (unknown [10.224.117.101]) (using TLSv1 with cipher AES256-SHA (256/256 bits)) (No client certificate requested) by emg-ca-1-1.localdomain (Postfix) with ESMTPS id F209953EE7; Thu, 10 Dec 2015 10:35:41 -0800 (PST) Received: from MBX080-W4-CO-2.exch080.serverpod.net (10.224.117.102) by MBX080-W4-CO-1.exch080.serverpod.net (10.224.117.101) with Microsoft SMTP Server (TLS) id 15.0.1130.7; Thu, 10 Dec 2015 10:35:40 -0800 Received: from MBX080-W4-CO-2.exch080.serverpod.net ([10.224.117.102]) by mbx080-w4-co-2.exch080.serverpod.net ([10.224.117.102]) with mapi id 15.00.1130.005; Thu, 10 Dec 2015 10:35:40 -0800 From: Chris Nauroth To: F21 , "user@hadoop.apache.org" Thread-Topic: How do I run ZKFC on a different node that is not the name node? Thread-Index: AQHRMwBfvUEk6tUKf02DZ7hbmJDb257EfFuA Date: Thu, 10 Dec 2015 18:35:39 +0000 Message-ID: References: <5668FA96.2040903@gmail.com> In-Reply-To: <5668FA96.2040903@gmail.com> Accept-Language: en-US Content-Language: en-US X-MS-Has-Attach: X-MS-TNEF-Correlator: x-ms-exchange-messagesentrepresentingtype: 1 x-ms-exchange-transport-fromentityheader: Hosted x-originating-ip: [50.248.208.113] x-source-routing-agent: Processed Content-Type: text/plain; charset="us-ascii" Content-ID: Content-Transfer-Encoding: quoted-printable Hello, I can see 2 potential problems. 1. I see property "dfs.ha.namenodescluster" below. If that is really what you have in your hdfs-site.xml, and not just a copy-paste error, then it looks like there is a missing period. "dfs.ha.namenodes.cluster" would be the correct property name. 2. Standard deployment is to run ZKFC on the same host as the NameNode that it watches. This eliminates a potential failure mode of network partition between the ZKFC running on one host and the NameNode running on another host. The ZKFC startup logic tries to pair up automatically with the correct NameNode by scanning the configured NameNode addresses to see which one is a network address on the same host. If you aren't running the ZKFC on the same host as the NameNode, then this automatic detection logic won't work and it will abort. If you can't arrange to run the ZKFC on the same host as the NameNode, then it's possible to override this automatic detection by explicitly pointing the ZKFC to one of the NameNodes in hdfs-site.xml. You would need to set property dfs.ha.namenode.id to either nn1 or nn2. dfs.ha.namenode.id The ID of this namenode. If the namenode ID is not configured it is determined automatically by matching the local node's address with the configured address. I hope this helps. --Chris Nauroth On 12/9/15, 8:07 PM, "F21" wrote: >I am receiving this error when trying to format zookeeper using the zkfc: > >Exception in thread "main" >org.apache.hadoop.HadoopIllegalArgumentException: Could not get the >namenode ID of this node. You may run zkfc on the node other than >namenode. > at=20 >org.apache.hadoop.hdfs.tools.DFSZKFailoverController.create(DFSZKFailoverC >ontroller.java:137) > at=20 >org.apache.hadoop.hdfs.tools.DFSZKFailoverController.main(DFSZKFailoverCon >troller.java:186) > >There wasn't any information regarding this on the web. I am running the >namenode, journalnode, datanode and zkfc within separate docker >containers. > >This is the configuration in the zkfc container (I have launched 3 of >these): > > > > > dfs.permissions.superusergroup > hadoop > > >dfs.client.failover.proxy.provider.cluster >org.apache.hadoop.hdfs.server.namenode.ha.ConfiguredFailoverProxyPr >ovider > > > dfs.ha.fencing.methods > shell(/bin/true) > > > ha.zookeeper.quorum > zookeeper:2181 > > > dfs.nameservices > cluster > > > dfs.ha.namenodescluster > nn1,nn2 > > > dfs.namenode.rpc-address.cluster.nn1 > namenode1:8020 > > > dfs.namenode.rpc-address.cluster.nn2 > namenode2:8020 > > > dfs.namenode.http-address.cluster.nn1 > namenode1:50070 > > > dfs.namenode.http-address.cluster.nn2 > namenode2:50070 > > > >Any ideas how I can get this working? > >--------------------------------------------------------------------- >To unsubscribe, e-mail: user-unsubscribe@hadoop.apache.org >For additional commands, e-mail: user-help@hadoop.apache.org > > --------------------------------------------------------------------- To unsubscribe, e-mail: user-unsubscribe@hadoop.apache.org For additional commands, e-mail: user-help@hadoop.apache.org