Return-Path: X-Original-To: apmail-hbase-user-archive@www.apache.org Delivered-To: apmail-hbase-user-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id D4B5E9824 for ; Mon, 9 Apr 2012 07:12:12 +0000 (UTC) Received: (qmail 75851 invoked by uid 500); 9 Apr 2012 07:12:11 -0000 Delivered-To: apmail-hbase-user-archive@hbase.apache.org Received: (qmail 75551 invoked by uid 500); 9 Apr 2012 07:12:05 -0000 Mailing-List: contact user-help@hbase.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@hbase.apache.org Delivered-To: mailing list user@hbase.apache.org Received: (qmail 75518 invoked by uid 99); 9 Apr 2012 07:12:04 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 09 Apr 2012 07:12:04 +0000 X-ASF-Spam-Status: No, hits=1.5 required=5.0 tests=HTML_FONT_SIZE_LARGE,HTML_MESSAGE,RCVD_IN_DNSWL_LOW,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (athena.apache.org: domain of dwivedishashwat@gmail.com designates 209.85.216.169 as permitted sender) Received: from [209.85.216.169] (HELO mail-qc0-f169.google.com) (209.85.216.169) by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 09 Apr 2012 07:11:57 +0000 Received: by qcsd16 with SMTP id d16so2714872qcs.14 for ; Mon, 09 Apr 2012 00:11:36 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=mime-version:in-reply-to:references:date:message-id:subject:from:to :content-type; bh=wIr5//ysJnd6AVOi47UcGTIzXswfuUjl1Sy3EqzQQy8=; b=AGw9HZUnnZ8wIsxhaD5nUR1o4lHLGCUK28na3vGv+Djh3iQYPhOlhakOwwA35F+zLy VQe2creurHCv3YRlN08u+ylGzmLHLC3LVrxOyMBbCvzEIkwLxnzz4u2V4Qeu17VV5kDm X/rH4B3OM7vbKT3QsIouxuWpJbErKl0m+oMNGGrt5Q/v2L2HqUGtxn01Isg9Qgr+jXSL IP/Dh+FSztjFXFXDgtEvLbKMDmaTwBwvhNHBmelkhF3PAR3p9wwJHAQDlQtZRMMI1aaJ iNiK+oDyvmiEILKU/BU7QORmwD8qyyA06IURaL8hfwO8CWPdDgHxHq6oasAsyQ1rc4/9 5TFg== MIME-Version: 1.0 Received: by 10.229.112.1 with SMTP id u1mr2458137qcp.100.1333955496860; Mon, 09 Apr 2012 00:11:36 -0700 (PDT) Received: by 10.229.96.11 with HTTP; Mon, 9 Apr 2012 00:11:36 -0700 (PDT) In-Reply-To: References: Date: Mon, 9 Apr 2012 12:41:36 +0530 Message-ID: Subject: Re: HMaster shutdown when a DNS address cannot be solved From: shashwat shriparv To: user@hbase.apache.org Content-Type: multipart/alternative; boundary=bcaec524de51895d3004bd39b7a0 X-Virus-Checked: Checked by ClamAV on apache.org --bcaec524de51895d3004bd39b7a0 Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: quoted-printable Since your domain has changed once try to ssh to the new short name you have added, it will ask to add it to known host just yes to that question, and check if you can ping to the name you have now, On Mon, Apr 9, 2012 at 5:19 AM, Amandeep Khurana wrote: > +user > (bcc: dev) > > Mikael, > > Such questions are better suited for the user mailing list. You'll > find more people talking about issues that they ran into and possibly > get answers to your questions faster. > > Hadoop internally using a form of the linux 'hostname' command from > within Java. When servers report into the master, they register with > that hostname. Now, if the hosts cannot be reached from outside > through that name, you'll run into this issue. In other words, you > need a working DNS to get Hadoop/HBase to work properly. In your > case, there is no way for the FQDN (server16.doman=E2=80=A6.) to get mapp= ed to > the IP address it seems. You need to fix your host resolution and > restore it to the working state that it was in earlier. > > Hope this helps. > > -Amandeep > > > On Sun, Apr 8, 2012 at 11:36 AM, Mikael Sitruk > wrote: > > > > Hi devs. > > > > I have a strange situation with my cluster when an address cannot be > > resolved. > > Few days ago I had two entries in a DNS file, so a computer could be > found > > either via or . > > Now the domain entries in the DNS resolving was removed, (so only the > short > > name exists) when i try to start the cluster the master fail indicating > > that the server address cannot be resolved see below... > > Any help appreciated??? > > BTW this is hadoop-1.0.0 and HBase-0.92.0 > > > > 2012-04-08 21:06:45,496 INFO > > org.apache.hadoop.hbase.catalog.CatalogTracker: Failed verification of > > -ROOT-,,0 at address=3Dserver118,60020,1333842630426; > > org.apache.hadoop.hbase.NotServingRegionException: org.apache.had > > oop.hbase.NotServingRegionException: Region is not online: -ROOT-,,0 > > 2012-04-08 21:06:45,497 INFO > > org.apache.hadoop.hbase.catalog.RootLocationEditor: Unsetting ROOT regi= on > > location in ZooKeeper > > 2012-04-08 21:06:45,969 INFO > > org.apache.hadoop.hbase.master.handler.OpenedRegionHandler: Handling > OPENED > > event for -ROOT-,,0.70236052 from server119,60020,1333908308851; deleti= ng > > unassigned node > > 2012-04-08 21:06:45,973 INFO > > org.apache.hadoop.hbase.master.AssignmentManager: The master has opened > the > > region -ROOT-,,0.70236052 that was online on > server119,60020,1333908308851 > > 2012-04-08 21:06:45,976 INFO org.apache.hadoop.hbase.master.HMaster: > -ROOT- > > assigned=3D1, rit=3Dfalse, location=3Dserver119,60020,1333908308851 > > 2012-04-08 21:06:46,041 FATAL org.apache.hadoop.hbase.master.HMaster: > > Master server abort: loaded coprocessors are: [] > > 2012-04-08 21:06:46,042 FATAL org.apache.hadoop.hbase.master.HMaster: > > Unhandled exception. Starting shutdown. > > java.net.UnknownHostException: unknown host: server116. > > at > > > org.apache.hadoop.hbase.ipc.HBaseClient$Connection.(HBaseClient.jav= a:227) > > at > > > org.apache.hadoop.hbase.ipc.HBaseClient.getConnection(HBaseClient.java:10= 16) > > at > > org.apache.hadoop.hbase.ipc.HBaseClient.call(HBaseClient.java:878) > > at > > > org.apache.hadoop.hbase.ipc.WritableRpcEngine$Invoker.invoke(WritableRpcE= ngine.java:150) > > at $Proxy12.getProtocolVersion(Unknown Source) > > at > > > org.apache.hadoop.hbase.ipc.WritableRpcEngine.getProxy(WritableRpcEngine.= java:183) > > at > org.apache.hadoop.hbase.ipc.HBaseRPC.getProxy(HBaseRPC.java:303) > > at > org.apache.hadoop.hbase.ipc.HBaseRPC.getProxy(HBaseRPC.java:280) > > at > org.apache.hadoop.hbase.ipc.HBaseRPC.getProxy(HBaseRPC.java:332) > > at > > org.apache.hadoop.hbase.ipc.HBaseRPC.waitForProxy(HBaseRPC.java:236) > > at > > > org.apache.hadoop.hbase.client.HConnectionManager$HConnectionImplementati= on.getHRegionConnection(HConnectionManager.java:1278) > > at > > > org.apache.hadoop.hbase.client.HConnectionManager$HConnectionImplementati= on.getHRegionConnection(HConnectionManager.java:1235) > > at > > > org.apache.hadoop.hbase.client.HConnectionManager$HConnectionImplementati= on.getHRegionConnection(HConnectionManager.java:1222) > > at > > > org.apache.hadoop.hbase.catalog.CatalogTracker.getCachedConnection(Catalo= gTracker.java:564) > > at > > > org.apache.hadoop.hbase.catalog.CatalogTracker.getMetaServerConnection(Ca= talogTracker.java:422) > > at > > > org.apache.hadoop.hbase.catalog.CatalogTracker.waitForMeta(CatalogTracker= .java:478) > > at > > > org.apache.hadoop.hbase.catalog.CatalogTracker.waitForMetaServerConnectio= n(CatalogTracker.java:503) > > at > > > org.apache.hadoop.hbase.catalog.CatalogTracker.verifyMetaRegionLocation(C= atalogTracker.java:674) > > at > > > org.apache.hadoop.hbase.master.HMaster.assignRootAndMeta(HMaster.java:575= ) > > at > > > org.apache.hadoop.hbase.master.HMaster.finishInitialization(HMaster.java:= 491) > > at org.apache.hadoop.hbase.master.HMaster.run(HMaster.java:326) > > at java.lang.Thread.run(Thread.java:662) > > 2012-04-08 21:06:46,044 INFO org.apache.hadoop.hbase.master.HMaster: > > Aborting > > 2012-04-08 21:06:46,044 INFO org.apache.hadoop.ipc.HBaseServer: Stoppin= g > > server on 60000 > > > > Mikael.S > --=20 =E2=88=9E Shashwat Shriparv --bcaec524de51895d3004bd39b7a0--