Return-Path: X-Original-To: archive-asf-public-internal@cust-asf2.ponee.io Delivered-To: archive-asf-public-internal@cust-asf2.ponee.io Received: from cust-asf.ponee.io (cust-asf.ponee.io [163.172.22.183]) by cust-asf2.ponee.io (Postfix) with ESMTP id DF3A1200C01 for ; Thu, 19 Jan 2017 16:43:28 +0100 (CET) Received: by cust-asf.ponee.io (Postfix) id DC34A160B54; Thu, 19 Jan 2017 15:43:28 +0000 (UTC) Delivered-To: archive-asf-public@cust-asf.ponee.io Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by cust-asf.ponee.io (Postfix) with SMTP id D44C0160B42 for ; Thu, 19 Jan 2017 16:43:27 +0100 (CET) Received: (qmail 15762 invoked by uid 500); 19 Jan 2017 15:43:26 -0000 Mailing-List: contact user-help@hbase.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@hbase.apache.org Delivered-To: mailing list user@hbase.apache.org Received: (qmail 15750 invoked by uid 99); 19 Jan 2017 15:43:26 -0000 Received: from pnap-us-west-generic-nat.apache.org (HELO spamd4-us-west.apache.org) (209.188.14.142) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 19 Jan 2017 15:43:26 +0000 Received: from localhost (localhost [127.0.0.1]) by spamd4-us-west.apache.org (ASF Mail Server at spamd4-us-west.apache.org) with ESMTP id C777BC036F for ; Thu, 19 Jan 2017 15:43:25 +0000 (UTC) X-Virus-Scanned: Debian amavisd-new at spamd4-us-west.apache.org X-Spam-Flag: NO X-Spam-Score: 2.879 X-Spam-Level: ** X-Spam-Status: No, score=2.879 tagged_above=-999 required=6.31 tests=[DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, FREEMAIL_ENVFROM_END_DIGIT=0.25, HTML_MESSAGE=2, KAM_LOTSOFHASH=0.25, RCVD_IN_DNSWL_NONE=-0.0001, RCVD_IN_MSPIKE_H3=-0.01, RCVD_IN_MSPIKE_WL=-0.01, RCVD_IN_SORBS_SPAM=0.5, SPF_PASS=-0.001] autolearn=disabled Authentication-Results: spamd4-us-west.apache.org (amavisd-new); dkim=pass (2048-bit key) header.d=gmail.com Received: from mx1-lw-eu.apache.org ([10.40.0.8]) by localhost (spamd4-us-west.apache.org [10.40.0.11]) (amavisd-new, port 10024) with ESMTP id Bg-w6f-kHR7c for ; Thu, 19 Jan 2017 15:43:22 +0000 (UTC) Received: from mail-oi0-f42.google.com (mail-oi0-f42.google.com [209.85.218.42]) by mx1-lw-eu.apache.org (ASF Mail Server at mx1-lw-eu.apache.org) with ESMTPS id BD97B5F282 for ; Thu, 19 Jan 2017 15:43:21 +0000 (UTC) Received: by mail-oi0-f42.google.com with SMTP id j15so26803057oih.2 for ; Thu, 19 Jan 2017 07:43:21 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20161025; h=mime-version:in-reply-to:references:from:date:message-id:subject:to; bh=FXGFRx5+92+C2AUYvh5VV7R+RugjNoOcHSqIv3KQJV4=; b=SLPrMDBeAY6/ar63w47c2TEc4k+M2D6/ipEPqklShdlgIBbggZGEgYiPNMnAFvnPwE /5bvVjW00Hum1RDQzfOFysQjNtStGgVgfktMStgsnU3rNMtkLckmRoIAjrs0Ni4zldMg HqfFXRQGOsDyFKIOSoG13PKm+OChSzCd+0IEu67TMo25VLyBnOuWAkGHzNHPgZvxBnoh FCF+FyW9VoXX7RTZxBhAektDuP6xb0uYOY+A9IrNNfdVB7dzeM+92OzMUBjdydqG18Cs 0Qwk9Xvx3j/B/MA3W/fLkS1GxcN5mYrrGlzd9pAQCDbCin0e21x59TiCoCz+vH/tFry8 s2nA== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:mime-version:in-reply-to:references:from:date :message-id:subject:to; bh=FXGFRx5+92+C2AUYvh5VV7R+RugjNoOcHSqIv3KQJV4=; b=fpeX2lWQ3LcvnPjCsbv4qdwhD4HpMkogS9ck+pS7w8GrcqZSrLx30ba8Ki/XCDpnWa Ay3NPjZD11eqfkQOpYZv32Bt5enGXXoPd8DdWyTZcImEFo6NMxUf9XNURFbiLFog8woZ QhfbL8xDUwJtReqErUKcEhKKk8SSMzv50U27oEexKqWDf1c2HqNAUOO/wAfPOKddQ7+F /H3M6NQ0QHsIt3uJGpIl4hY9XoqCIJ6Gg+MuzN5DUoXD5CopzqMf7qa7DYjvo17gkmT3 3F+Obz2RVNp2WfxuvQWUZsboAM3H3Bv95B7nnBIuPYJMxv4qetYpDpZuKT9dIQYa7KjO A09g== X-Gm-Message-State: AIkVDXK0GJgOK4EiMIIl2GZl6zwDgY81d6BhB9Rk4Y4i4Lx9qhlf6fVstMBXZCfVVd8SJFdvOGWOMYCodM+7cA== X-Received: by 10.202.96.134 with SMTP id u128mr4236488oib.172.1484840596053; Thu, 19 Jan 2017 07:43:16 -0800 (PST) MIME-Version: 1.0 Received: by 10.202.213.131 with HTTP; Thu, 19 Jan 2017 07:43:15 -0800 (PST) In-Reply-To: References: From: Yu Li Date: Thu, 19 Jan 2017 23:43:15 +0800 Message-ID: Subject: Re: hbase has problems with two hostname To: Hbase-User Content-Type: multipart/alternative; boundary=001a1140f66646a58b05467464e9 archived-at: Thu, 19 Jan 2017 15:43:29 -0000 --001a1140f66646a58b05467464e9 Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: quoted-printable If you have two network cards and one hostname binding on each, please make sure to set the hostname you'd like to use in HBase in /etc/hosts on each nodes, including HMaster and every RS. Hope this helps. Best Regards, Yu On 17 January 2017 at 23:47, Dima Spivak wrote: > Is there any other DNS server running that might be confusing reverse > lookup? What happens if you run `host YOUR_RS_IP_ADDRESS`? > > And what kind of machines are you using in your deployment? > > Cheers, > > On Mon, Jan 16, 2017 at 11:34 PM C R wrote: > > > Thanks, > > > > > > > > > > > > I deployed my HBase very simply, which has one Master and three > > regionservers. > > > > > > > > > > > > [hbase@bjsh19-16-30 conf]$ more regionservers > > > > bjsh19-16-33.qbos.com > > > > bjsh19-16-34.qbos.com > > > > bjsh19-16-35.qbos.com > > > > [hbase@bjsh19-16-30 conf]$ more hbase-site.xml > > > > > > > > ... > > > > > > > > > > > > > > > > zookeeper.znode.parent > > > > /hbase117 > > > > > > > > > > > > hbase.rootdir > > > > hdfs://bidc/hbase117 > > > > > > > > > > > > hbase.zookeeper.quorum > > > > bjsh19-16-30.qbos.com,bjsh19-16-31.qbos.com, > > bjsh19-16-32.qbos.com > > > > > > > > > > > > hbase.cluster.distributed > > > > true > > > > > > > > > > > > hbase.zookeeper.property.clientPort > > > > 2181 > > > > > > > > > > > > > > > > > > > > The special place is the file /etc/hosts with one ip mapping to two > > hostnames on all nodes,so it will have the message: > > > > > > > > ... > > > > > > > > the server that tried to transition was wjsa-tsl05,16020,1484623636195 > not > > the expected bjsh19-16-34.qbos.com,16020,1484623636195 > > > > > > > > ... > > > > > > > > > > > > ________________________________ > > > > =E5=8F=91=E4=BB=B6=E4=BA=BA: Dima Spivak > > > > =E5=8F=91=E9=80=81=E6=97=B6=E9=97=B4: 2017=E5=B9=B41=E6=9C=8817=E6=97= =A5 4:50 > > > > =E6=94=B6=E4=BB=B6=E4=BA=BA: user@hbase.apache.org > > > > =E4=B8=BB=E9=A2=98: Re: hbase has problems with two hostname > > > > > > > > Hi C R, > > > > > > > > Like many Hadoop-like services, HBase is pretty temperamental about > > > > requiring forward and reverse DNS to work properly. FWIW, the > configuration > > > > file where you can populate RegionServers doesn't tend to matter as lon= g > as > > > > the hbase-site.xml file is populated correctly (it's just used to start > > > > daemons from one place). > > > > > > > > If you pass along more details about how exactly you're deploying HBase= , > we > > > > might be able to give more advice. > > > > > > > > On Mon, Jan 16, 2017 at 8:00 PM C R wrote: > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > more /etc/hosts > > > > > > > > > > > > > > > ... > > > > > > > > > > > > > > > > > > > > > > > > > 10.19.16.31 bjsh19-16-31.qbos.com wjsa-tsl02 > > > > > > > > > > > > > > > ... > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > There will have six regionservers listed in web console, but > > > > > > > > > > only three in the configuration file, metadata tables also are not > > online > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > Hmaster will be dead after a while. > > > > > > > > > > > > > > > what should I do? > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > snapshot: > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > 2017-01-17 11:45:24,394 INFO > > > > > [MASTER_SERVER_OPERATIONS-bjsh19-16-30:16000-0] > > master.AssignmentManager: > > > > > Assigning > > hbase:namespace,,1484623643279.30fab746cb3b6ceadcbda421459204b9. > > > > > to bjsh19-16-34.qbos.com,16020,1484623636195 > > > > > > > > > > > > > > > 2017-01-17 11:45:24,395 INFO [bjsh19-16-30:16000.activeMasterManager= ] > > > > > master.AssignmentManager: Joined the cluster in 23ms, failover=3Dtrue > > > > > > > > > > > > > > > 2017-01-17 11:50:24,314 FATAL [bjsh19-16-30:16000.activeMasterManager= ] > > > > > master.HMaster: Failed to become active master > > > > > > > > > > > > > > > java.io.IOException: Timedout 300000ms waiting for namespace table to > be > > > > > assigned > > > > > > > > > > > > > > > at > > > > > > > org.apache.hadoop.hbase.master.TableNamespaceManager. > start(TableNamespaceManager.java:104) > > > > > > > > > > > > > > > at > > > > > org.apache.hadoop.hbase.master.HMaster.initNamespace(HMaster.java:986= ) > > > > > > > > > > > > > > > at > > > > > > > org.apache.hadoop.hbase.master.HMaster.finishActiveMasterInitializati > on(HMaster.java:780) > > > > > > > > > > > > > > > at > > > > > org.apache.hadoop.hbase.master.HMaster.access$500(HMaster.java:183) > > > > > > > > > > > > > > > at > > org.apache.hadoop.hbase.master.HMaster$1.run(HMaster.java:1652) > > > > > > > > > > > > > > > at java.lang.Thread.run(Thread.java:745) > > > > > > > > > > > > > > > 2017-01-17 11:50:24,315 FATAL [bjsh19-16-30:16000.activeMasterManager= ] > > > > > master.HMaster: Master server abort: loaded coprocessors are: [] > > > > > > > > > > > > > > > 2017-01-17 11:50:24,316 FATAL [bjsh19-16-30:16000.activeMasterManager= ] > > > > > master.HMaster: Unhandled exception. Starting shutdown. > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > ... > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > 2017-01-17 11:27:17,926 INFO [regionserver/ > > > > > bjsh19-16-34.qbos.com/10.19.16.34:16020] regionserver.HRegionServer: > > > > > Serving as wjsa-tsl05,16020,1 > > > > > > > > > > > > > > > 484623636195, RpcServer on bjsh19-16-34.qbos.com/10.19.16.34:16020, > > > > > sessionid=3D0x154563e43e30179 > > > > > > > > > > > > > > > 2017-01-17 11:27:17,934 INFO [regionserver/ > > > > > bjsh19-16-34.qbos.com/10.19.16.34:16020] > > quotas.RegionServerQuotaManager: > > > > > Quota support disabled > > > > > > > > > > > > > > > 2017-01-17 11:27:23,966 INFO > > > > > [PriorityRpcServer.handler=3D14,queue=3D0,port=3D16020] > > > > > regionserver.RSRpcServices: Open hbase:namespace,,148462364327 > > > > > > > > > > > > > > > 9.30fab746cb3b6ceadcbda421459204b9. > > > > > > > > > > > > > > > 2017-01-17 11:27:24,008 WARN [RS_OPEN_REGION-bjsh19-16-34:16020-0] > > > > > zookeeper.ZKAssign: regionserver:16020-0x154563e43e30179, > > quorum=3Dbjsh19-16 > > > > > > > > > > > > > > > -30:2181,bjsh19-16-31:2181,bjsh19-16-32:2181, baseZNode=3D/hbase115ne= w > > > > > Attempt to transition the unassigned node for > 30fab746cb3b6ceadcbda421459 > > > > > > > > > > > > > > > 204b9 from M_ZK_REGION_OFFLINE to RS_ZK_REGION_OPENING failed, the > server > > > > > that tried to transition was wjsa-tsl05,16020,1484623636195 not the > > > > > > > > > > > > > > > expected bjsh19-16-34.qbos.com,16020,1484623636195 > > > > > > > > > > > > > > > 2017-01-17 11:27:24,008 WARN [RS_OPEN_REGION-bjsh19-16-34:16020-0] > > > > > coordination.ZkOpenRegionCoordination: Failed transition from OFFLINE > to > > O > > > > > > > > > > > > > > > PENING for region=3D30fab746cb3b6ceadcbda421459204b9 > > > > > > > > > > > > > > > 2017-01-17 11:27:24,008 WARN [RS_OPEN_REGION-bjsh19-16-34:16020-0] > > > > > handler.OpenRegionHandler: Region was hijacked? Opening cancelled for > > enco > > > > > > > > > > > > > > > dedName=3D30fab746cb3b6ceadcbda421459204b9 > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > --001a1140f66646a58b05467464e9--