Return-Path: X-Original-To: archive-asf-public-internal@cust-asf2.ponee.io Delivered-To: archive-asf-public-internal@cust-asf2.ponee.io Received: from cust-asf.ponee.io (cust-asf.ponee.io [163.172.22.183]) by cust-asf2.ponee.io (Postfix) with ESMTP id EA68A200BA1 for ; Mon, 17 Oct 2016 17:57:08 +0200 (CEST) Received: by cust-asf.ponee.io (Postfix) id E8A74160AEC; Mon, 17 Oct 2016 15:57:08 +0000 (UTC) Delivered-To: archive-asf-public@cust-asf.ponee.io Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by cust-asf.ponee.io (Postfix) with SMTP id 3B8CD160AE2 for ; Mon, 17 Oct 2016 17:57:08 +0200 (CEST) Received: (qmail 68020 invoked by uid 500); 17 Oct 2016 15:57:06 -0000 Mailing-List: contact user-help@hbase.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@hbase.apache.org Delivered-To: mailing list user@hbase.apache.org Received: (qmail 68003 invoked by uid 99); 17 Oct 2016 15:57:06 -0000 Received: from pnap-us-west-generic-nat.apache.org (HELO spamd2-us-west.apache.org) (209.188.14.142) by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 17 Oct 2016 15:57:06 +0000 Received: from localhost (localhost [127.0.0.1]) by spamd2-us-west.apache.org (ASF Mail Server at spamd2-us-west.apache.org) with ESMTP id 28E5F1A060A for ; Mon, 17 Oct 2016 15:57:06 +0000 (UTC) X-Virus-Scanned: Debian amavisd-new at spamd2-us-west.apache.org X-Spam-Flag: NO X-Spam-Score: 1.999 X-Spam-Level: * X-Spam-Status: No, score=1.999 tagged_above=-999 required=6.31 tests=[DKIM_SIGNED=0.1, DKIM_VALID=-0.1, HTML_MESSAGE=2, RCVD_IN_DNSWL_NONE=-0.0001, RCVD_IN_MSPIKE_H2=-0.001] autolearn=disabled Authentication-Results: spamd2-us-west.apache.org (amavisd-new); dkim=pass (2048-bit key) header.d=weborama-com.20150623.gappssmtp.com Received: from mx1-lw-eu.apache.org ([10.40.0.8]) by localhost (spamd2-us-west.apache.org [10.40.0.9]) (amavisd-new, port 10024) with ESMTP id d-RYDhesSuAG for ; Mon, 17 Oct 2016 15:57:03 +0000 (UTC) Received: from mail-io0-f178.google.com (mail-io0-f178.google.com [209.85.223.178]) by mx1-lw-eu.apache.org (ASF Mail Server at mx1-lw-eu.apache.org) with ESMTPS id 706A05FBFC for ; Mon, 17 Oct 2016 15:57:02 +0000 (UTC) Received: by mail-io0-f178.google.com with SMTP id q192so192982272iod.0 for ; Mon, 17 Oct 2016 08:57:02 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=weborama-com.20150623.gappssmtp.com; s=20150623; h=mime-version:in-reply-to:references:from:date:message-id:subject:to; bh=nPHNJtQ7epKz4vhMt3QOGkMIzKHfM1hfDVvbOq5obwo=; b=cZakwUoU9pzwFmkL0xCdN46TNbGO/czoL98zMDA6rW9HG1StQueL5o3RyuppWDyIfm HdYlt1uU16TbfhS0Ga+5KR2RVlgnDQROkUFebe6v2x3vxqQQYKVmjMfrh9Lr3EkiA5Yw meZD0DogjN397k6kINq3Eo89EramqIzZbI1hQ6bJ+M4TWPEPv79AA+286TaQFr+PUWMU E1PX69aST71n8/lCr42sBkyrjzPBxMWF9v9Tuwp6XZBSw5BTj4lMp/8xSssVaYBlUHjq MfZI3WRrmbNOTzvAy+tGbb7CORENMuTW6llI7D8RH9NBsj3SZvWlVLyEDcIxU8aWh1MO ilYg== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20130820; h=x-gm-message-state:mime-version:in-reply-to:references:from:date :message-id:subject:to; bh=nPHNJtQ7epKz4vhMt3QOGkMIzKHfM1hfDVvbOq5obwo=; b=PRnyLhytBOx3A7/QclVbY3trm9omjgauy1NyQ3Z6P/iqunhS+ItjXzeqMu0namnUJX 5rl1TTcPzm8AjTQLiAC40yrQLAYo7LWWGuqj0ll1bzBtvRu6pO8QO1vNH64j39INVy+X HeGt2qVkh0Y5wfty4VCxtNsQ5fkgrO4ibi2eSGKYJC+0irEQjFFxVQjO8C1LbW8KTppu orFpol1cdTTfZEJMvPegQVwMS0BBIIhcbrXicJ07b+aWEvZlcAHiMqLAeNGb4qk2ne7s YiN2ulFzG2q9Ei4lib7sYa5rFYhXxz/t5nNQOPpsb09hSO+eLaGCqoV7BjXmtwnITUxZ r5xQ== X-Gm-Message-State: AA6/9RnvzBt7avzP/XA3CCeRs1MFml2wlGakoGRaw6UpVE7kM/VcdMdVi+tZIAKxBZEkWBOw3bkx1e6Ukli+r5If X-Received: by 10.107.147.67 with SMTP id v64mr23660496iod.60.1476719820455; Mon, 17 Oct 2016 08:57:00 -0700 (PDT) MIME-Version: 1.0 Received: by 10.79.87.132 with HTTP; Mon, 17 Oct 2016 08:56:59 -0700 (PDT) In-Reply-To: References: From: Alexander Ilyin Date: Mon, 17 Oct 2016 17:56:59 +0200 Message-ID: Subject: Re: HBase restart without region reassigning To: user@hbase.apache.org Content-Type: multipart/alternative; boundary=94eb2c05460454d514053f11a081 archived-at: Mon, 17 Oct 2016 15:57:09 -0000 --94eb2c05460454d514053f11a081 Content-Type: text/plain; charset=UTF-8 Hi Dima, These are instances in the cloud and we're using Consul for name resolution. Regarding network settings, your question is a bit broad... Which settings would you recommend to check first? On Mon, Oct 17, 2016 at 5:28 PM, Dima Spivak wrote: > Hey Alexander, > > Could something be amiss in your network settings? Seeing phantom datanodes > could be tripping things up. Are these physical machines or instances in > the cloud? > > On Monday, October 17, 2016, Alexander Ilyin > wrote: > > > Hi, > > > > We have a 7-node HBase cluster (version 1.1.2) and we change some of its > > settings from time to time which requires a restart. The problem is that > > every time after the restart load balancer reassigns the regions making > > data locality low. > > > > To address this issue we tried the settings described here: > > https://issues.apache.org/jira/browse/HBASE-6389, > > "hbase.master.wait.on.regionservers.interval" in particular. We tried it > > two times in slightly different ways but neither of them worked. First > time > > we did a rolling restart (master, then each of datanodes) and we saw 14 > > datanodes instead of 7 in Master UI. Half of them had the regions on it > > while the other half was empty. We restarted master only and we got 7 > empty > > datanodes in Master UI. After that we rollbacked the setting. > > > > Second time we restarted master and datanodes at the same time but master > > failed to read meta table, moved it to a different datanode and > reassigned > > the regions again. > > > > Please advise on how to use hbase.master.wait.on.regionservers.* > settings > > properly. Launching major compactions for all the tables after each > config > > change seems to be an overkill. Attaching Master server logs with > relevant > > lines for two attempts mentioned above. > > > > Thanks in advance. > > > > > -- > -Dima > --94eb2c05460454d514053f11a081--