Return-Path: X-Original-To: apmail-accumulo-user-archive@www.apache.org Delivered-To: apmail-accumulo-user-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id AAB7C1812B for ; Fri, 18 Sep 2015 15:09:22 +0000 (UTC) Received: (qmail 72939 invoked by uid 500); 18 Sep 2015 15:09:17 -0000 Delivered-To: apmail-accumulo-user-archive@accumulo.apache.org Received: (qmail 72888 invoked by uid 500); 18 Sep 2015 15:09:17 -0000 Mailing-List: contact user-help@accumulo.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@accumulo.apache.org Delivered-To: mailing list user@accumulo.apache.org Received: (qmail 72878 invoked by uid 99); 18 Sep 2015 15:09:17 -0000 Received: from Unknown (HELO spamd4-us-west.apache.org) (209.188.14.142) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 18 Sep 2015 15:09:17 +0000 Received: from localhost (localhost [127.0.0.1]) by spamd4-us-west.apache.org (ASF Mail Server at spamd4-us-west.apache.org) with ESMTP id 0AF7CC084E for ; Fri, 18 Sep 2015 15:09:17 +0000 (UTC) X-Virus-Scanned: Debian amavisd-new at spamd4-us-west.apache.org X-Spam-Flag: NO X-Spam-Score: -0.102 X-Spam-Level: X-Spam-Status: No, score=-0.102 tagged_above=-999 required=6.31 tests=[DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, RCVD_IN_MSPIKE_H2=-0.001, SPF_PASS=-0.001] autolearn=disabled Authentication-Results: spamd4-us-west.apache.org (amavisd-new); dkim=pass (2048-bit key) header.d=gmail.com Received: from mx1-us-west.apache.org ([10.40.0.8]) by localhost (spamd4-us-west.apache.org [10.40.0.11]) (amavisd-new, port 10024) with ESMTP id bW2PS5sfU5lZ for ; Fri, 18 Sep 2015 15:09:16 +0000 (UTC) Received: from mail-qg0-f44.google.com (mail-qg0-f44.google.com [209.85.192.44]) by mx1-us-west.apache.org (ASF Mail Server at mx1-us-west.apache.org) with ESMTPS id DC45F20F1E for ; Fri, 18 Sep 2015 15:09:15 +0000 (UTC) Received: by qgev79 with SMTP id v79so40860030qge.0 for ; Fri, 18 Sep 2015 08:09:15 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=message-id:date:from:user-agent:mime-version:to:subject:references :in-reply-to:content-type:content-transfer-encoding; bh=p6wbZ5yzENAy29zvpn3GkkTI20Ao4VezAOU60p27Yvc=; b=KYt/6muc40eCvjue6N9GshmQoTBxXQViRRUbq8lIAuttkMxs+7v3ToXCESkCAOZch4 8QKcDp1KMW3AGYxiwSoRGq6KoXJvomen2sV/Ooctz64exEZiOrlRZVRbOScofCsdQL0m joqdslHGy4sOJeLCJn/PcCxgI4xkYPIHvx9qcbLwZ557S7EqHqm5p1aeMiF706NYq93A 2vI+IFKoum92uVGDl4tX596d9xMizUlhGjz8VBpNzXwoQZVm7zGmOwgdAflPEmmDd6/L /KJmtKeFsWOX4I2ghP5xC2fhYpvu/IlaDee7PaMFnDESr/tnZ8oWP84pVrnU/Kci7kt1 RMcQ== X-Received: by 10.140.22.48 with SMTP id 45mr7060633qgm.23.1442588954893; Fri, 18 Sep 2015 08:09:14 -0700 (PDT) Received: from hw10447.local (pool-68-134-10-53.bltmmd.fios.verizon.net. [68.134.10.53]) by smtp.googlemail.com with ESMTPSA id k64sm3748485qhk.2.2015.09.18.08.09.14 for (version=TLSv1 cipher=ECDHE-RSA-RC4-SHA bits=128/128); Fri, 18 Sep 2015 08:09:14 -0700 (PDT) Message-ID: <55FC291A.3060303@gmail.com> Date: Fri, 18 Sep 2015 11:09:14 -0400 From: Josh Elser User-Agent: Postbox 3.0.11 (Macintosh/20140602) MIME-Version: 1.0 To: user@accumulo.apache.org Subject: Re: Warning about outstanding migrations References: <55FBA3AE.60402@orkash.com> In-Reply-To: <55FBA3AE.60402@orkash.com> Content-Type: text/plain; charset=UTF-8; format=flowed Content-Transfer-Encoding: 7bit A migration is the Master choosing to move a Tablet from TabletServerX to TabletServerY. Balancing is the process of moving a Tablet, again from one TabletServer to another, per the configured LoadBalancer implementation to meet some Tablet distribution. In this case, there is a migration for one Tablet. If the Balancer were to run now, the current state of Tablets it would operate on would be invalidate because the current distribution of Tablets doesn't match the desired distribution. A good example would be a system with 10 tablets on one tabletserver. A second tabletserver is started, but has no tablets yet. The balancer would not want to rebalance [10, 0], so it would wait until the migrations have been processed. As long as balancing is happening (you don't constantly see such a warning message), you can safely ignore it. Sometimes a server will take a little time to process the migration due to load. However, we have had bugs in the past where the Master gets stuck thinking there are outstanding migrations and never runs the balancer. This causes lots of problems after some time. If you are running into this, just restart the master when you see this happen and consider updating to a newer version of Accumulo (I'm not aware of any current bugs with this issue). - Josh mohit.kaushik wrote: > I am getting the following error usually in my master log file.. > > 2015-09-17 10:22:55,798 [balancer.DefaultLoadBalancer] WARN : Not > balancing due to 1 outstanding migrations. > 2015-09-17 10:22:55,798 [balancer.DefaultLoadBalancer] WARN : Not > balancing due to 1 outstanding migrations. > 2015-09-17 10:22:55,798 [balancer.DefaultLoadBalancer] WARN : Not > balancing due to 1 outstanding migrations. > 2015-09-17 10:23:00,823 [master.EventCoordinator] INFO : Migrating 1 > more tablets, 1 total > 2015-09-17 10:23:01,034 [master.EventCoordinator] INFO : [Normal > Tablets]: 1 tablets unloaded > 2015-09-17 10:23:01,071 [master.EventCoordinator] INFO : tablet > 7n;EsBihar_1234|modi_1234< was unloaded from orkash1:9997 > 2015-09-17 10:23:01,272 [master.Master] INFO : Assigning 1 tablets > 2015-09-17 10:23:01,352 [master.EventCoordinator] INFO : [Normal > Tablets]: 1 tablets are UNASSIGNED > 2015-09-17 10:23:01,352 [master.EventCoordinator] INFO : [Normal > Tablets]: 474 tablets are HOSTED > 2015-09-17 10:23:01,381 [master.EventCoordinator] INFO : tablet > 7n;EsBihar_1234|modi_1234< was loaded on orkash2:9997 > 2015-09-17 10:23:01,606 [master.EventCoordinator] INFO : [Normal > Tablets]: 475 tablets are HOSTED > > > Can anybody explain, what are the outstanding migrations in Accumulo and > how do they effect balancing? > > Thanks > Mohit Kaushik//