Return-Path: X-Original-To: apmail-accumulo-user-archive@www.apache.org Delivered-To: apmail-accumulo-user-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 4CDA7F2D5 for ; Mon, 2 Sep 2013 02:59:45 +0000 (UTC) Received: (qmail 49123 invoked by uid 500); 2 Sep 2013 02:59:43 -0000 Delivered-To: apmail-accumulo-user-archive@accumulo.apache.org Received: (qmail 48794 invoked by uid 500); 2 Sep 2013 02:59:42 -0000 Mailing-List: contact user-help@accumulo.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@accumulo.apache.org Delivered-To: mailing list user@accumulo.apache.org Received: (qmail 48781 invoked by uid 99); 2 Sep 2013 02:59:41 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 02 Sep 2013 02:59:41 +0000 X-ASF-Spam-Status: No, hits=-5.0 required=5.0 tests=RCVD_IN_DNSWL_HI,SPF_HELO_PASS,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (nike.apache.org: domain of prvs=195018b424=matt.dickson@defence.gov.au designates 203.6.68.1 as permitted sender) Received: from [203.6.68.1] (HELO defence.gov.au) (203.6.68.1) by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 02 Sep 2013 02:59:35 +0000 From: "Dickson, Matt MR" To: "'user@accumulo.apache.org'" Date: Mon, 2 Sep 2013 12:59:08 +1000 Subject: RE: High Ingest on a single server [SEC=UNOFFICIAL] Thread-Topic: High Ingest on a single server [SEC=UNOFFICIAL] Thread-Index: Ac6nh3p5UL0kCDBhS6mzMUpl6vRQ8QAAKTXw Message-ID: <24070BEF0A3F684489AA943FD3439EF20586FA05E3@CARRXM06.drn.mil.au> References: <24070BEF0A3F684489AA943FD3439EF2058102AB9E@CARRXM06.drn.mil.au> <24070BEF0A3F684489AA943FD3439EF20586FA05E1@CARRXM06.drn.mil.au> <5223FD58.9030204@gmail.com> In-Reply-To: <5223FD58.9030204@gmail.com> Accept-Language: en-US, en-AU Content-Language: en-US X-MS-Has-Attach: X-MS-TNEF-Correlator: x-protective-marking: VER=2012.3, NS=gov.au, SEC=UNOFFICIAL, ORIGIN=matt.dickson@defence.gov.au x-tituslabs-classifications-30: TLPropertyRoot=Titus;SEC=UNOFFICIAL; x-tituslabs-classificationhash-30: VgNFIFU9Hx+/nZJb9Kg7IoaiAIyfakuTIHka7FlTMN/gM5VljTeXV0+WFxvpqWX/tBxfPo+xcpu+PdISC80mHhcYF1U/b9FfOaXsSRFWGQguMw7c9EFmuLyeXOO6qVJEcHPXVpAOGsecnhgye5PurMk5snIpknwBvZUXpMShyNVSbl19m9K1fi8etWoRVM0OYSeq25cS6PMy9EoBJp7hvA== x-titus-version: 3.5.8.4 x-tituslabs-subjectpostlabel: [SEC=UNOFFICIAL] acceptlanguage: en-US, en-AU Content-Type: text/plain; charset="us-ascii" Content-Transfer-Encoding: quoted-printable MIME-Version: 1.0 X-OriginalArrivalTime: 02 Sep 2013 02:59:09.0901 (UTC) FILETIME=[6567B7D0:01CEA788] X-Virus-Checked: Checked by ClamAV on apache.org UNOFFICIAL Just checked and there are a lot of 'not balancing because there are unhost= ed tablets' debug messages. Is this the same issue? -----Original Message----- From: Josh Elser [mailto:josh.elser@gmail.com] Sent: Monday, 2 September 2013 12:52 To: user@accumulo.apache.org Subject: Re: High Ingest on a single server [SEC=3DUNOFFICIAL] To verify that this what you're running into, you should see a message in t= he master*.debug.log that matches "not balancing because.*" On 09/01/2013 09:54 PM, John Vines wrote: > Try restarting the master. A few releases had a big where it would get=20 > stuck. > > Sent from my phone, please pardon the typos and brevity. > > On Sep 1, 2013 6:12 PM, "Dickson, Matt MR"=20 > > wrote: > > __ > > *UNOFFICIAL* > > Thanks Eric. > The tablet of concern has 2000 tablets while all others have 1000 so > it looks like the balancers aren't evening out the tablets per node > as expected. > Is there a way to force the balancer to run or rectify this > situation by moving tablets to alternate nodes? > > ---------------------------------------------------------------------= --- > *From:* Eric Newton [mailto:eric.newton@gmail.com > ] > *Sent:* Thursday, 29 August 2013 23:23 > *To:* user@accumulo.apache.org > *Subject:* Re: High Ingest on a single server [SEC=3DUNOFFICIAL] > > The balancers that ship with accumulo attempt to keep an equal > number of tablets on each server. An empty tablet, will be balanced > with the same weight as a 50G tablet. > > You can write a new balancer to take advantage of the properties of > the tablets, and any expected hotspots you have. > > > > On Thu, Aug 29, 2013 at 1:39 AM, Dickson, Matt MR > > > wrote: > > __ > > *UNOFFICIAL* > > We are seeing a single server that has less entries than all the > other nodes in the cluster. Accumulo now appears to be > directing higher ingest tablets to this node and its now getting > 7 times the ingest entries than all other nodes and is slowing > or load. Does Accumulo attempt to balance disk usage across the > nodes for a table by moving tablets and that is why we are > seeing this node ingesting more? > If not, is it possible to make accumulo rebalance the ingest > across all servers during a load? > Matt > >