Return-Path: X-Original-To: apmail-accumulo-user-archive@www.apache.org Delivered-To: apmail-accumulo-user-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id BC3CB176C6 for ; Sat, 4 Oct 2014 16:05:00 +0000 (UTC) Received: (qmail 57004 invoked by uid 500); 4 Oct 2014 16:05:00 -0000 Delivered-To: apmail-accumulo-user-archive@accumulo.apache.org Received: (qmail 56951 invoked by uid 500); 4 Oct 2014 16:05:00 -0000 Mailing-List: contact user-help@accumulo.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@accumulo.apache.org Delivered-To: mailing list user@accumulo.apache.org Received: (qmail 56941 invoked by uid 99); 4 Oct 2014 16:05:00 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Sat, 04 Oct 2014 16:05:00 +0000 X-ASF-Spam-Status: No, hits=-0.7 required=5.0 tests=RCVD_IN_DNSWL_LOW,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (athena.apache.org: domain of david.medinets@gmail.com designates 74.125.82.44 as permitted sender) Received: from [74.125.82.44] (HELO mail-wg0-f44.google.com) (74.125.82.44) by apache.org (qpsmtpd/0.29) with ESMTP; Sat, 04 Oct 2014 16:04:56 +0000 Received: by mail-wg0-f44.google.com with SMTP id y10so3559855wgg.27 for ; Sat, 04 Oct 2014 09:04:35 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=mime-version:in-reply-to:references:date:message-id:subject:from:to :content-type; bh=gcxyhMGlkKtLGuPQIJJN41DXRrv9uCgkL9O2c1zoBXs=; b=ToaHlgE9e8bYoXWorHRCnJ46z10YI8L8n4a/gABEhPqGfx7yIJCfwYcemptd0TIf2P qSMIM2YHW7aALZUkvdV18cwlJ1AYdVPsSqATiae7FJMTSdzjoMMKvBL4wVoFwr1rumlR aRm1VUUsHtyhu/X7aUNmR/1MyXwg+vbfgI5T/hhmLNSeoEfuQON4lfg/W+o4a22QD8Nq +lvz3ZVyqyOdwwRk3gI/iQA9uuWjuVOaGHS7kuibzukGlng9mCkAwZBtPgo3ADiScbej 1yNasODTZtgBDKMJuubgG4QPrdNYn10DjupjLjkAr7d1PT6tN0xezh8jKc+s2WpSTDoT PnWQ== MIME-Version: 1.0 X-Received: by 10.180.77.163 with SMTP id t3mr6609311wiw.25.1412438675274; Sat, 04 Oct 2014 09:04:35 -0700 (PDT) Received: by 10.194.39.40 with HTTP; Sat, 4 Oct 2014 09:04:35 -0700 (PDT) In-Reply-To: References: Date: Sat, 4 Oct 2014 12:04:35 -0400 Message-ID: Subject: Re: Determining tablets assigned to table splits, and the number of rows in each tablet From: David Medinets To: accumulo-user Content-Type: text/plain; charset=UTF-8 X-Virus-Checked: Checked by ClamAV on apache.org I did some work to explain these topics at https://github.com/medined/D4M_Schema/blob/master/docs/data_distribution.md. If you have the luxury of writing the ingest code you can use Cardinality Estimates using techniques described in https://github.com/medined/D4M_Schema/blob/master/docs/cardinality.md. On Sat, Oct 4, 2014 at 12:23 AM, Dylan Hutchison wrote: > This is for Accumulo 1.6. Suppose we have the table splits > > c > > g > > w > > > Does anyone know how to determine > > the number of tablets assigned to each table split range? > For this example, this is the number of tablets in the ranges (-Inf,c), > (c,g), (g,w), (w,Inf). Or is the design 1-1, that is, for each table split > range there is exactly one tablet? > the number of rows inside all the tablets occupying a table split range? > For this example, this is the total number of rows among all tablets in the > ranges (-Inf,c), (c,g), (g,w), (w,Inf). > > We use this count to verify how well manually set table splits are load > balancing in the tables. > > Some context: I wrote functions that found these numbers two years ago > working on D4M in Accumulo 1.5. I took the dark route of using non-public > Accumulo API to get TabletServer information, get TabletStats information, > and find the matchings to a table's splits by scanning the extents listed in > the METATABLE. I can share the code if anyone is curious. It's not pretty, > but it did the job. > > Moving forward as we aim to upgrade to Accumulo 1.6, we should determine the > tablet split information the right way, not by reverse engineering Accumulo. > Any suggestions? > > Thanks, > Dylan Hutchison > > -- > www.cs.stevens.edu/~dhutchis