From: John Vines
To: user@accumulo.apache.org
Date: Wed, 12 Dec 2012 13:17:47 -0500
Subject: Re: Table splitting

When a file gets written to HDFS, there is a guarantee that the file is local to the node that wrote it, as long as that system's disks are not full. Accumulo does not have a locality guarantee, as tablets will migrate on occasion. However, as data is added, major compactions will occur, which will restore locality.

On Dec 12, 2012 1:09 PM, "ameet kini" <ameetkini@gmail.com> wrote:
>
> Along these lines....
>
> Can someone help me understand how tablets map to files on disk in HDFS?
> From what I understand, after a compaction, there may be one (or more?)
> files on HDFS for a given tablet. Each file can consist of multiple HDFS
> blocks. Does Accumulo guarantee that the tablet serving a given data range
> finds all its blocks locally? If so, how does it keep this guarantee?
> Wouldn't HDFS distribute these blocks around based on HDFS balancing
> strategy?
>
> Thanks,
> Ameet
>
> On Tue, Dec 11, 2012 at 9:37 AM, William Slacum <
> wilhelm.von.cloud@accumulo.net> wrote:
>
>> Tablets will split automatically, down to the granularity of a row.
>>
>>
>> On Tue, Dec 11, 2012 at 9:32 AM, Mathias Herberts <
>> mathias.herberts@gmail.com> wrote:
>>
>>> Hi,
>>>
>>> I've read the user manual for v1.4.2 and I have not seen any mention of
>>> automatic tablet splitting. Is there such a thing in Accumulo or is
>>> pre-splitting the only way to split a table?
>>>
>>> Thanks,
>>>
>>> Mathias.
>>>
>>
>>
>
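To make the locality point concrete, here is a minimal toy model in Python. It is only a sketch: the `HdfsFile` and `Tablet` classes, the host names, and the file names are all hypothetical and are not Accumulo or HDFS APIs. It assumes that every file is written by the tablet server currently hosting the tablet, that the writer keeps a local replica, and that a major compaction rewrites all of the tablet's files into one. Other replicas are ignored.

```python
# Toy model only: hypothetical classes, not Accumulo or HDFS APIs.
from dataclasses import dataclass, field


@dataclass
class HdfsFile:
    name: str
    written_on: str  # node that wrote the file and so holds a local replica


@dataclass
class Tablet:
    host: str                                  # tablet server serving the tablet
    files: list = field(default_factory=list)  # files backing the tablet
    _counter: int = 0

    def _write_file(self) -> HdfsFile:
        # Assumption: the writer's disks are not full, so the copy is local.
        self._counter += 1
        return HdfsFile(f"F{self._counter:05d}.rf", self.host)

    def minor_compact(self) -> None:
        """Flush in-memory data to a new file written by the current host."""
        self.files.append(self._write_file())

    def migrate(self, new_host: str) -> None:
        """Reassign the tablet; existing files stay where they were written."""
        self.host = new_host

    def major_compact(self) -> None:
        """Rewrite all files into one new file written by the current host."""
        self.files = [self._write_file()]

    def locality(self) -> float:
        """Fraction of the tablet's files that were written on the serving host."""
        if not self.files:
            return 1.0
        local = sum(1 for f in self.files if f.written_on == self.host)
        return local / len(self.files)


if __name__ == "__main__":
    tablet = Tablet(host="tserver1")
    tablet.minor_compact()
    tablet.minor_compact()
    print("before migration:", tablet.locality())
    tablet.migrate("tserver2")
    print("after migration:", tablet.locality())
    tablet.minor_compact()  # new data is written on the new host
    print("after new writes:", tablet.locality())
    tablet.major_compact()
    print("after major compaction:", tablet.locality())
```

In this model locality drops when the tablet migrates, recovers partially as new data is flushed on the new host, and is fully restored once a major compaction rewrites the files there. A real cluster is less tidy: major compactions may rewrite only a subset of files, and a non-first replica may happen to sit on the new host.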
