Return-Path: X-Original-To: apmail-accumulo-user-archive@www.apache.org Delivered-To: apmail-accumulo-user-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id ED0C0106C7 for ; Sun, 26 Jan 2014 13:12:02 +0000 (UTC) Received: (qmail 64043 invoked by uid 500); 26 Jan 2014 13:12:02 -0000 Delivered-To: apmail-accumulo-user-archive@accumulo.apache.org Received: (qmail 63985 invoked by uid 500); 26 Jan 2014 13:12:01 -0000 Mailing-List: contact user-help@accumulo.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@accumulo.apache.org Delivered-To: mailing list user@accumulo.apache.org Received: (qmail 63977 invoked by uid 99); 26 Jan 2014 13:12:00 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Sun, 26 Jan 2014 13:12:00 +0000 X-ASF-Spam-Status: No, hits=1.7 required=5.0 tests=FREEMAIL_ENVFROM_END_DIGIT,HTML_MESSAGE,RCVD_IN_DNSWL_LOW,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (nike.apache.org: domain of jej2003@gmail.com designates 209.85.219.47 as permitted sender) Received: from [209.85.219.47] (HELO mail-oa0-f47.google.com) (209.85.219.47) by apache.org (qpsmtpd/0.29) with ESMTP; Sun, 26 Jan 2014 13:11:54 +0000 Received: by mail-oa0-f47.google.com with SMTP id m1so5596361oag.6 for ; Sun, 26 Jan 2014 05:11:33 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=mime-version:in-reply-to:references:date:message-id:subject:from:to :content-type; bh=kWg5WYMHTGJj+DPzn1JaQjTn5UPJT1RSS2k/sJAJjdE=; b=Uad2PKXu1BFz6sezU39Tuu7OofC3IfWEk3tCtFj/Ze1pUQwcoidJ6kcnd80HuNOe/Q jFk+/QejRym6LnYO45rOTjjDGQAh2S7/op+CVDZ/NxWgLinVKp6ageO3oYqghJNe97Re uuV222DNHyzA/nSGjDEeTQAF6GupUFN1Zo2Sn9C7rqahdnkgWxVQjX+FR3KQzRVJQue1 xEIdKyec3C2Uyu2ig7jYJMNocWst5tR90uUk6b+MPm+EpKwnan/FPVzB8C/zubtuo+Zb 4aUEunMQtTCsGQ2K6FRUwP86MlWUaNCbWs6fw2G7EdD1y3pnFkSTkrXG+mHUHigOlZm1 L0vQ== MIME-Version: 1.0 X-Received: by 10.60.98.40 with SMTP id ef8mr10482901oeb.13.1390741893740; Sun, 26 Jan 2014 05:11:33 -0800 (PST) Received: by 10.182.133.70 with HTTP; Sun, 26 Jan 2014 05:11:33 -0800 (PST) In-Reply-To: References: Date: Sun, 26 Jan 2014 08:11:33 -0500 Message-ID: Subject: Re: Key length limitations From: Jamie Johnson To: user@accumulo.apache.org Content-Type: multipart/alternative; boundary=089e013a18228cfe9c04f0df548f X-Virus-Checked: Checked by ClamAV on apache.org --089e013a18228cfe9c04f0df548f Content-Type: text/plain; charset=ISO-8859-1 Sorry for the basic question, but by key segment do you mean row key, column family column qualifier? So each should be less than a kilobyte or total? I think either way we are below that but I just want to make sure I understand the terminology. Again thanks for the reply. On Sun, Jan 26, 2014 at 2:12 AM, Christopher wrote: > On the order of a kilobyte, or less, per key segment is probably > ideal. Beyond that, you may want to experiment to see what works for > you. > > Up to the order of a megabyte may work fine in newer versions (at > least 1.4), but I'd start having concerns about your schema decisions > after that, even if your performance was fine... since ultimately, > Accumulo tables are a sorted index, and I'd wonder what kind of index > needs such large keys. > > I suspect a few hundred bytes is probably plenty for most applications. > > -- > Christopher L Tubbs II > http://gravatar.com/ctubbsii > > > On Sat, Jan 25, 2014 at 9:06 AM, Jamie Johnson wrote: > > I've read that in HBase there is a recommended limit on key lengths > because > > they're passed around quite a bit. Is the same true of accumulo? Are > there > > any recommendations in this regard? > --089e013a18228cfe9c04f0df548f Content-Type: text/html; charset=ISO-8859-1 Content-Transfer-Encoding: quoted-printable
Sorry for the basic question, but by key segment do you me= an row key, column family column qualifier? =A0So each should be less than = a kilobyte or total? =A0I think either way we are below that but I just wan= t to make sure I understand the terminology. =A0Again thanks for the reply.=


On Sun, Jan 2= 6, 2014 at 2:12 AM, Christopher <ctubbsii@apache.org> wrot= e:
On the order of a kilobyte, or less, per key= segment is probably
ideal. Beyond that, you may want to experiment to see what works for
you.

Up to the order of a megabyte may work fine in newer versions (at
least 1.4), but I'd start having concerns about your schema decisions after that, even if your performance was fine... since ultimately,
Accumulo tables are a sorted index, and I'd wonder what kind of index needs such large keys.

I suspect a few hundred bytes is probably plenty for most applications.

--
Christopher L Tubbs II
http://gravatar.= com/ctubbsii


On Sat, Jan 25, 2014 at 9:06 AM, Jamie Johnson <jej2003@gmail.com> wrote:
> I've read that in HBase there is a recommended limit on key length= s because
> they're passed around quite a bit. Is the same true of accumulo? = =A0Are there
> any recommendations in this regard?

--089e013a18228cfe9c04f0df548f--