Return-Path: X-Original-To: apmail-accumulo-user-archive@www.apache.org Delivered-To: apmail-accumulo-user-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 23B4410D9C for ; Wed, 4 Dec 2013 19:19:03 +0000 (UTC) Received: (qmail 44762 invoked by uid 500); 4 Dec 2013 19:19:02 -0000 Delivered-To: apmail-accumulo-user-archive@accumulo.apache.org Received: (qmail 44716 invoked by uid 500); 4 Dec 2013 19:19:02 -0000 Mailing-List: contact user-help@accumulo.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@accumulo.apache.org Delivered-To: mailing list user@accumulo.apache.org Received: (qmail 44708 invoked by uid 99); 4 Dec 2013 19:19:02 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 04 Dec 2013 19:19:02 +0000 X-ASF-Spam-Status: No, hits=1.0 required=5.0 tests=FREEMAIL_REPLY,RCVD_IN_DNSWL_NONE,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (athena.apache.org: domain of yuzhihong@gmail.com designates 209.85.192.170 as permitted sender) Received: from [209.85.192.170] (HELO mail-pd0-f170.google.com) (209.85.192.170) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 04 Dec 2013 19:18:57 +0000 Received: by mail-pd0-f170.google.com with SMTP id g10so23108477pdj.29 for ; Wed, 04 Dec 2013 11:18:36 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=references:mime-version:in-reply-to:content-type :content-transfer-encoding:message-id:cc:from:subject:date:to; bh=C1SC7j1+/q7OQ6CNaGMxysrWkYnL4mCXwDJESAul1Nk=; b=H9YaunFpXLS4LxVQHzKfTYgDzr+MruPrOL0QNhKqr3i0S/hsccDakq6lFoTVbjmN34 CWJTR5kZlHMOGcOrOhxvgmpELQQtLAIqyU9/P2h8+fAWINybduCIVrmMoT0frON6fgld fS1bCMPcmEsYW2jnIhbPpdQkxS+PF08B/AWVtrkiCBxjpibigs6Zy7/uc2oDRkxVgPqU EwzvCdTm1vOeRE3WcrwZOP9DIV+bGu8fS4Lzz9Tcb3HEOuojt9mgs13DQGk7dFkN+3Ej eyQflooip5YnLOUuuveo6GHqF+h0ELMskamFEYBknj631JtC3kremR7E12QEa36lac6n XWWA== X-Received: by 10.68.172.196 with SMTP id be4mr47629646pbc.12.1386184716733; Wed, 04 Dec 2013 11:18:36 -0800 (PST) Received: from [172.31.16.182] ([222.128.247.163]) by mx.google.com with ESMTPSA id hw10sm139186255pbc.24.2013.12.04.11.18.35 for (version=TLSv1 cipher=ECDHE-RSA-RC4-SHA bits=128/128); Wed, 04 Dec 2013 11:18:36 -0800 (PST) References: <529F5712.2050809@gmail.com> Mime-Version: 1.0 (1.0) In-Reply-To: <529F5712.2050809@gmail.com> Content-Type: text/plain; charset=us-ascii Content-Transfer-Encoding: quoted-printable Message-Id: <7D900D11-8E12-45E4-906D-370A453F4DDE@gmail.com> Cc: "user@accumulo.apache.org" X-Mailer: iPhone Mail (10B146) From: Ted Yu Subject: Re: HBase rowkey design guidelines Date: Thu, 5 Dec 2013 03:18:18 +0800 To: "user@accumulo.apache.org" X-Virus-Checked: Checked by ClamAV on apache.org For HFile v3, please take a look at: HBASE-9045 Dictionary based tag compression Cheers On Dec 5, 2013, at 12:23 AM, Josh Elser wrote: > They have a couple of different encoding strategies in HFile v2 that are s= imilar. >=20 > https://issues.apache.org/jira/browse/HBASE-4218 > https://issues.apache.org/jira/browse/HBASE-4676 >=20 > Not sure if there are any new slated approaches for HFile v3. >=20 > On 12/4/13, 12:28 AM, John Vines wrote: >> Also, I'm not sure if HBase has the encoding techniques that we utilize >> in our RFile >>=20 >> On Wed, Dec 4, 2013 at 12:19 AM, Mike Drob > > wrote: >>=20 >> Well, yes and no. >>=20 >> Smaller keys still mean less network traffic, potentially less IO, >> and maybe faster operations if you're trying to do application >> logic. Using data or default or just d probably doesn't matter in >> the long term (although there are certainly cases where it might). >>=20 >> On Dec 3, 2013 11:57 PM, "David Medinets" > > wrote: >>=20 >> http://hbase.apache.org/book/rowkey.design.html - unless I am >> misunderstanding much of the advice given for HBase simply >> doesn't apply to Accumulo. For example "Try to keep the >> ColumnFamily names as small as possible, preferably one >> character (e.g. "d" for data/default)." >>=20 >>=20