From: "Perko, Ralph J"
To: "user@accumulo.apache.org"
Date: Wed, 6 Jun 2012 13:46:30 -0700
Subject: Table design

Hi,

I am in the process of designing some
Accumulo tables for an app and have some questions:

Lookup Table:

The data's natural qualifier is a title. This title can be any length; some are as long as 200 characters. I am using this title as a row id and also as a column qualifier in other places.

Is it considered good practice to have a lookup table for these titles (as in an RDBMS), replacing them with some incremented integer value, or should I just continue to use these long titles as row ids?

Multiple Tables:

What are the best practices around when to create a new table? I have been breaking up my tables based on row id semantics. For example, title row ids are in a different table than row ids based on some analysis count.

Does breaking up data into multiple tables help, hurt, or do nothing for Accumulo performance?

Thanks,
Ralph

__________________________________________________
Ralph Perko
Pacific Northwest National Laboratory