Date: Tue, 18 Dec 2012 18:42:14 +0000 (UTC)
From: "Christopher Tubbs (JIRA)"
To: notifications@accumulo.apache.org
Reply-To: jira@apache.org
Subject: [jira] [Updated] (ACCUMULO-501) RFile should store the key count in metadata

[ https://issues.apache.org/jira/browse/ACCUMULO-501?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Christopher Tubbs updated ACCUMULO-501:
---------------------------------------
    Fix Version/s: (was: 1.5.0)

> RFile should store the key count in metadata
> --------------------------------------------
>
>          Key: ACCUMULO-501
>          URL: https://issues.apache.org/jira/browse/ACCUMULO-501
>      Project: Accumulo
>   Issue Type: Improvement
>     Reporter: Eric Newton
>     Assignee: Eric Newton
>
> BulkImport estimates the number of keys in a file to be zero.
> We store the largest and smallest key in metadata; I think we can afford to store the key count as well and use it to provide an estimate when we load the file into the tablet. Perhaps if we know the start key is "a" and the end key is "z" and the tablet's range is "a->m", we can just estimate 50% of the key count.
> When a bulk file fits completely in a range, the key count estimate will be accurate.
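
The proportional estimate described in the issue could be sketched roughly as follows. This is an illustration only, not Accumulo code: the function names `key_position` and `estimate_keys` are hypothetical, and it assumes keys are byte strings spread roughly uniformly, so that a key's leading bytes can be read as a fraction between 0 and 1.

```python
from typing import Optional


def key_position(key: bytes, width: int = 8) -> float:
    """Map a key to [0, 1) by reading its first `width` bytes as a base-256 fraction.

    Hypothetical helper: assumes keys are roughly uniformly distributed bytes.
    """
    padded = key[:width].ljust(width, b"\x00")
    return int.from_bytes(padded, "big") / 256 ** width


def estimate_keys(
    first_key: bytes,
    last_key: bytes,
    key_count: int,
    tablet_start: Optional[bytes] = None,
    tablet_end: Optional[bytes] = None,
) -> int:
    """Estimate how many of a file's keys fall inside a tablet's range.

    first_key, last_key and key_count stand in for the values the issue
    proposes keeping in the file's metadata. None means the tablet is
    unbounded on that side.
    """
    file_lo = key_position(first_key)
    file_hi = key_position(last_key)

    # Clip the file's key range to the tablet's range.
    lo = file_lo if tablet_start is None else max(file_lo, key_position(tablet_start))
    hi = file_hi if tablet_end is None else min(file_hi, key_position(tablet_end))

    if hi < lo:
        return 0  # the file and the tablet do not overlap
    if file_hi == file_lo:
        return key_count  # all keys share one position, and it is in range

    # Scale the stored count by the overlapping share of the file's range.
    return round(key_count * (hi - lo) / (file_hi - file_lo))
```

With a file spanning "a" to "z", a stored count of 1000, and a tablet range of "a" to "m", `estimate_keys(b"a", b"z", 1000, b"a", b"m")` gives 480, close to the 50% guess in the description. When the file fits completely inside the tablet's range, the stored count is returned unchanged, which matches the observation that the estimate is accurate in that case.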