Return-Path: X-Original-To: apmail-hbase-issues-archive@www.apache.org Delivered-To: apmail-hbase-issues-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 21C2DE6BF for ; Thu, 14 Feb 2013 22:41:13 +0000 (UTC) Received: (qmail 52703 invoked by uid 500); 14 Feb 2013 22:41:12 -0000 Delivered-To: apmail-hbase-issues-archive@hbase.apache.org Received: (qmail 52566 invoked by uid 500); 14 Feb 2013 22:41:12 -0000 Mailing-List: contact issues-help@hbase.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Delivered-To: mailing list issues@hbase.apache.org Received: (qmail 52556 invoked by uid 99); 14 Feb 2013 22:41:12 -0000 Received: from arcas.apache.org (HELO arcas.apache.org) (140.211.11.28) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 14 Feb 2013 22:41:12 +0000 Date: Thu, 14 Feb 2013 22:41:12 +0000 (UTC) From: "Harsh J (JIRA)" To: issues@hbase.apache.org Message-ID: In-Reply-To: References: Subject: [jira] [Created] (HBASE-7850) Bulkload final step can detect and pre-split tables MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 Harsh J created HBASE-7850: ------------------------------ Summary: Bulkload final step can detect and pre-split tables Key: HBASE-7850 URL: https://issues.apache.org/jira/browse/HBASE-7850 Project: HBase Issue Type: Improvement Components: Client Reporter: Harsh J Priority: Minor Many new devs (read: POC folks?) aren't aware of the pre-split feature of table creation, given that its mostly manual. This leads to situations where a huge amount of data gets loaded into a single or a small set of regions, causing further issues such as non-assignment or lack of performance. Given that the final step of bulkload has a good picture of what they keys may look like, it could split the table first if it detects certain conditions (we can go over this in comments) and then perform the bulkload. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira