Return-Path: X-Original-To: apmail-hive-dev-archive@www.apache.org Delivered-To: apmail-hive-dev-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id AF7C1101F9 for ; Thu, 1 Aug 2013 20:39:49 +0000 (UTC) Received: (qmail 65424 invoked by uid 500); 1 Aug 2013 20:39:49 -0000 Delivered-To: apmail-hive-dev-archive@hive.apache.org Received: (qmail 65295 invoked by uid 500); 1 Aug 2013 20:39:49 -0000 Mailing-List: contact dev-help@hive.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: dev@hive.apache.org Delivered-To: mailing list dev@hive.apache.org Received: (qmail 65287 invoked by uid 500); 1 Aug 2013 20:39:49 -0000 Delivered-To: apmail-hadoop-hive-dev@hadoop.apache.org Received: (qmail 65284 invoked by uid 99); 1 Aug 2013 20:39:49 -0000 Received: from arcas.apache.org (HELO arcas.apache.org) (140.211.11.28) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 01 Aug 2013 20:39:49 +0000 Date: Thu, 1 Aug 2013 20:39:49 +0000 (UTC) From: "vikram s (JIRA)" To: hive-dev@hadoop.apache.org Message-ID: In-Reply-To: References: Subject: [jira] [Commented] (HIVE-2590) HBase bulk load wiki page improvements MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 [ https://issues.apache.org/jira/browse/HIVE-2590?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13726853#comment-13726853 ] vikram s commented on HIVE-2590: -------------------------------- I wanted to update wiki.as most of of the stuff is unclear. > HBase bulk load wiki page improvements > -------------------------------------- > > Key: HIVE-2590 > URL: https://issues.apache.org/jira/browse/HIVE-2590 > Project: Hive > Issue Type: Bug > Components: Documentation, HBase Handler > Reporter: Ben West > Assignee: Ben West > Priority: Minor > Labels: wiki > Fix For: 0.8.0 > > > Some suggestions on the page https://cwiki.apache.org/confluence/display/Hive/HBaseBulkLoad which seems kind of out of date: > 1. It seems like it's required that the number of reduce tasks in the "Sort Data" phase be one more than the number of keys selected in the "Range Partitioning" step, or else you get an error like this: > Caused by: java.lang.IllegalArgumentException: Can't read partitions file > at org.apache.hadoop.mapred.lib.TotalOrderPartitioner.configure(TotalOrderPartitioner.java:91) > ... 15 more > Caused by: java.io.IOException: Wrong number of partitions in keyset > at org.apache.hadoop.mapred.lib.TotalOrderPartitioner.configure(TotalOrderPartitioner.java:72) > ... 15 more > If so, it would be helpful if this was explicitly pointed out. > 2. It recommends that you should use the "loadtable" ruby script to put data into hbase, but if you run this on newer versions of HBase (e.g. 0.90.3) it errors: > DISABLED!!!! Use completebulkload instead. See tail of http://hbase.apache.org/bulk-loads.html > The instructions should probably be changed to use completebulkload instead of this script. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira