Return-Path: Delivered-To: apmail-hadoop-hbase-dev-archive@locus.apache.org Received: (qmail 37648 invoked from network); 12 Dec 2008 20:46:05 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (140.211.11.2) by minotaur.apache.org with SMTP; 12 Dec 2008 20:46:05 -0000 Received: (qmail 76264 invoked by uid 500); 12 Dec 2008 20:46:17 -0000 Delivered-To: apmail-hadoop-hbase-dev-archive@hadoop.apache.org Received: (qmail 76245 invoked by uid 500); 12 Dec 2008 20:46:17 -0000 Mailing-List: contact hbase-dev-help@hadoop.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: hbase-dev@hadoop.apache.org Delivered-To: mailing list hbase-dev@hadoop.apache.org Received: (qmail 76234 invoked by uid 99); 12 Dec 2008 20:46:17 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 12 Dec 2008 12:46:17 -0800 X-ASF-Spam-Status: No, hits=-2000.0 required=10.0 tests=ALL_TRUSTED X-Spam-Check-By: apache.org Received: from [140.211.11.140] (HELO brutus.apache.org) (140.211.11.140) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 12 Dec 2008 20:46:04 +0000 Received: from brutus (localhost [127.0.0.1]) by brutus.apache.org (Postfix) with ESMTP id 660B9234C3BD for ; Fri, 12 Dec 2008 12:45:44 -0800 (PST) Message-ID: <1815842705.1229114744416.JavaMail.jira@brutus> Date: Fri, 12 Dec 2008 12:45:44 -0800 (PST) From: "Jonathan Gray (JIRA)" To: hbase-dev@hadoop.apache.org Subject: [jira] Updated: (HBASE-1057) Example MR jobs to simulate bulk importing In-Reply-To: <519721255.1229114144419.JavaMail.jira@brutus> MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-Virus-Checked: Checked by ClamAV on apache.org [ https://issues.apache.org/jira/browse/HBASE-1057?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jonathan Gray updated HBASE-1057: --------------------------------- Attachment: ImportTestMR_v1.java MR job with tweakable parameters that bulk uploads rows with fixed-length and variable-length columns and values. Currently requires a single line HDFS file as input and an HBase table that exists as output, though this uses the API directly and not TOF. Someone better with MR might be able to remove these two requirements. There are some commented out lines in the case that you want to use write batching. Can be easily modified to use TOF instead. > Example MR jobs to simulate bulk importing > ------------------------------------------ > > Key: HBASE-1057 > URL: https://issues.apache.org/jira/browse/HBASE-1057 > Project: Hadoop HBase > Issue Type: New Feature > Reporter: Jonathan Gray > Assignee: Jonathan Gray > Priority: Trivial > Attachments: ImportTestMR_v1.java > > > It's very useful to have standalone MR jobs that simulate production system load characteristics. Specifically bulk importing as this has been uncovering OOME and long-running compaction issues. -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online.