Return-Path: Delivered-To: apmail-hadoop-hbase-dev-archive@locus.apache.org Received: (qmail 89393 invoked from network); 26 Jun 2008 03:43:06 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (140.211.11.2) by minotaur.apache.org with SMTP; 26 Jun 2008 03:43:06 -0000 Received: (qmail 2035 invoked by uid 500); 26 Jun 2008 03:43:07 -0000 Delivered-To: apmail-hadoop-hbase-dev-archive@hadoop.apache.org Received: (qmail 2000 invoked by uid 500); 26 Jun 2008 03:43:07 -0000 Mailing-List: contact hbase-dev-help@hadoop.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: hbase-dev@hadoop.apache.org Delivered-To: mailing list hbase-dev@hadoop.apache.org Received: (qmail 1977 invoked by uid 99); 26 Jun 2008 03:43:07 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 25 Jun 2008 20:43:07 -0700 X-ASF-Spam-Status: No, hits=-2000.0 required=10.0 tests=ALL_TRUSTED X-Spam-Check-By: apache.org Received: from [140.211.11.140] (HELO brutus.apache.org) (140.211.11.140) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 26 Jun 2008 03:42:25 +0000 Received: from brutus (localhost [127.0.0.1]) by brutus.apache.org (Postfix) with ESMTP id 1007B234C14F for ; Wed, 25 Jun 2008 20:42:45 -0700 (PDT) Message-ID: <1154902750.1214451765064.JavaMail.jira@brutus> Date: Wed, 25 Jun 2008 20:42:45 -0700 (PDT) From: "stack (JIRA)" To: hbase-dev@hadoop.apache.org Subject: [jira] Assigned: (HBASE-707) High-load import of data into single table/family never triggers split In-Reply-To: <643024488.1214416007086.JavaMail.jira@brutus> MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-Virus-Checked: Checked by ClamAV on apache.org [ https://issues.apache.org/jira/browse/HBASE-707?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] stack reassigned HBASE-707: --------------------------- Assignee: stack > High-load import of data into single table/family never triggers split > ---------------------------------------------------------------------- > > Key: HBASE-707 > URL: https://issues.apache.org/jira/browse/HBASE-707 > Project: Hadoop HBase > Issue Type: Bug > Affects Versions: 0.1.3 > Environment: Linux 2.6.25-14.fc9.x86_64, Fedora Core 9 > Reporter: Jonathan Gray > Assignee: stack > Fix For: 0.1.3 > > Attachments: 707.patch > > > Importing a heavy amount of data into a single table and family. > One column in that family (the same fam:col for every row) contains a frequently large amount of UTF-8 data. This column grows and grows but never causes a region split. > Currently there is a single mapfile containing nearly 10GB. > Eventually this has caused regions to crash with OOME, as described in HBASE-706 > Table in question: > hql > describe items; > +-----------------------------------------------------------------------------+ > | Column Family Descriptor | > +-----------------------------------------------------------------------------+ > | name: cfrecs, max versions: 2, compression: NONE, in memory: false, max leng| > | th: 2147483647, bloom filter: none | > +-----------------------------------------------------------------------------+ > | name: clusters, max versions: 2, compression: NONE, in memory: false, max le| > | ngth: 2147483647, bloom filter: none | > +-----------------------------------------------------------------------------+ > | name: content, max versions: 2, compression: NONE, in memory: false, max len| > | gth: 2147483647, bloom filter: none | > +-----------------------------------------------------------------------------+ > | name: readby, max versions: 2, compression: NONE, in memory: false, max leng| > | th: 2147483647, bloom filter: none | > +-----------------------------------------------------------------------------+ > | name: receivedby, max versions: 2, compression: NONE, in memory: false, max | > | length: 2147483647, bloom filter: none | > +-----------------------------------------------------------------------------+ > | name: savedby, max versions: 2, compression: NONE, in memory: false, max len| > | gth: 2147483647, bloom filter: none | > +-----------------------------------------------------------------------------+ > | name: sentby, max versions: 2, compression: NONE, in memory: false, max leng| > | th: 2147483647, bloom filter: none | > +-----------------------------------------------------------------------------+ > 7 columnfamily(s) in set. (0.34 sec) -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online.