Return-Path: Delivered-To: apmail-lucene-hadoop-dev-archive@locus.apache.org Received: (qmail 81222 invoked from network); 3 Aug 2007 17:11:15 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (140.211.11.2) by minotaur.apache.org with SMTP; 3 Aug 2007 17:11:15 -0000 Received: (qmail 29254 invoked by uid 500); 3 Aug 2007 17:11:13 -0000 Delivered-To: apmail-lucene-hadoop-dev-archive@lucene.apache.org Received: (qmail 29223 invoked by uid 500); 3 Aug 2007 17:11:13 -0000 Mailing-List: contact hadoop-dev-help@lucene.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: hadoop-dev@lucene.apache.org Delivered-To: mailing list hadoop-dev@lucene.apache.org Received: (qmail 29212 invoked by uid 99); 3 Aug 2007 17:11:13 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 03 Aug 2007 10:11:13 -0700 X-ASF-Spam-Status: No, hits=-100.0 required=10.0 tests=ALL_TRUSTED X-Spam-Check-By: apache.org Received: from [140.211.11.4] (HELO brutus.apache.org) (140.211.11.4) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 03 Aug 2007 17:11:13 +0000 Received: from brutus (localhost [127.0.0.1]) by brutus.apache.org (Postfix) with ESMTP id 2CA8E7141F5 for ; Fri, 3 Aug 2007 10:10:53 -0700 (PDT) Message-ID: <3514244.1186161053180.JavaMail.jira@brutus> Date: Fri, 3 Aug 2007 10:10:53 -0700 (PDT) From: "Jim Kellerman (JIRA)" To: hadoop-dev@lucene.apache.org Subject: [jira] Updated: (HADOOP-1644) [hbase] Compactions should not block updates In-Reply-To: <21764521.1184954346200.JavaMail.jira@brutus> MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-Virus-Checked: Checked by ClamAV on apache.org [ https://issues.apache.org/jira/browse/HADOOP-1644?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jim Kellerman updated HADOOP-1644: ---------------------------------- Fix Version/s: 0.15.0 Priority: Major (was: Minor) Issue Type: Improvement (was: Wish) Affects Version/s: 0.15.0 Summary: [hbase] Compactions should not block updates (was: [hbase] Compactions should take no longer than period between memcache flushes) > [hbase] Compactions should not block updates > -------------------------------------------- > > Key: HADOOP-1644 > URL: https://issues.apache.org/jira/browse/HADOOP-1644 > Project: Hadoop > Issue Type: Improvement > Components: contrib/hbase > Affects Versions: 0.15.0 > Reporter: stack > Assignee: stack > Fix For: 0.15.0 > > > Currently, compactions take a long time. During compaction, updates are carried by the HRegions' memcache (+ backing HLog). memcache is unable to flush to disk until compaction completes. > Under sustained, substantial -- rows that contain multiple columns one of which is a web page -- updates by multiple concurrent clients (10 in this case), a common hbase usage scenario, the memcache grows fast and often to orders of magnitude in excess of the configured 'flush-to-disk' threshold. > This throws the whole system out of kilter. When memcache does get to run after compaction completes -- assuming you have sufficent RAM and the region server doesn't OOME -- then the resulting on-disk file will be way larger than any other on-disk HStoreFile bringing on a region split ..... but the resulting split will produce regions that themselves need to be immediately split because each half is beyond the configured limit, and so on... > In another issue yet to be posted, tuning and some pointed memcache flushes makes the above condition less extreme but until compaction durations come close to the memcache flush threshold compactions will remain disruptive. > Its allowed that compactions may never be fast enough as per bigtable paper (This is a 'wish' issue). -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online.