Return-Path: Delivered-To: apmail-hbase-issues-archive@www.apache.org Received: (qmail 78016 invoked from network); 23 Jul 2010 00:36:22 -0000 Received: from unknown (HELO mail.apache.org) (140.211.11.3) by 140.211.11.9 with SMTP; 23 Jul 2010 00:36:22 -0000 Received: (qmail 3284 invoked by uid 500); 23 Jul 2010 00:36:22 -0000 Delivered-To: apmail-hbase-issues-archive@hbase.apache.org Received: (qmail 3231 invoked by uid 500); 23 Jul 2010 00:36:22 -0000 Mailing-List: contact issues-help@hbase.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Delivered-To: mailing list issues@hbase.apache.org Received: (qmail 3223 invoked by uid 99); 23 Jul 2010 00:36:22 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 23 Jul 2010 00:36:22 +0000 X-ASF-Spam-Status: No, hits=-2000.0 required=10.0 tests=ALL_TRUSTED X-Spam-Check-By: apache.org Received: from [140.211.11.22] (HELO thor.apache.org) (140.211.11.22) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 23 Jul 2010 00:36:15 +0000 Received: from thor (localhost [127.0.0.1]) by thor.apache.org (8.13.8+Sun/8.13.8) with ESMTP id o6N0Zsab006615 for ; Fri, 23 Jul 2010 00:35:54 GMT Message-ID: <31357723.530481279845354051.JavaMail.jira@thor> Date: Thu, 22 Jul 2010 20:35:54 -0400 (EDT) From: "HBase Review Board (JIRA)" To: issues@hbase.apache.org Subject: [jira] Commented: (HBASE-1364) [performance] Distributed splitting of regionserver commit logs MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 X-Virus-Checked: Checked by ClamAV on apache.org [ https://issues.apache.org/jira/browse/HBASE-1364?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12891424#action_12891424 ] HBase Review Board commented on HBASE-1364: ------------------------------------------- Message from: "Alex Newman" ----------------------------------------------------------- This is an automatically generated e-mail. To reply, visit: http://review.hbase.org/r/370/ ----------------------------------------------------------- Review request for hbase. Summary ------- This build on the previous work. It does some smarter stuff with testing and now splitting is configurable. This addresses bug hbase-1364. http://issues.apache.org/jira/browse/hbase-1364 Diffs ----- src/main/java/org/apache/hadoop/hbase/HConstants.java c77ebf5 src/main/java/org/apache/hadoop/hbase/regionserver/HRegionServer.java f251d54 src/main/java/org/apache/hadoop/hbase/regionserver/LogSplitter.java PRE-CREATION src/main/java/org/apache/hadoop/hbase/regionserver/wal/HLog.java 5688c03 src/main/java/org/apache/hadoop/hbase/zookeeper/ZooKeeperWrapper.java 8225178 src/main/resources/hbase-default.xml e3a9669 src/test/java/org/apache/hadoop/hbase/regionserver/wal/BaseTestHLogSplit.java PRE-CREATION src/test/java/org/apache/hadoop/hbase/regionserver/wal/DistributedTestHLog.java PRE-CREATION src/test/java/org/apache/hadoop/hbase/regionserver/wal/DistributedTestHLogSplit.java PRE-CREATION src/test/java/org/apache/hadoop/hbase/regionserver/wal/DistributedTestHLogSplitSkipErrors.java PRE-CREATION src/test/java/org/apache/hadoop/hbase/regionserver/wal/DistributedTestLogRolling.java PRE-CREATION src/test/java/org/apache/hadoop/hbase/regionserver/wal/TestHLog.java ad8f9e5 src/test/java/org/apache/hadoop/hbase/regionserver/wal/TestHLogSplit.java 908633e src/test/java/org/apache/hadoop/hbase/regionserver/wal/TestHLogSplitSkipErrors.java PRE-CREATION src/test/java/org/apache/hadoop/hbase/regionserver/wal/TestLogActionsListener.java 776d78c src/test/java/org/apache/hadoop/hbase/regionserver/wal/TestLogRolling.java 9eae4b4 src/test/resources/hbase-site.xml 3c0601a Diff: http://review.hbase.org/r/370/diff Testing ------- ran on our private hudson Thanks, Alex > [performance] Distributed splitting of regionserver commit logs > --------------------------------------------------------------- > > Key: HBASE-1364 > URL: https://issues.apache.org/jira/browse/HBASE-1364 > Project: HBase > Issue Type: Improvement > Reporter: stack > Assignee: Alex Newman > Priority: Critical > Fix For: 0.92.0 > > Attachments: 1 (3), 1364-v2.patch, 1364.patch > > Time Spent: 8h > Remaining Estimate: 0h > > HBASE-1008 has some improvements to our log splitting on regionserver crash; but it needs to run even faster. > (Below is from HBASE-1008) > In bigtable paper, the split is distributed. If we're going to have 1000 logs, we need to distribute or at least multithread the splitting. > 1. As is, regions starting up expect to find one reconstruction log only. Need to make it so pick up a bunch of edit logs and it should be fine that logs are elsewhere in hdfs in an output directory written by all split participants whether multithreaded or a mapreduce-like distributed process (Lets write our distributed sort first as a MR so we learn whats involved; distributed sort, as much as possible should use MR framework pieces). On startup, regions go to this directory and pick up the files written by split participants deleting and clearing the dir when all have been read in. Making it so can take multiple logs for input, can also make the split process more robust rather than current tenuous process which loses all edits if it doesn't make it to the end without error. > 2. Each column family rereads the reconstruction log to find its edits. Need to fix that. Split can sort the edits by column family so store only reads its edits. -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online.