Return-Path: X-Original-To: apmail-hbase-issues-archive@www.apache.org Delivered-To: apmail-hbase-issues-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id DD0AB9EEE for ; Fri, 9 Mar 2012 21:55:22 +0000 (UTC) Received: (qmail 27698 invoked by uid 500); 9 Mar 2012 21:55:22 -0000 Delivered-To: apmail-hbase-issues-archive@hbase.apache.org Received: (qmail 27575 invoked by uid 500); 9 Mar 2012 21:55:22 -0000 Mailing-List: contact issues-help@hbase.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Delivered-To: mailing list issues@hbase.apache.org Received: (qmail 27567 invoked by uid 99); 9 Mar 2012 21:55:22 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 09 Mar 2012 21:55:22 +0000 X-ASF-Spam-Status: No, hits=-2000.0 required=5.0 tests=ALL_TRUSTED,T_RP_MATCHES_RCVD X-Spam-Check-By: apache.org Received: from [140.211.11.116] (HELO hel.zones.apache.org) (140.211.11.116) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 09 Mar 2012 21:55:20 +0000 Received: from hel.zones.apache.org (hel.zones.apache.org [140.211.11.116]) by hel.zones.apache.org (Postfix) with ESMTP id 7FBC716D20 for ; Fri, 9 Mar 2012 21:54:59 +0000 (UTC) Date: Fri, 9 Mar 2012 21:54:59 +0000 (UTC) From: "Zhihong Yu (Commented) (JIRA)" To: issues@hbase.apache.org Message-ID: <233911562.45792.1331330099524.JavaMail.tomcat@hel.zones.apache.org> In-Reply-To: <1642646625.2855.1318891871332.JavaMail.tomcat@hel.zones.apache.org> Subject: [jira] [Commented] (HBASE-4608) HLog Compression MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 X-Virus-Checked: Checked by ClamAV on apache.org [ https://issues.apache.org/jira/browse/HBASE-4608?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13226519#comment-13226519 ] Zhihong Yu commented on HBASE-4608: ----------------------------------- {code} + public static int hashBytes(byte[] bytes, int offset, int length) { {code} The above method allows to start computation at specified offset while existing hashCode() doesn't have this parameter. The remark of putting compression flag as sequence file attribute is really good. Looking at SequenceFile.Sorter.cloneFileAttributes(), I don't see a convenient way for doing above. For HLogKey, can we designate version of -2 for representing compressed HLogKey ? If HLogKey isn't compressed, we write -1. > HLog Compression > ---------------- > > Key: HBASE-4608 > URL: https://issues.apache.org/jira/browse/HBASE-4608 > Project: HBase > Issue Type: New Feature > Reporter: Li Pi > Assignee: Li Pi > Fix For: 0.94.0 > > Attachments: 4608-v19.txt, 4608v1.txt, 4608v13.txt, 4608v13.txt, 4608v14.txt, 4608v15.txt, 4608v16.txt, 4608v17.txt, 4608v18.txt, 4608v5.txt, 4608v6.txt, 4608v7.txt, 4608v8fixed.txt > > > The current bottleneck to HBase write speed is replicating the WAL appends across different datanodes. We can speed up this process by compressing the HLog. Current plan involves using a dictionary to compress table name, region id, cf name, and possibly other bits of repeated data. Also, HLog format may be changed in other ways to produce a smaller HLog. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira