Message-ID: <205369348.1203826879434.JavaMail.jira@brutus>
Date: Sat, 23 Feb 2008 20:21:19 -0800 (PST)
From: "Jim Kellerman (JIRA)"
To: hbase-dev@hadoop.apache.org
Reply-To: hbase-dev@hadoop.apache.org
Subject: [jira] Commented: (HBASE-76) [hbase] performance: Try to purge servers of Text

    [ https://issues.apache.org/jira/browse/HBASE-76?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12571852#action_12571852 ]

Jim Kellerman commented on HBASE-76:
------------------------------------

Agreed on separation.
The comparison of String and Text was just something I wanted to settle in my mind, since I had heard that String was far less efficient than Text. Turns out, not so much.

> [hbase] performance: Try to purge servers of Text
> -------------------------------------------------
>
>                 Key: HBASE-76
>                 URL: https://issues.apache.org/jira/browse/HBASE-76
>             Project: Hadoop HBase
>          Issue Type: Improvement
>          Components: regionserver
>            Reporter: stack
>            Priority: Minor
>         Attachments: TextVsString.java
>
>
> Chatting with Jim while looking at profiler outputs: we should make an effort at purging the servers of the Text type so that HRegionServer never has to deal in characters and the concomitant encode/decode to UTF-8. Toward this end, we'd make changes like moving HStoreKey to have four rather than three data members: column family, column family qualifier, row and timestamp, with the first three done as a basic Writable -- ImmutableBytesWritable? -- and the timestamp as a long, rather than a Text column, a Text row and a long timestamp. This would save us the relatively expensive 'find' of the column family separator inside extractFamily (>10% of CPU scanning). Chatting about it, we could effect the change without any change to the public client API; clients could continue to take the Text type for row and column, and then, client-side, the conversion to HStoreKey could be done before crossing the wire to the server.
>

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.
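
For illustration only, the HStoreKey change proposed in the issue description might look roughly like the sketch below. It is not code from HBase: the class name SplitKeySketch, the nested Key type, and the methods toKey and inFamily are all hypothetical, plain byte[] stands in for ImmutableBytesWritable, and ':' is assumed to be the column family separator. The idea is that the client encodes to UTF-8 and finds the separator once, so the server only ever compares bytes.

```java
import java.nio.charset.StandardCharsets;
import java.util.Arrays;

/** Hypothetical sketch of a four-member key; not the actual HStoreKey. */
public final class SplitKeySketch {

    /** Assumed family/qualifier separator, as in "info:name". */
    static final byte SEPARATOR = (byte) ':';

    /** Four data members: three byte arrays plus a long timestamp. */
    static final class Key {
        final byte[] row;
        final byte[] family;
        final byte[] qualifier;
        final long timestamp;

        Key(byte[] row, byte[] family, byte[] qualifier, long timestamp) {
            this.row = row;
            this.family = family;
            this.qualifier = qualifier;
            this.timestamp = timestamp;
        }
    }

    /**
     * Client-side conversion: encode once, search for the separator once.
     * Scanning the encoded bytes is safe because the ASCII byte for ':'
     * never occurs inside a multi-byte UTF-8 sequence.
     */
    static Key toKey(String row, String column, long timestamp) {
        byte[] col = column.getBytes(StandardCharsets.UTF_8);
        int sep = -1;
        for (int i = 0; i < col.length; i++) {
            if (col[i] == SEPARATOR) {
                sep = i;
                break;
            }
        }
        if (sep < 0) {
            throw new IllegalArgumentException("no family separator in column: " + column);
        }
        return new Key(
                row.getBytes(StandardCharsets.UTF_8),
                Arrays.copyOfRange(col, 0, sep),
                Arrays.copyOfRange(col, sep + 1, col.length),
                timestamp);
    }

    /** Server-side family check: a byte comparison, no search and no decoding. */
    static boolean inFamily(Key key, byte[] family) {
        return Arrays.equals(key.family, family);
    }

    public static void main(String[] args) {
        Key k = toKey("row1", "info:name", 1203826879434L);
        System.out.println(inFamily(k, "info".getBytes(StandardCharsets.UTF_8)));
        System.out.println(new String(k.qualifier, StandardCharsets.UTF_8));
    }
}
```

In this sketch the public entry point still takes String arguments (standing in for Text), which mirrors the suggestion that the client API could stay unchanged while the conversion happens before the key crosses the wire.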