hbase-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Michele (@pirroh) Catasta (JIRA)" <j...@apache.org>
Subject [jira] Commented: (HBASE-2655) 2-pass compression support
Date Wed, 02 Jun 2010 15:39:42 GMT

    [ https://issues.apache.org/jira/browse/HBASE-2655?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12874616#action_12874616
] 

Michele (@pirroh) Catasta commented on HBASE-2655:
--------------------------------------------------

Right, {NAME=>'cfamily', COMPRESSION=>'BMZ'} will do the job.

w.r.t. runtime exception: at the moment, it's happening the same for LZO. I just reproduced
the behavior I found in that class *smile*
Default fallback to NONE might be an option as well, but it would let you create the table
anyway - so people that are using hbase shell SCRIPT to create tables might experience some
regressions. Matter of tastes I'd say!
Anyway, if you agree on that I'll create another jira to deal with Compression.java and update
this patch as well.


> 2-pass compression support
> --------------------------
>
>                 Key: HBASE-2655
>                 URL: https://issues.apache.org/jira/browse/HBASE-2655
>             Project: HBase
>          Issue Type: New Feature
>          Components: io
>            Reporter: Michele (@pirroh) Catasta
>            Priority: Minor
>             Fix For: 0.21.0
>
>         Attachments: HBASE-2655.diff
>
>
> Quoting from BigTable paper: "Many clients use a two-pass custom compression scheme.
The first pass uses Bentley and McIlroy's scheme, which compresses long common strings across
a large window. The second pass uses a fast compression algorithm that looks for repetitions
in a small 16 KB window of the data. Both compression passes are very fast—they encode at
100-200 MB/s, and decode at 400-1000 MB/s on modern machines."
> The goal of this patch is to integrate a similar compression scheme in HBase.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


Mime
View raw message