hadoop-common-issues mailing list archives

From "Benoit Sigoure (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (HADOOP-7206) Integrate Snappy compression
Date Mon, 28 Mar 2011 02:18:06 GMT

    [ https://issues.apache.org/jira/browse/HADOOP-7206?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13011893#comment-13011893 ]

Benoit Sigoure commented on HADOOP-7206:

Allen, I agree that the situation with back-porting everything in Hadoop has gotten to a ridiculous
point.  So while I understand and share your desire to stop back-porting things or splitting
things out, you also have to understand the desire of users like us whose business greatly
depends on Hadoop/HBase and where we need to move forward quickly.  If we were to wait until
Apache releases a version of Hadoop we can use in production with HBase (proper append, no
data loss, etc), we'd still be waiting.  So although I don't like the current situation with
Hadoop either, I'm glad someone did the grungy work of back-porting things or splitting some
things out so we could move forward.

What Todd is proposing is simply a way to make Snappy available quickly to users like us,
and we'd be very happy about that.  I think it's in everyone's interest to make this available
as soon as possible and not wait for a future Hadoop release.

Note: we're not Cloudera customers, but we use CDH because It Just Works.

> Integrate Snappy compression
> ----------------------------
>                 Key: HADOOP-7206
>                 URL: https://issues.apache.org/jira/browse/HADOOP-7206
>             Project: Hadoop Common
>          Issue Type: New Feature
>            Reporter: Eli Collins
> Google released Zippy as an open source (APLv2) project called Snappy (http://code.google.com/p/snappy).
This tracks integrating it into Hadoop.
> {quote}
> Snappy is a compression/decompression library. It does not aim for maximum compression,
or compatibility with any other compression library; instead, it aims for very high speeds
and reasonable compression. For instance, compared to the fastest mode of zlib, Snappy is
an order of magnitude faster for most inputs, but the resulting compressed files are anywhere
from 20% to 100% bigger. On a single core of a Core i7 processor in 64-bit mode, Snappy compresses
at about 250 MB/sec or more and decompresses at about 500 MB/sec or more.
> {quote}

