hadoop-common-issues mailing list archives

From "Todd Lipcon (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (HADOOP-7206) Integrate Snappy compression
Date Wed, 23 Mar 2011 17:32:06 GMT

    [ https://issues.apache.org/jira/browse/HADOOP-7206?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13010244#comment-13010244 ]

Todd Lipcon commented on HADOOP-7206:

Hey Allen. When there are real customers (either external customers in my case or internal
customers at a lot of the bigger companies using Hadoop) who need a feature, it's not always
possible to say no.

Given the choice between putting it in core (and most likely having major users and distros
backport) vs putting it in an external library, I prefer the external library. I don't follow
your reasoning that putting it in core will speed up an Apache release -- when has adding
new code to a project ever made its release cycle faster?

FWIW I have expressed this same opinion even on projects like HBase where releases are frequent
and people run trees that are very close to the Apache bits. Keeping projects small makes
releases easier, not harder, and frequent releases mean fewer backports.

> Integrate Snappy compression
> ----------------------------
>                 Key: HADOOP-7206
>                 URL: https://issues.apache.org/jira/browse/HADOOP-7206
>             Project: Hadoop Common
>          Issue Type: New Feature
>            Reporter: Eli Collins
> Google released Zippy as an open source (APLv2) project called Snappy (http://code.google.com/p/snappy).
> This tracks integrating it into Hadoop.
> {quote}
> Snappy is a compression/decompression library. It does not aim for maximum compression,
> or compatibility with any other compression library; instead, it aims for very high speeds
> and reasonable compression. For instance, compared to the fastest mode of zlib, Snappy is
> an order of magnitude faster for most inputs, but the resulting compressed files are anywhere
> from 20% to 100% bigger. On a single core of a Core i7 processor in 64-bit mode, Snappy compresses
> at about 250 MB/sec or more and decompresses at about 500 MB/sec or more.
> {quote}

