hadoop-common-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Alejandro Abdelnur (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (HADOOP-7206) Integrate Snappy compression
Date Mon, 13 Jun 2011 22:51:49 GMT

    [ https://issues.apache.org/jira/browse/HADOOP-7206?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13048851#comment-13048851

Alejandro Abdelnur commented on HADOOP-7206:


1. Snappy-Java bundles the native libraries in the JAR itself. While that is convenient/clever
packaging technique, this is different from how Hadoop handles native libraries (loading them
from lib/native/${OS_ARCH}/).

2. The motivation for keeping hadoop-snappy independent of hadoop was that we could use it
right the way in other projects (HBase already integrated it).

I would strongly argue that native libraries should handled in a consistent maner in Hadoop.

And, if the preference of the Hadoop folks is to bundle snappy in Hadoop (dismissing #2),
then I'd advocate for bringing Hadoop-Snappy into Hadoop as this JIRA originally proposed.
By doing this we would have 1 external dependency (snappy) instead 2 (snappy-java and snappy,
with the side effect that if we need a new version of snappy we would have to wait for snappy-java
to do a release with it).


> Integrate Snappy compression
> ----------------------------
>                 Key: HADOOP-7206
>                 URL: https://issues.apache.org/jira/browse/HADOOP-7206
>             Project: Hadoop Common
>          Issue Type: New Feature
>    Affects Versions: 0.21.0
>            Reporter: Eli Collins
>         Attachments: HADOOP-7206.patch
> Google release Zippy as an open source (APLv2) project called Snappy (http://code.google.com/p/snappy).
This tracks integrating it into Hadoop.
> {quote}
> Snappy is a compression/decompression library. It does not aim for maximum compression,
or compatibility with any other compression library; instead, it aims for very high speeds
and reasonable compression. For instance, compared to the fastest mode of zlib, Snappy is
an order of magnitude faster for most inputs, but the resulting compressed files are anywhere
from 20% to 100% bigger. On a single core of a Core i7 processor in 64-bit mode, Snappy compresses
at about 250 MB/sec or more and decompresses at about 500 MB/sec or more.
> {quote}

This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira


View raw message