hadoop-common-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Alejandro Abdelnur (JIRA)" <j...@apache.org>
Subject [jira] [Updated] (HADOOP-7206) Integrate Snappy compression
Date Fri, 24 Jun 2011 03:35:47 GMT

     [ https://issues.apache.org/jira/browse/HADOOP-7206?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel

Alejandro Abdelnur updated HADOOP-7206:

    Attachment: HADOOP-7206revertplusnew.patch

Attached is a patch based on Issay's patch.

* It reverts the current committed HADOOP-7206 & HADDOP-7407
* It uses Snappy native directly
* It adds the JNI bindings to Hadoop native
* Via {{configure.ac}}, if snappy is not available it ignores Snappy JNI bindings
* Snappy lib is looked (by default) at {{/usr/local}}
* Once Snappy is avail by default in different OSes, the default lookup can be changed to
{{/usr/}} for automatic detection
* Location of Snappy lib can be altered with {{-Dsnappy.prefix=}} Ant option
* SnappyCodec is defined in {{core-default.xml}}
* If the Snappy JNI bindings and/or Snappy are not present, SnappyCodec warns and continues

IMO this addresses the concerns previously discussed in the JIRA. And it will not break the
build in Apache Jenkins machines if Snappy is not install.

> Integrate Snappy compression
> ----------------------------
>                 Key: HADOOP-7206
>                 URL: https://issues.apache.org/jira/browse/HADOOP-7206
>             Project: Hadoop Common
>          Issue Type: New Feature
>    Affects Versions: 0.21.0
>            Reporter: Eli Collins
>            Assignee: Alejandro Abdelnur
>             Fix For: 0.23.0
>         Attachments: HADOOP-7206-002.patch, HADOOP-7206.patch, HADOOP-7206revertplusnew.patch,
v2-HADOOP-7206-snappy-codec-using-snappy-java.txt, v3-HADOOP-7206-snappy-codec-using-snappy-java.txt,
v4-HADOOP-7206-snappy-codec-using-snappy-java.txt, v5-HADOOP-7206-snappy-codec-using-snappy-java.txt
> Google release Zippy as an open source (APLv2) project called Snappy (http://code.google.com/p/snappy).
This tracks integrating it into Hadoop.
> {quote}
> Snappy is a compression/decompression library. It does not aim for maximum compression,
or compatibility with any other compression library; instead, it aims for very high speeds
and reasonable compression. For instance, compared to the fastest mode of zlib, Snappy is
an order of magnitude faster for most inputs, but the resulting compressed files are anywhere
from 20% to 100% bigger. On a single core of a Core i7 processor in 64-bit mode, Snappy compresses
at about 250 MB/sec or more and decompresses at about 500 MB/sec or more.
> {quote}

This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira


View raw message