Return-Path: X-Original-To: apmail-hadoop-common-issues-archive@minotaur.apache.org Delivered-To: apmail-hadoop-common-issues-archive@minotaur.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id CF24F4D05 for ; Sat, 18 Jun 2011 01:00:14 +0000 (UTC) Received: (qmail 27495 invoked by uid 500); 18 Jun 2011 01:00:13 -0000 Delivered-To: apmail-hadoop-common-issues-archive@hadoop.apache.org Received: (qmail 27454 invoked by uid 500); 18 Jun 2011 01:00:13 -0000 Mailing-List: contact common-issues-help@hadoop.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: common-issues@hadoop.apache.org Delivered-To: mailing list common-issues@hadoop.apache.org Received: (qmail 27216 invoked by uid 99); 18 Jun 2011 01:00:12 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Sat, 18 Jun 2011 01:00:12 +0000 X-ASF-Spam-Status: No, hits=-2000.0 required=5.0 tests=ALL_TRUSTED,T_RP_MATCHES_RCVD X-Spam-Check-By: apache.org Received: from [140.211.11.116] (HELO hel.zones.apache.org) (140.211.11.116) by apache.org (qpsmtpd/0.29) with ESMTP; Sat, 18 Jun 2011 01:00:09 +0000 Received: from hel.zones.apache.org (hel.zones.apache.org [140.211.11.116]) by hel.zones.apache.org (Postfix) with ESMTP id 0592241C406 for ; Sat, 18 Jun 2011 00:59:49 +0000 (UTC) Date: Sat, 18 Jun 2011 00:59:49 +0000 (UTC) From: "Tom White (JIRA)" To: common-issues@hadoop.apache.org Message-ID: <526744128.17205.1308358789019.JavaMail.tomcat@hel.zones.apache.org> In-Reply-To: <1240879157.4951.1300837565772.JavaMail.tomcat@hel.zones.apache.org> Subject: [jira] [Commented] (HADOOP-7206) Integrate Snappy compression MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 X-Virus-Checked: Checked by ClamAV on apache.org [ https://issues.apache.org/jira/browse/HADOOP-7206?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13051423#comment-13051423 ] Tom White commented on HADOOP-7206: ----------------------------------- I noticed that the compression overhead in this patch is {{(bufferSize >> 3) + 128 + 3}} which is less than the maximum possible blowup that Snappy allows for (http://code.google.com/p/snappy/source/browse/trunk/snappy.cc#55). Should this be changed to {{bufferSize / 6 + 32}}? > Integrate Snappy compression > ---------------------------- > > Key: HADOOP-7206 > URL: https://issues.apache.org/jira/browse/HADOOP-7206 > Project: Hadoop Common > Issue Type: New Feature > Affects Versions: 0.21.0 > Reporter: Eli Collins > Assignee: issei yoshida > Attachments: HADOOP-7206-002.patch, HADOOP-7206.patch, v2-HADOOP-7206-snappy-codec-using-snappy-java.txt, v3-HADOOP-7206-snappy-codec-using-snappy-java.txt, v4-HADOOP-7206-snappy-codec-using-snappy-java.txt > > > Google release Zippy as an open source (APLv2) project called Snappy (http://code.google.com/p/snappy). This tracks integrating it into Hadoop. > {quote} > Snappy is a compression/decompression library. It does not aim for maximum compression, or compatibility with any other compression library; instead, it aims for very high speeds and reasonable compression. For instance, compared to the fastest mode of zlib, Snappy is an order of magnitude faster for most inputs, but the resulting compressed files are anywhere from 20% to 100% bigger. On a single core of a Core i7 processor in 64-bit mode, Snappy compresses at about 250 MB/sec or more and decompresses at about 500 MB/sec or more. > {quote} -- This message is automatically generated by JIRA. For more information on JIRA, see: http://www.atlassian.com/software/jira