Return-Path: X-Original-To: apmail-hadoop-mapreduce-user-archive@minotaur.apache.org Delivered-To: apmail-hadoop-mapreduce-user-archive@minotaur.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 259E91807B for ; Wed, 29 Jul 2015 22:41:41 +0000 (UTC) Received: (qmail 53927 invoked by uid 500); 29 Jul 2015 22:41:36 -0000 Delivered-To: apmail-hadoop-mapreduce-user-archive@hadoop.apache.org Received: (qmail 53786 invoked by uid 500); 29 Jul 2015 22:41:36 -0000 Mailing-List: contact user-help@hadoop.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@hadoop.apache.org Delivered-To: mailing list user@hadoop.apache.org Received: (qmail 53776 invoked by uid 99); 29 Jul 2015 22:41:35 -0000 Received: from Unknown (HELO spamd2-us-west.apache.org) (209.188.14.142) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 29 Jul 2015 22:41:35 +0000 Received: from localhost (localhost [127.0.0.1]) by spamd2-us-west.apache.org (ASF Mail Server at spamd2-us-west.apache.org) with ESMTP id 3DFA91A8C01 for ; Wed, 29 Jul 2015 22:41:35 +0000 (UTC) X-Virus-Scanned: Debian amavisd-new at spamd2-us-west.apache.org X-Spam-Flag: NO X-Spam-Score: 1.792 X-Spam-Level: * X-Spam-Status: No, score=1.792 tagged_above=-999 required=6.31 tests=[DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, HTML_MESSAGE=3, RCVD_IN_MSPIKE_H2=-1.108, SPF_PASS=-0.001, URIBL_BLOCKED=0.001] autolearn=disabled Authentication-Results: spamd2-us-west.apache.org (amavisd-new); dkim=pass (2048-bit key) header.d=gmail.com Received: from mx1-eu-west.apache.org ([10.40.0.8]) by localhost (spamd2-us-west.apache.org [10.40.0.9]) (amavisd-new, port 10024) with ESMTP id D9wao2Ksz_mB for ; Wed, 29 Jul 2015 22:41:31 +0000 (UTC) Received: from mail-yk0-f176.google.com (mail-yk0-f176.google.com [209.85.160.176]) by mx1-eu-west.apache.org (ASF Mail Server at mx1-eu-west.apache.org) with ESMTPS id 44A9D20DD8 for ; Wed, 29 Jul 2015 22:41:30 +0000 (UTC) Received: by ykay190 with SMTP id y190so20539986yka.3 for ; Wed, 29 Jul 2015 15:40:38 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=mime-version:in-reply-to:references:date:message-id:subject:from:to :content-type; bh=/iRKYp0KyXXskCl3JLqzPtzELaMzV+UUIqDVTMm8OSs=; b=MKIBTuXgB/LBUkazNK7v+Bwk3CxACSS7Y3Y30JLlJ5Oo+alRjlGOF0v4yF76OC8/47 GZnrDDJmGHmFy3VToZhH+iBECAIouV7xVEuiNggZu0mUgfazx63hayMIUxSpzNw7LUho T3wL8JZ5/s05ZFa8uOqkCkNGyHhUmqp/kbap1Huus8Ebb31Q3PZH/3ZkXHemhLA8Zhp5 opblXJjX65LatHDDflbAY6zv48GS4LkT3UXSDg6IAOcHSUZaX0MQK3ir4EJKhS43Vkg2 8h40ng/dmSUHyx+F4euXl0pByhVIiBbtb/FeJYhKUHwMIZhTALJUQeueTu7gitn52cCv l85g== MIME-Version: 1.0 X-Received: by 10.170.140.2 with SMTP id h2mr47532332ykc.96.1438209638281; Wed, 29 Jul 2015 15:40:38 -0700 (PDT) Received: by 10.37.214.144 with HTTP; Wed, 29 Jul 2015 15:40:38 -0700 (PDT) In-Reply-To: References: Date: Wed, 29 Jul 2015 15:40:38 -0700 Message-ID: Subject: Re: Need command to compress the files From: Ted Yu To: "common-user@hadoop.apache.org" Content-Type: multipart/alternative; boundary=001a1139fab09a3894051c0b46ff --001a1139fab09a3894051c0b46ff Content-Type: text/plain; charset=UTF-8 You can use the following command to see options for gzip: gzip -h For snappy, see: https://github.com/kubo/snzip https://code.google.com/p/snappy/issues/detail?id=34 FYI On Wed, Jul 29, 2015 at 3:34 PM, SP wrote: > Hi All, > > I am working on comparing different compression ratios. > > I have these files in AVRO format. How can I compress them using snappy or > gzip. > > -rw-r--r-- 3 hdfs supergroup 3080866838 2015-07-29 18:16 > /tmp/fact_splitby_date_id/part-m-00000.avro > -rw-r--r-- 3 hdfs supergroup 3021258762 2015-07-29 18:15 > /tmp/fact_splitby_date_id/part-m-00001.avro > -rw-r--r-- 3 hdfs supergroup 3164101762 2015-07-29 18:17 > /tmp/fact_splitby_date_id/part-m-00002.avro > -rw-r--r-- 3 hdfs supergroup 3251578205 2015-07-29 18:16 > /tmp/fact_splitby_date_id/part-m-00003.avro > > > > > Thanks > Sp > --001a1139fab09a3894051c0b46ff Content-Type: text/html; charset=UTF-8 Content-Transfer-Encoding: quoted-printable
You can use the following command to see options for gzip:=
gzip -h

For snappy, see:
https://github.com/kubo/snzip

FYI

On Wed, Jul 29, 2015 at 3:34 PM, SP <sajidmca@gmail.com= > wrote:
Hi Al= l,

I am working on comparing different compression ratio= s.=C2=A0

I have these files in AVRO format. How ca= n I compress them using snappy or gzip.

-rw-r= --r-- =C2=A0 3 hdfs supergroup 3080866838 2015-07-29 18:16 /tmp/fact_splitb= y_date_id/part-m-00000.avro
-rw-r--r-- =C2=A0 3 hdfs supergroup 3= 021258762 2015-07-29 18:15 /tmp/fact_splitby_date_id/part-m-00001.avro
-rw-r--r-- =C2=A0 3 hdfs supergroup 3164101762 2015-07-29 18:17 /tmp/= fact_splitby_date_id/part-m-00002.avro
-rw-r--r-- =C2=A0 3 hdfs s= upergroup 3251578205 2015-07-29 18:16 /tmp/fact_splitby_date_id/part-m-0000= 3.avro




Thanks
Sp<= /div>

--001a1139fab09a3894051c0b46ff--