From: Harsh J
Date: Mon, 14 Feb 2011 23:46:52 +0530
Subject: Re: Map output files are SequenceFileFormat
To: mapreduce-user@hadoop.apache.org

Hello,

On Mon, Feb 14, 2011 at 11:37 PM, Pedro Costa wrote:
> And when the data of the map-intermediate files is compressed, it's
> still an IFile?

Yes. From my understanding, if compression is turned ON for IFile, the
output stream used to write the IFile is itself set up as a compressing
stream, so all data written to it is compressed.

In contrast, SequenceFile compression is applied per record or per
block: record compression compresses only the values and leaves keys
uncompressed, while block compression batches keys and values into
blocks (of a size fixed when the Writer is created) and compresses them.

--
Harsh J
www.harshj.com
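
To make the distinction concrete, here is a minimal sketch of the two layouts using Python's zlib. It is not Hadoop code and does not reproduce the IFile or SequenceFile on-disk formats; the record framing, the sample records, the function names and the 1024-byte block size are all assumptions chosen for illustration.

```python
import zlib

# Hypothetical key/value records standing in for map output.
records = [
    (f"key{i:04d}".encode(), b"value-" + bytes([65 + i % 26]) * 40)
    for i in range(200)
]


def frame(key, value):
    """Length-prefix a record so it can be parsed back out of a byte stream."""
    return len(key).to_bytes(4, "big") + len(value).to_bytes(4, "big") + key + value


def write_stream_compressed(recs):
    """Stream style: one compressor wraps the whole output, framing included."""
    comp = zlib.compressobj()
    out = bytearray()
    for key, value in recs:
        out += comp.compress(frame(key, value))
    out += comp.flush()
    return bytes(out)


def write_block_compressed(recs, block_size=1024):
    """Block style: buffer records up to block_size, compress each block alone."""
    blocks, buf = [], bytearray()
    for key, value in recs:
        buf += frame(key, value)
        if len(buf) >= block_size:
            blocks.append(zlib.compress(bytes(buf)))
            buf.clear()
    if buf:
        blocks.append(zlib.compress(bytes(buf)))
    return blocks


def read_frames(data):
    """Parse length-prefixed records back out of decompressed bytes."""
    pos, out = 0, []
    while pos < len(data):
        klen = int.from_bytes(data[pos:pos + 4], "big")
        vlen = int.from_bytes(data[pos + 4:pos + 8], "big")
        pos += 8
        out.append((data[pos:pos + klen], data[pos + klen:pos + klen + vlen]))
        pos += klen + vlen
    return out


stream = write_stream_compressed(records)
blocks = write_block_compressed(records)
```

In this sketch each entry of `blocks` can be decompressed on its own, whereas `stream` is a single compressed stream that has to be decoded from its beginning.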