From: Harsh J
Date: Fri, 1 Apr 2011 12:29:34 +0530
Subject: Re: Reading Records from a Sequence File
To: common-user@hadoop.apache.org

On Fri, Apr 1, 2011 at 9:00 AM, maha wrote:
> Hello Everyone,
>
> As far as I know, when my Java program opens a sequence file from HDFS
> for a map calculation, using SequenceFile.Reader(key,value) will actually
> read the file in dfs.block.size chunks and then grab it record-by-record
> from memory.
>
> Is that right?

The dfs.block.size part is partially right when applied in MapReduce
(actually, it would look for sync points for read start and read end).
And no, the reader does not load the entire data into memory in one go.
It buffers and reads off the stream just like any other reader.

Could we have some more information on what your Java program does,
and what exactly you are measuring? :)

-- 
Harsh J
http://harshj.com
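
To make the sync-point remark above a bit more concrete, here is a toy sketch in Python. It is not Hadoop's SequenceFile format or API: the record layout, the marker bytes, the sync interval, and the names write_records, seek_to_sync and read_split are all assumptions made up for this illustration. It only shows the general idea that a reader handed a byte range starts at the first sync marker at or after the range start, stops at the first marker at or past the range end, and pulls data off the stream in small pieces rather than loading everything at once.

```python
import io
import struct

# Hypothetical toy format: each record is a 4-byte big-endian length plus payload.
# A length of -1 can never be a real record, so it introduces a sync marker.
SYNC_ESCAPE = struct.pack(">i", -1)
SYNC_MARKER = SYNC_ESCAPE + b"toy-sync-marker!"  # 4 + 16 bytes
SYNC_INTERVAL = 3  # assumed: one marker before every 3rd record


def write_records(stream, records):
    """Write length-prefixed records, inserting a sync marker periodically."""
    for i, payload in enumerate(records):
        if i > 0 and i % SYNC_INTERVAL == 0:
            stream.write(SYNC_MARKER)
        stream.write(struct.pack(">i", len(payload)) + payload)


def seek_to_sync(stream, start, chunk_size=64):
    """Position stream at the first sync marker at or after start.

    Scans in small chunks instead of reading the whole file.
    Returns False if no marker is found before end of file.
    """
    stream.seek(start)
    window, window_pos = b"", start
    keep = len(SYNC_MARKER) - 1  # tail that may hold a partial marker
    while True:
        chunk = stream.read(chunk_size)
        if not chunk:
            return False
        window += chunk
        hit = window.find(SYNC_MARKER)
        if hit >= 0:
            stream.seek(window_pos + hit)
            return True
        window_pos += max(0, len(window) - keep)
        window = window[-keep:]


def read_split(stream, start, end):
    """Yield the records owned by the byte range [start, end), one at a time."""
    if start == 0:
        stream.seek(0)
    elif not seek_to_sync(stream, start):
        return
    while True:
        pos = stream.tell()
        header = stream.read(4)
        if len(header) < 4:
            return  # end of file
        if header == SYNC_ESCAPE:
            if pos >= end:
                return  # this marker belongs to the next split
            stream.read(len(SYNC_MARKER) - 4)  # skip the rest of the marker
            continue
        (length,) = struct.unpack(">i", header)
        yield stream.read(length)


# Demo: split the toy file in half by bytes; every record is read exactly once.
records = [f"record-{i}".encode() for i in range(10)]
buf = io.BytesIO()
write_records(buf, records)
size = buf.tell()
splits = [(0, size // 2), (size // 2, size)]
parts = [list(read_split(buf, s, e)) for s, e in splits]
```

Note that the split boundary at size // 2 does not fall on a record boundary, and the first reader still runs past it until it meets the next marker. That is the sense in which the byte range is only "partially right" as a description of what gets read.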