From: Apache Wiki
To: hadoop-commits@lucene.apache.org
Reply-To: hadoop-dev@lucene.apache.org
Date: Wed, 16 Aug 2006 10:17:10 -0000
Message-ID: <20060816101710.6652.30608@ajax.apache.org>
Subject: [Lucene-hadoop Wiki] Trivial Update of "SequenceFile" by Arun C Murthy

Dear Wiki user,

You have subscribed to a wiki page or wiki category on "Lucene-hadoop Wiki" for change notification.

The following page has been changed by Arun C Murthy:
http://wiki.apache.org/lucene-hadoop/SequenceFile

------------------------------------------------------------------------------
  Essentially there are 3 different file formats for !SequenceFiles depending on whether ''compression'' and ''block compression'' are active.
- 
+ [[BR]]
- However any of the above formats share a common ''header'' (which is used by the !SequenceFile.Reader to return the appropriate key/value pairs). The next section summarises the header:
+ However all of the above formats share a common ''header'' (which is used by the !SequenceFile.Reader to return the appropriate key/value pairs). The next section summarises the header:
+ [[Anchor(SeqFileHeader)]]
- [[Anchor(SeqFileHeader)]]===== SequenceFile Common Header =====
+ ===== SequenceFile Common Header =====
   * version - A byte array: SEQ
   * keyClassName - String
   * valueClassName - String
@@ -30, +31 @@
   * blockCompression - A boolean which specifies if ''block compression'' is turned on for keys/values in this file.
   * sync - A sync marker to denote end of the header.
- 
+ [[BR]]
  The formats for Uncompressed/!RecordCompressed Writers are very similar:
  ===== Uncompressed/RecordCompressed Writer Format =====
   * [#SeqFileHeader Header]
@@ -38, +39 @@
   * Key
   * (Compressed?) Value
   * A sync-marker every 100bytes or so to help in seeking to a random point in the file and then seeking to next ''record''.
- 
+ [[BR]]
  The format for the !BlockCompressedWriter is as follows:
  ===== BlockCompressed Writer Format =====
   * [#SeqFileHeader Header]
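
The sync-marker bullet in the Uncompressed/RecordCompressed writer format says a marker is written every 100 bytes or so, so that a reader can jump to an arbitrary offset and then find the next ''record''. The Python sketch below illustrates that idea only. It is not the Hadoop implementation and not the real on-disk layout: the 16-byte marker length, the length-prefixed record framing, and the names `write_records`, `seek_to_next_record` and `read_record` are all assumptions made for illustration.

```python
import io
import struct

SYNC_INTERVAL = 100  # "every 100 bytes or so", taken from the bullet above


def write_records(records, sync):
    """Serialise (key, value) byte pairs, writing `sync` roughly every SYNC_INTERVAL bytes."""
    out = io.BytesIO()
    last_sync = 0
    for key, value in records:
        # Emit a marker only between records, never inside one.
        if out.tell() - last_sync >= SYNC_INTERVAL:
            out.write(sync)
            last_sync = out.tell()
        # Hypothetical framing: two 4-byte big-endian lengths, then key and value.
        out.write(struct.pack(">II", len(key), len(value)))
        out.write(key)
        out.write(value)
    return out.getvalue()


def seek_to_next_record(data, offset, sync):
    """Return the offset of the record that follows the next marker at or after `offset`.

    Returns None when no marker is found before the end of the data.
    """
    pos = data.find(sync, offset)
    return None if pos < 0 else pos + len(sync)


def read_record(data, offset):
    """Decode one record (in the hypothetical framing above) starting at `offset`."""
    key_len, value_len = struct.unpack_from(">II", data, offset)
    start = offset + 8
    key = data[start:start + key_len]
    value = data[start + key_len:start + key_len + value_len]
    return key, value


if __name__ == "__main__":
    sync = bytes(range(200, 216))  # assumed 16-byte marker, fixed for the demo
    records = [(b"key-%03d" % i, b"v" * 20) for i in range(20)]
    data = write_records(records, sync)
    pos = seek_to_next_record(data, 50, sync)  # 50 is an arbitrary mid-record offset
    print(read_record(data, pos))
```

A reader that lands in the middle of a record cannot tell where the record boundaries are. Scanning forward for the marker gives it a position that is known to be the start of a record, at the cost of skipping whatever lies between the seek offset and that marker.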