Return-Path: Delivered-To: apmail-hadoop-avro-dev-archive@minotaur.apache.org Received: (qmail 24124 invoked from network); 1 Apr 2010 00:25:04 -0000 Received: from unknown (HELO mail.apache.org) (140.211.11.3) by 140.211.11.9 with SMTP; 1 Apr 2010 00:25:04 -0000 Received: (qmail 70524 invoked by uid 500); 1 Apr 2010 00:25:04 -0000 Delivered-To: apmail-hadoop-avro-dev-archive@hadoop.apache.org Received: (qmail 70384 invoked by uid 500); 1 Apr 2010 00:25:04 -0000 Mailing-List: contact avro-dev-help@hadoop.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: avro-dev@hadoop.apache.org Delivered-To: mailing list avro-dev@hadoop.apache.org Received: (qmail 70376 invoked by uid 99); 1 Apr 2010 00:25:04 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 01 Apr 2010 00:25:04 +0000 X-ASF-Spam-Status: No, hits=0.7 required=10.0 tests=SPF_NEUTRAL X-Spam-Check-By: apache.org Received-SPF: neutral (nike.apache.org: local policy) Received: from [216.145.54.172] (HELO mrout2.yahoo.com) (216.145.54.172) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 01 Apr 2010 00:24:56 +0000 Received: from SNV-EXPF01.ds.corp.yahoo.com (snv-expf01.ds.corp.yahoo.com [207.126.227.250]) by mrout2.yahoo.com (8.13.6/8.13.6/y.out) with ESMTP id o310NWA5073236 for ; Wed, 31 Mar 2010 17:23:32 -0700 (PDT) DomainKey-Signature: a=rsa-sha1; s=serpent; d=yahoo-inc.com; c=nofws; q=dns; h=received:user-agent:date:subject:from:to:message-id: thread-topic:thread-index:mime-version:content-type: content-transfer-encoding:x-originalarrivaltime; b=TQXkXlXRECZmpQWxwB9gNfClVUQHslDFnlPCFuWNkkdhgFMhWGo31sOBI7iK7gAq Received: from SNV-EXVS07.ds.corp.yahoo.com ([207.126.227.5]) by SNV-EXPF01.ds.corp.yahoo.com with Microsoft SMTPSVC(6.0.3790.3959); Wed, 31 Mar 2010 17:23:31 -0700 Received: from 10.72.109.205 ([10.72.109.205]) by SNV-EXVS07.ds.corp.yahoo.com ([207.126.227.142]) via Exchange Front-End Server snv-webmail.corp.yahoo.com ([207.126.227.60]) with Microsoft Exchange Server HTTP-DAV ; Thu, 1 Apr 2010 00:23:31 +0000 User-Agent: Microsoft-Entourage/12.23.0.091001 Date: Wed, 31 Mar 2010 17:23:29 -0700 Subject: clarifications on file format From: Scott Banachowski To: avro-dev Message-ID: Thread-Topic: clarifications on file format Thread-Index: AcrRMY1vGyWBavFlvUeaMFaV72USyw== Mime-version: 1.0 Content-type: text/plain; charset="US-ASCII" Content-transfer-encoding: 7bit X-OriginalArrivalTime: 01 Apr 2010 00:23:31.0981 (UTC) FILETIME=[8F36B3D0:01CAD131] X-Virus-Checked: Checked by ClamAV on apache.org Hi, I'm looking at the spec for the container file, and have 2 questions: The map of metadata key/value pairs begins with a long, then a number of string-key/bytes-value pairs. To be consistent with avro maps, should this be followed by a long of 0? The spec doesn't say explicitly, but if the header is described by an avro schema I would suspect yes. Are the longs that describe the file block varint longs? Or 64-bit longs? I assume avro varints. But if so, if you ever wanted to expand the size of block by writing more objects to it, you'd be in trouble because you'd potentially be unable to fit the new size in the varint's location. Also, I looked around the repo for some example container files, but didn't see any. Are there any examples checked in that we can use to examine their layout and test our readers? thanks, Scott