Mailing-List: contact user-help@avro.apache.org; run by ezmlm
Precedence: bulk
Reply-To: user@avro.apache.org
Received-SPF: softfail (nike.apache.org: transitioning domain of
 sam@mefford.org does not designate 216.139.236.26 as permitted sender)
Date: Thu, 21 Mar 2013 11:26:18 -0700 (PDT)
From: sammefford <sam@mefford.org>
To: user@avro.apache.org
Message-ID: <1363890378675-4026663.post@n3.nabble.com>
Subject: Where are the rows in Trevni format?
MIME-Version: 1.0
Content-Type: text/plain; charset=us-ascii
Content-Transfer-Encoding: 7bit

I read the Trevni Specificaiton:
http://avro.apache.org/docs/1.7.4/trevni/spec.html
and I can't see where the row ids are stored for each value in each column. 
Am I missing something obvious?  Is the spec incomplete on that point?

Also, to confirm, my understanding is columnar formats are efficient because
they store column values sorted and can thereby find specific values or
ranges of values quickly.  While the spec mentions the benefits of sorting,
I don't see a requirement that column values be sorted.  Can we depend that
the blocks of column values are sorted?

Thanks,

Sam Mefford
Chief Architect-Big Data Solutions
Avalon Consluting, LLC.
801-706-9731


--
View this message in context: http://apache-avro.679487.n3.nabble.com/Where-are-the-rows-in-Trevni-format-tp4026663.html
Sent from the Avro - Users mailing list archive at Nabble.com.