Return-Path: X-Original-To: apmail-avro-dev-archive@www.apache.org Delivered-To: apmail-avro-dev-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id E8853C3F for ; Fri, 31 Aug 2012 19:19:08 +0000 (UTC) Received: (qmail 80464 invoked by uid 500); 31 Aug 2012 19:19:08 -0000 Delivered-To: apmail-avro-dev-archive@avro.apache.org Received: (qmail 80413 invoked by uid 500); 31 Aug 2012 19:19:08 -0000 Mailing-List: contact dev-help@avro.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: dev@avro.apache.org Delivered-To: mailing list dev@avro.apache.org Received: (qmail 80401 invoked by uid 99); 31 Aug 2012 19:19:08 -0000 Received: from arcas.apache.org (HELO arcas.apache.org) (140.211.11.28) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 31 Aug 2012 19:19:08 +0000 Date: Sat, 1 Sep 2012 06:19:08 +1100 (NCT) From: "Scott Carey (JIRA)" To: dev@avro.apache.org Message-ID: <521683659.24248.1346440748649.JavaMail.jiratomcat@arcas> In-Reply-To: <745708165.68509.1303255565721.JavaMail.tomcat@hel.zones.apache.org> Subject: [jira] [Commented] (AVRO-806) add a column-major codec for data files MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 [ https://issues.apache.org/jira/browse/AVRO-806?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13446273#comment-13446273 ] Scott Carey commented on AVRO-806: ---------------------------------- I think (1) is the best way to start. We could easily transition to (2) if that made sense due to other language implementations, and (3) if it grows big enough. We may want to identify it separately as 'evolving' or similar so that API changes in the next couple releases if needed can be managed more flexibly. > add a column-major codec for data files > --------------------------------------- > > Key: AVRO-806 > URL: https://issues.apache.org/jira/browse/AVRO-806 > Project: Avro > Issue Type: New Feature > Components: java, spec > Reporter: Doug Cutting > Assignee: Doug Cutting > Attachments: AVRO-806.patch, AVRO-806-v2.patch, avro-file-columnar.pdf > > > Define a codec that, when a data file's schema is a record schema, writes blocks within the file in column-major order. This would permit better compression and also permit efficient skipping of fields that are not of interest. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira