Return-Path: Delivered-To: apmail-hadoop-general-archive@minotaur.apache.org Received: (qmail 51517 invoked from network); 13 Apr 2009 17:02:19 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (140.211.11.3) by minotaur.apache.org with SMTP; 13 Apr 2009 17:02:19 -0000 Received: (qmail 1133 invoked by uid 500); 13 Apr 2009 17:02:09 -0000 Delivered-To: apmail-hadoop-general-archive@hadoop.apache.org Received: (qmail 98917 invoked by uid 500); 13 Apr 2009 17:01:03 -0000 Mailing-List: contact general-help@hadoop.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: general@hadoop.apache.org Delivered-To: mailing list general@hadoop.apache.org Delivered-To: moderator for general@hadoop.apache.org Received: (qmail 87899 invoked by uid 99); 13 Apr 2009 11:28:05 -0000 X-ASF-Spam-Status: No, hits=-0.0 required=10.0 tests=SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (nike.apache.org: domain of ankur.goel@corp.aol.com designates 64.12.143.146 as permitted sender) Date: Mon, 13 Apr 2009 16:57:26 +0530 (IST) From: Ankur Goel To: general@hadoop.apache.org Message-ID: <19580353.281239622042153.JavaMail.ankur@localhost.localdomain> In-Reply-To: <10963932.261239621910111.JavaMail.ankur@localhost.localdomain> Subject: Re: [PROPOSAL] new subproject: Avro MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-OriginalArrivalTime: 13 Apr 2009 11:27:31.0156 (UTC) FILETIME=[D564A140:01C9BC2A] X-AOL-IP: 10.178.121.20 X-Virus-Checked: Checked by ClamAV on apache.org How fast do we expect the new serialization system to be when it replaces existing serialization mechanism in Hadoop RPC? A clear description of the existing bottlenecks and the performance goals for this system would help developers interested in contributing. -Ankur -------- Original Message -------- Subject: [PROPOSAL] new subproject: Avro Date: Thu, 02 Apr 2009 15:05:08 -0700 From: Doug Cutting Reply-To: general@hadoop.apache.org To: general@hadoop.apache.org I propose we add a new Hadoop subproject for Avro, a serialization system. My ambition is for Avro to replace both Hadoop's RPC and to be used for most Hadoop data files, e.g., by Pig, Hive, etc. Initial committers would be Sharad Agarwal and me, both existing Hadoop committers. We are the sole authors of this software to date. The code is currently at: http://people.apache.org/~cutting/avro.git/ To learn more: git clone http://people.apache.org/~cutting/avro.git/ avro cat avro/README.txt Comments? Questions? Doug