Return-Path: X-Original-To: apmail-incubator-crunch-dev-archive@minotaur.apache.org Delivered-To: apmail-incubator-crunch-dev-archive@minotaur.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id CF6BE927C for ; Tue, 12 Jun 2012 20:48:37 +0000 (UTC) Received: (qmail 55028 invoked by uid 500); 12 Jun 2012 20:48:37 -0000 Delivered-To: apmail-incubator-crunch-dev-archive@incubator.apache.org Received: (qmail 54999 invoked by uid 500); 12 Jun 2012 20:48:37 -0000 Mailing-List: contact crunch-dev-help@incubator.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: crunch-dev@incubator.apache.org Delivered-To: mailing list crunch-dev@incubator.apache.org Received: (qmail 54990 invoked by uid 99); 12 Jun 2012 20:48:37 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 12 Jun 2012 20:48:37 +0000 X-ASF-Spam-Status: No, hits=-0.7 required=5.0 tests=RCVD_IN_DNSWL_LOW,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (athena.apache.org: domain of tom@cloudera.com designates 209.85.160.47 as permitted sender) Received: from [209.85.160.47] (HELO mail-pb0-f47.google.com) (209.85.160.47) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 12 Jun 2012 20:48:32 +0000 Received: by pbbrq2 with SMTP id rq2so1292704pbb.6 for ; Tue, 12 Jun 2012 13:48:12 -0700 (PDT) X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=20120113; h=mime-version:in-reply-to:references:from:date:message-id:subject:to :cc:content-type:x-gm-message-state; bh=L6vznfMOGdxSFi+1oCVmOAQD7cUOn0clgxTSGofkuy8=; b=hkFOTSLa1g+UAWnL/cA2TNi/3Ig+D3tAqKxk1A0SUtde2f+Xm2nRyTO9+U9STuMjy+ ZOnlnZ75tBzPnEO3GRulvEn4TVJ57vikw4vaz7UH7HRBeleg+iwpze/5zpDRduKnW+rX fQ32lqydcpdyfNKm/HRHWkrRAUUYhPQB14vwgjDtXGhe5+VDPLex8zwWoTEcmfGmHqyF E+sBRaDo238OIQ5KMcqrmy4YubCrL0rarSw2FT75MhlckQ41ct8CMDb0uNGgok3lDiI3 PiU3X9NOkCG+MgeU/k+eAjTFJiZl4xVCLvDS34aK6uJzSA0OtQqLHDAvmZSPKqOhRciU 6KMA== Received: by 10.68.216.2 with SMTP id om2mr42664005pbc.26.1339534092348; Tue, 12 Jun 2012 13:48:12 -0700 (PDT) MIME-Version: 1.0 Received: by 10.68.4.227 with HTTP; Tue, 12 Jun 2012 13:47:44 -0700 (PDT) In-Reply-To: References: From: Tom White Date: Tue, 12 Jun 2012 15:47:44 -0500 Message-ID: Subject: Re: avro 1.7.0: should we upgrade crunch? To: Josh Wills Cc: crunch-dev@incubator.apache.org Content-Type: text/plain; charset=ISO-8859-1 X-Gm-Message-State: ALoCoQkC9t1tdoBlXPgj6L7T6Rhbqtv8H3PPmQ0mGsqASnPVXxaxVwqP8uHtOoe593/Ie7fy9zTV X-Virus-Checked: Checked by ClamAV on apache.org Funny - I was just writing an email about potential dragons... Before upgrading someone should check that Avro 1.7.0 works with released versions of Hadoop. In the past there have been problems with Avro and dependencies like Jackson conflicting with versions that Hadoop uses. In particular, the MR classpath can be controlled via the configuration property mapreduce.user.classpath.first and the env property HADOOP_USER_CLASSPATH_FIRST. By setting these to true MR will use the newer Avro libraries, however there is a risk that Hadoop will not work with the newer versions. Cheers, Tom On Tue, Jun 12, 2012 at 3:36 PM, Josh Wills wrote: > On Tue, Jun 12, 2012 at 1:28 PM, Gabriel Reid wrote: >> On Tue, Jun 12, 2012 at 10:14 PM, Josh Wills wrote: >>> >>> I saw Chris' tweet that Avro 1.7.0 was released-- thoughts on >>> upgrading? Does it buy us any good stuff we want? >> >> The fix of ReflectDatumReader not working correctly with Specific >> Records (https://issues.apache.org/jira/browse/AVRO-1046) would allow >> us to remove a fair bit of Avro code that works around that bug -- if >> we do do the upgrade, I'd certainly volunteer to weed out those >> workarounds. >> >> On the other hand, I just did a quick scan of the release notes for >> 1.7.0 (https://issues.apache.org/jira/secure/ReleaseNote.jspa?projectId=12310911&version=12318848) >> and I didn't notice anything that would really be a big reason to >> upgrade right away. >> >> Are there any reasons *not* to upgrade (other than risks of something >> else being broken)? Maybe the cleanup of the Avro code that we can do >> is reason enough to do the upgrade. > > +tom explicitly > > I remember having a bunch of frustrations with 1.6.0 and having to > stay on 1.5.4 for longer than I wanted because of some critical bugs > that didn't get fixed until 1.6.2, but I also think the move from > 1.5.4 to 1.6.0 involved a much larger rewrite than what I see from the > release notes for 1.7.0. Tom White is traveling across the US right > now, but I'm wondering if he has a feel for whether 1.7.0 is likely to > contain any dragons. :) > >> >> - Gabriel > > > > -- > Director of Data Science > Cloudera > Twitter: @josh_wills