From: Todd Lipcon
Date: Mon, 31 Jan 2011 15:23:10 -0800
Subject: Re: [DISCUSS] Move common, hdfs, mapreduce contrib components to apache-extras.org or elsewhere
To: general@hadoop.apache.org
MIME-Version: 1.0
Content-Type: multipart/alternative; boundary=90e6ba4fc5142bc6e4049b2cb420

--90e6ba4fc5142bc6e4049b2cb420
Content-Type: text/plain; charset=ISO-8859-1

On Sun, Jan 30, 2011 at 11:19 PM, Owen O'Malley wrote:

>
> Also note that pushing code out of Hadoop has a high cost. There are at
> least 3 forks of the hadoop-gpl-compression code. That creates a lot of
> confusion for the users. A lot of users never go to the work to figure out
> which fork and branch of hadoop-gpl-compression work with the version of
> Hadoop they installed.
>

Indeed it creates confusion, but in my opinion it has been very successful modulo that confusion. In particular, Kevin and I (who each have a repo on github but basically co-maintain a branch) have done about 8 bugfix releases of LZO in the last year. The ability to take a bug and turn it around into a release within a few days has been very beneficial to the users. If it were part of core Hadoop, people would be forced to live with these blocker bugs for months at a time between dot releases.

IMO the more we can take non-core components and move them to separate release timelines, the better. Yes, it is harder for users, but it also is easier for them when they hit a bug - they don't have to wait months for a wholesale upgrade which might contain hundreds of other changes to core components.

I think this will also help the situation where people have set up shop on branches -- a lot of the value of these branches comes from the frequency of backports and bugfixes to "non-core" components. If the non-core stuff were on a faster timeline upstream, we could maintain core stability while also offering people the latest and greatest libraries, tools, codecs, etc.

-Todd

--
Todd Lipcon
Software Engineer, Cloudera

--90e6ba4fc5142bc6e4049b2cb420--