Return-Path: X-Original-To: apmail-hadoop-hdfs-dev-archive@minotaur.apache.org Delivered-To: apmail-hadoop-hdfs-dev-archive@minotaur.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 78343DD24 for ; Wed, 26 Sep 2012 17:54:59 +0000 (UTC) Received: (qmail 39337 invoked by uid 500); 26 Sep 2012 17:54:58 -0000 Delivered-To: apmail-hadoop-hdfs-dev-archive@hadoop.apache.org Received: (qmail 39240 invoked by uid 500); 26 Sep 2012 17:54:58 -0000 Mailing-List: contact hdfs-dev-help@hadoop.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: hdfs-dev@hadoop.apache.org Delivered-To: mailing list hdfs-dev@hadoop.apache.org Received: (qmail 39230 invoked by uid 99); 26 Sep 2012 17:54:58 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 26 Sep 2012 17:54:58 +0000 X-ASF-Spam-Status: No, hits=-0.7 required=5.0 tests=RCVD_IN_DNSWL_LOW,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (athena.apache.org: domain of eli@cloudera.com designates 74.125.82.48 as permitted sender) Received: from [74.125.82.48] (HELO mail-wg0-f48.google.com) (74.125.82.48) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 26 Sep 2012 17:54:53 +0000 Received: by wgbds1 with SMTP id ds1so621145wgb.29 for ; Wed, 26 Sep 2012 10:54:32 -0700 (PDT) X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=20120113; h=mime-version:in-reply-to:references:date:message-id:subject:from:to :content-type:x-gm-message-state; bh=X/G8XgxopxSB2IWHMJJl6yKpPI6BldocGCSjDvTWl2o=; b=UeN6OQK27bWbrXnRY/Of0nenOM8Kb+MFIIT4kiB3uUBOBZRDf5NyDBGzMc7E8PPH80 SmQpnxHVIQoLK6qKMJzzdM+DHoz6EZcViALV2YB8Zkys3hvp++bThFRjjc4jrXn0TApH LYgRmMXeWq+RSQD9TshJmQsK1zSxHWcS2RrdGQLHcOBNcuq4Ik0iMBxBmCuhoh9pWs7t LE0ynuk/jHV3BCEK4Pswit4c05hyHDjpNxjtE4fdRIyFqVG9yENAVlcPxLT20lixBo2P M11wD0kAXtie/QUUITR7KWyM7EYtsChc92WkEUORI0mRrZeK/4+aCt4ElZIe9w/I3yrZ QeBA== MIME-Version: 1.0 Received: by 10.180.106.130 with SMTP id gu2mr2916794wib.20.1348682072242; Wed, 26 Sep 2012 10:54:32 -0700 (PDT) Received: by 10.223.61.18 with HTTP; Wed, 26 Sep 2012 10:54:32 -0700 (PDT) In-Reply-To: References: Date: Wed, 26 Sep 2012 10:54:32 -0700 Message-ID: Subject: Re: [VOTE] Merge HDFS-3077 (QuorumJournalManager) branch to trunk From: Eli Collins To: hdfs-dev@hadoop.apache.org Content-Type: text/plain; charset=ISO-8859-1 X-Gm-Message-State: ALoCoQkcvgcVIQjaSBm+W7x2wv3AnCGTitdZQ/4XfpMtwHHYV0AzfTG/C4k4eT5fPp8tc9TgZtSM X-Virus-Checked: Checked by ClamAV on apache.org On Wed, Sep 26, 2012 at 10:50 AM, Todd Lipcon wrote: > On Tue, Sep 25, 2012 at 11:21 PM, Konstantin Shvachko > wrote: >> I think this is a great work, Todd. >> And I think we should not merge it into trunk or other branches. >> As I suggested earlier on this list I think this should be spinned off >> as a separate project or a subproject. >> >> - The code is well detached as a self contained package. > > The addition is mostly self-contained, but it makes use of a bunch of > "private" parts of HDFS and Common: > - Reuses all of the Hadoop security infrastructure, IPC, metrics, etc > - Coupled to the JournalManager interface which is still evolving. In > fact there were several patches in trunk which were done during the > development of this project, specifically to make this API more > general. There's still some further work to be done in this area on > the generic interface -- eg support for upgrade/rollback. > - The functional tests make use of a bunch of "private" HDFS APIs as well. > >> - It is a logically stand-alone project that can be replaced by other >> technologies. >> - If it is a separate project then there is no need to port it to >> other versions. You can package it as a dependent jar. > > Per above, it's not that separate, because in order to build it, we > had to make a number of changes to core HDFS internal interfaces. It > currently couldn't be used to store anything except for NN logs. It > would be a nice extension to truly separate it out into a > content-agnostic quorum-based edit log, but today it actually uses the > existing edit log validation code to determine valid lengths, etc. > >> - Finally, it will be a good precedent of spinning new projects out of >> HDFS rather than bringing everything under HDFS umbrella. >> >> Todd, I had a feeling you were in favor of this direction? > > I'm not in favor of it - I had said previously that it's worth > discussing if several other people believe the same. I'm not in favor of it either. One of the benefits of QJM over the BK approach is that it's embedded in HDFS (ie not treated as a separate storage system). See HDFS-3077 and HDFS-3092 for details on that discussion.