Return-Path: X-Original-To: apmail-hadoop-hdfs-dev-archive@minotaur.apache.org Delivered-To: apmail-hadoop-hdfs-dev-archive@minotaur.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 432FF9CE1 for ; Thu, 22 Mar 2012 17:25:58 +0000 (UTC) Received: (qmail 44702 invoked by uid 500); 22 Mar 2012 17:25:57 -0000 Delivered-To: apmail-hadoop-hdfs-dev-archive@hadoop.apache.org Received: (qmail 44456 invoked by uid 500); 22 Mar 2012 17:25:57 -0000 Mailing-List: contact hdfs-dev-help@hadoop.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: hdfs-dev@hadoop.apache.org Delivered-To: mailing list hdfs-dev@hadoop.apache.org Received: (qmail 44447 invoked by uid 99); 22 Mar 2012 17:25:57 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 22 Mar 2012 17:25:57 +0000 X-ASF-Spam-Status: No, hits=-0.7 required=5.0 tests=RCVD_IN_DNSWL_LOW,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (nike.apache.org: domain of eli@cloudera.com designates 209.85.215.48 as permitted sender) Received: from [209.85.215.48] (HELO mail-lpp01m010-f48.google.com) (209.85.215.48) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 22 Mar 2012 17:25:50 +0000 Received: by lagu2 with SMTP id u2so2526107lag.35 for ; Thu, 22 Mar 2012 10:25:29 -0700 (PDT) X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=20120113; h=mime-version:in-reply-to:references:date:message-id:subject:from:to :content-type:x-gm-message-state; bh=MaIeZLeYr5NDK4sImbVZ5gjfqhNEOqX3ReZCVrYozwM=; b=cA3zyY2Xa06E+CQfmmmtDT5dVwFAYJe1jM3JxJ8WrjU25K/B9M5TjOxynovWbXgpDc S1bzFB6AC3mYK/lems4FoF1C8IVHLJR5xQU0vuVPswY2Blitl3Os6ip08+r9T3KAyGmp wJ5Bt9yPapXGA9siNAd8JPyjlCxXGoEWlubAmSrT3aYNbv0B4iNiOarJzJkoUTuzVIJH 26+aEu+VXSIuisr33MycH4D5UuIzZabbRLWfajNoEFRJFysCSKHJhfp8agTYSe+Cljt2 E+dcOA69Ducc0SXk/gQ0+m8Q8YUnMAXD0XnHANhbgL/rUd9ZYSg8295NguXzZaP7eNFA ZkMg== MIME-Version: 1.0 Received: by 10.152.145.135 with SMTP id su7mr6419498lab.5.1332437129508; Thu, 22 Mar 2012 10:25:29 -0700 (PDT) Received: by 10.112.102.136 with HTTP; Thu, 22 Mar 2012 10:25:29 -0700 (PDT) In-Reply-To: References: Date: Thu, 22 Mar 2012 10:25:29 -0700 Message-ID: Subject: Re: [DISCUSS] Remove append? From: Eli Collins To: hdfs-dev@hadoop.apache.org Content-Type: text/plain; charset=ISO-8859-1 X-Gm-Message-State: ALoCoQlT5P50P32i9WdTvo6NPSW10yjgrYVRIo7Zj+Na1OPKeZcsn04FuFL7+YZBgWuD2P1nV0D6 X-Virus-Checked: Checked by ClamAV on apache.org On Thu, Mar 22, 2012 at 1:26 AM, Konstantin Shvachko wrote: > Eli, > > I went over the entire discussion on the topic, and did not get it. Is > there a problem with append? We know it does not work in hadoop-1, > only flush() does. Is there anything wrong with the new append > (HDFS-265)? If so please file a bug. > I tested it in Hadoop-0.22 branch it works fine. > > I agree with people who were involved with the implementation of the > new append that the complexity is mainly in > 1. pipeline recovery > 2. consistent client reading while writing, and > 3. hflush() > Once it is done the append itself, which is reopening of previously > closed files for adding data, is not complex. > I agree that much of the complexity is in #1-3 above, which is why HDFS-265 is leveraged. The primary simplicity of not having append (and truncate) comes from not leveraging the invariant that finalized blocks are immutable, that blocks once written won't eg shrink in size (which we assume today). > You mentioned it and I agree you indeed should be more involved with > your customer base. As for eBay, append was of the motivations to work > on stabilizing 0.22 branch. And there is a lot of use cases which > require append for our customers. > Some of them were mentioned in this discussion. > >From what I've seen 0.22 isn't ready for production use. Aside from not supporting critical features like security, it doesn't have a size-able user-base behind it testing and fixing bugs, etc. All things I'd imagine an org like eBay would want. I've never gotten a request to support 0.22 from a customer. Thanks, Eli