Return-Path: X-Original-To: archive-asf-public-internal@cust-asf2.ponee.io Delivered-To: archive-asf-public-internal@cust-asf2.ponee.io Received: from cust-asf.ponee.io (cust-asf.ponee.io [163.172.22.183]) by cust-asf2.ponee.io (Postfix) with ESMTP id 15D16200D2D for ; Fri, 27 Oct 2017 22:40:07 +0200 (CEST) Received: by cust-asf.ponee.io (Postfix) id 143F81609DD; Fri, 27 Oct 2017 20:40:07 +0000 (UTC) Delivered-To: archive-asf-public@cust-asf.ponee.io Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by cust-asf.ponee.io (Postfix) with SMTP id 59786160BF2 for ; Fri, 27 Oct 2017 22:40:06 +0200 (CEST) Received: (qmail 8014 invoked by uid 500); 27 Oct 2017 20:40:05 -0000 Mailing-List: contact hdfs-issues-help@hadoop.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Delivered-To: mailing list hdfs-issues@hadoop.apache.org Received: (qmail 7769 invoked by uid 99); 27 Oct 2017 20:40:05 -0000 Received: from pnap-us-west-generic-nat.apache.org (HELO spamd1-us-west.apache.org) (209.188.14.142) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 27 Oct 2017 20:40:05 +0000 Received: from localhost (localhost [127.0.0.1]) by spamd1-us-west.apache.org (ASF Mail Server at spamd1-us-west.apache.org) with ESMTP id 54E7CC4868 for ; Fri, 27 Oct 2017 20:40:04 +0000 (UTC) X-Virus-Scanned: Debian amavisd-new at spamd1-us-west.apache.org X-Spam-Flag: NO X-Spam-Score: -100.002 X-Spam-Level: X-Spam-Status: No, score=-100.002 tagged_above=-999 required=6.31 tests=[RP_MATCHES_RCVD=-0.001, SPF_PASS=-0.001, USER_IN_WHITELIST=-100] autolearn=disabled Received: from mx1-lw-eu.apache.org ([10.40.0.8]) by localhost (spamd1-us-west.apache.org [10.40.0.7]) (amavisd-new, port 10024) with ESMTP id amJFIaIuoaqj for ; Fri, 27 Oct 2017 20:40:03 +0000 (UTC) Received: from mailrelay1-us-west.apache.org (mailrelay1-us-west.apache.org [209.188.14.139]) by mx1-lw-eu.apache.org (ASF Mail Server at mx1-lw-eu.apache.org) with ESMTP id 142FF5FB2D for ; Fri, 27 Oct 2017 20:40:03 +0000 (UTC) Received: from jira-lw-us.apache.org (unknown [207.244.88.139]) by mailrelay1-us-west.apache.org (ASF Mail Server at mailrelay1-us-west.apache.org) with ESMTP id 54DC9E00B3 for ; Fri, 27 Oct 2017 20:40:02 +0000 (UTC) Received: from jira-lw-us.apache.org (localhost [127.0.0.1]) by jira-lw-us.apache.org (ASF Mail Server at jira-lw-us.apache.org) with ESMTP id 34C2C212F5 for ; Fri, 27 Oct 2017 20:40:00 +0000 (UTC) Date: Fri, 27 Oct 2017 20:40:00 +0000 (UTC) From: "Sean Mackrory (JIRA)" To: hdfs-issues@hadoop.apache.org Message-ID: In-Reply-To: References: Subject: [jira] [Updated] (HDFS-11096) Support rolling upgrade between 2.x and 3.x MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 archived-at: Fri, 27 Oct 2017 20:40:07 -0000 [ https://issues.apache.org/jira/browse/HDFS-11096?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Mackrory updated HDFS-11096: --------------------------------- Attachment: HDFS-11096.007.patch Okay - I'm now pretty happy with how this is working. I saw the last shellcheck problems locally, and have fixed those, too. I've had several successful test runs of both of the Docker tests in the last few days, and this is looking pretty reliable to me: * Versions to test are now specified via CLI args to the Docker scripts. That way this only has to change in code when there's a bug to fix or other improvement to make: Jenkins jobs can be updated for various version combinations independently. * Fixing more ZK timeouts, this time in YARN. I've disabled the YARN rolling upgrade as that appears to be troublesome again. But the HDFS upgrade is working and YARN / MR is working well during and after that upgrade. I'll keep troubleshooting the YARN side, but that can be a separate JIRA. * Logs are now saved to ./logs/ back on the host to facilitate more debugging after the Docker images have been destroyed in the event of a failure. Although I've made a number of fixes as documented in the comments, not much has changed that would invalidate the value of previous code reviews, IMO. [~aw] - have I addressed the issues you pointed out to your satisfaction? > Support rolling upgrade between 2.x and 3.x > ------------------------------------------- > > Key: HDFS-11096 > URL: https://issues.apache.org/jira/browse/HDFS-11096 > Project: Hadoop HDFS > Issue Type: Improvement > Components: rolling upgrades > Affects Versions: 3.0.0-alpha1 > Reporter: Andrew Wang > Assignee: Sean Mackrory > Priority: Blocker > Attachments: HDFS-11096.001.patch, HDFS-11096.002.patch, HDFS-11096.003.patch, HDFS-11096.004.patch, HDFS-11096.005.patch, HDFS-11096.006.patch, HDFS-11096.007.patch > > > trunk has a minimum software version of 3.0.0-alpha1. This means we can't rolling upgrade between branch-2 and trunk. > This is a showstopper for large deployments. Unless there are very compelling reasons to break compatibility, let's restore the ability to rolling upgrade to 3.x releases. -- This message was sent by Atlassian JIRA (v6.4.14#64029) --------------------------------------------------------------------- To unsubscribe, e-mail: hdfs-issues-unsubscribe@hadoop.apache.org For additional commands, e-mail: hdfs-issues-help@hadoop.apache.org