Date: Fri, 14 Oct 2016 15:02:20 +0000 (UTC)
From: "Xiaowei Zhu (JIRA)"
To: hdfs-issues@hadoop.apache.org
Subject: [jira] [Commented] (HDFS-11014) libhdfs++: Make connection to HA clusters faster

[ https://issues.apache.org/jira/browse/HDFS-11014?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15575593#comment-15575593 ]

Xiaowei Zhu commented on HDFS-11014:
------------------------------------

+1. The changes all make good sense.

> libhdfs++: Make connection to HA clusters faster
> ------------------------------------------------
>
>                 Key: HDFS-11014
>                 URL: https://issues.apache.org/jira/browse/HDFS-11014
>             Project: Hadoop HDFS
>          Issue Type: Sub-task
>          Components: hdfs-client
>            Reporter: James Clampffer
>            Assignee: James Clampffer
>            Priority: Minor
>         Attachments: HDFS-11014.HDFS-8707.000.patch
>
>
> Right now when we get a StandbyException from the NN we inject a 20 second delay before we try the alternate NN even if it's the first failover. The first failover shouldn't have a delay (java client skips delay on first failover).
> Another minor change I'd like to make is to reduce the default number of failover attempts from 15 (used in the apache config) to 4. My impression is that higher numbers of failovers are really handy for longer running batch jobs but in the libhdfs++ case the client is often an interactive application. In this case it's generally preferable to fail sooner so a user doesn't have to wait the ~8 minutes to time out when using default settings.
> 4 failovers is based on the assumption that if we can't immediately connect there is either a GC pause which will most likely be finished before the second connection attempt or it's a network or config issue that will take some sorting out by an admin. It'd still be possible to override these in the config for more tuning if a specific deployment tends to have more or less network issues.
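
The behavior proposed in the issue description can be illustrated with a minimal sketch. It is not taken from the attached patch or from the libhdfs++ sources: `FailoverPolicy`, `FailoverDecision`, and `OnStandbyException` are hypothetical names, and the values 4 and 20 seconds are simply the numbers quoted in the description. The sketch assumes the caller tracks how many failovers it has already performed for the current call.

```cpp
#include <chrono>

// Hypothetical types for illustration only; not the libhdfs++ API.
struct FailoverDecision {
  bool should_retry;                // false once the failover budget is spent
  std::chrono::milliseconds delay;  // how long to wait before trying the other NN
};

class FailoverPolicy {
 public:
  FailoverPolicy(int max_failovers, std::chrono::milliseconds delay)
      : max_failovers_(max_failovers), delay_(delay) {}

  // failovers_so_far: failovers already performed for this call.
  FailoverDecision OnStandbyException(int failovers_so_far) const {
    if (failovers_so_far >= max_failovers_) {
      // Budget exhausted: give up so an interactive user is not kept waiting.
      return {false, std::chrono::milliseconds(0)};
    }
    if (failovers_so_far == 0) {
      // First failover: try the alternate NN immediately, with no delay.
      return {true, std::chrono::milliseconds(0)};
    }
    // Later failovers: wait the configured delay before retrying.
    return {true, delay_};
  }

 private:
  int max_failovers_;
  std::chrono::milliseconds delay_;
};
```

With `FailoverPolicy policy(4, std::chrono::seconds(20))`, the first StandbyException leads to an immediate retry against the alternate NN, the next three wait 20 seconds each, and the fifth reports failure. Both constructor arguments stand in for values that, per the description, would remain overridable in the config.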