Return-Path: X-Original-To: archive-asf-public-internal@cust-asf2.ponee.io Delivered-To: archive-asf-public-internal@cust-asf2.ponee.io Received: from cust-asf.ponee.io (cust-asf.ponee.io [163.172.22.183]) by cust-asf2.ponee.io (Postfix) with ESMTP id 1BAAF200C88 for ; Fri, 2 Jun 2017 19:33:09 +0200 (CEST) Received: by cust-asf.ponee.io (Postfix) id 18DF1160BBA; Fri, 2 Jun 2017 17:33:09 +0000 (UTC) Delivered-To: archive-asf-public@cust-asf.ponee.io Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by cust-asf.ponee.io (Postfix) with SMTP id 60253160BD2 for ; Fri, 2 Jun 2017 19:33:08 +0200 (CEST) Received: (qmail 32330 invoked by uid 500); 2 Jun 2017 17:33:07 -0000 Mailing-List: contact hdfs-issues-help@hadoop.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Delivered-To: mailing list hdfs-issues@hadoop.apache.org Received: (qmail 32319 invoked by uid 99); 2 Jun 2017 17:33:07 -0000 Received: from pnap-us-west-generic-nat.apache.org (HELO spamd3-us-west.apache.org) (209.188.14.142) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 02 Jun 2017 17:33:07 +0000 Received: from localhost (localhost [127.0.0.1]) by spamd3-us-west.apache.org (ASF Mail Server at spamd3-us-west.apache.org) with ESMTP id 0538B182883 for ; Fri, 2 Jun 2017 17:33:07 +0000 (UTC) X-Virus-Scanned: Debian amavisd-new at spamd3-us-west.apache.org X-Spam-Flag: NO X-Spam-Score: -99.202 X-Spam-Level: X-Spam-Status: No, score=-99.202 tagged_above=-999 required=6.31 tests=[KAM_ASCII_DIVIDERS=0.8, RP_MATCHES_RCVD=-0.001, SPF_PASS=-0.001, USER_IN_WHITELIST=-100] autolearn=disabled Received: from mx1-lw-eu.apache.org ([10.40.0.8]) by localhost (spamd3-us-west.apache.org [10.40.0.10]) (amavisd-new, port 10024) with ESMTP id wmZtBqhJzY06 for ; Fri, 2 Jun 2017 17:33:06 +0000 (UTC) Received: from mailrelay1-us-west.apache.org (mailrelay1-us-west.apache.org [209.188.14.139]) by mx1-lw-eu.apache.org (ASF Mail Server at mx1-lw-eu.apache.org) with ESMTP id 81C925FB80 for ; Fri, 2 Jun 2017 17:33:05 +0000 (UTC) Received: from jira-lw-us.apache.org (unknown [207.244.88.139]) by mailrelay1-us-west.apache.org (ASF Mail Server at mailrelay1-us-west.apache.org) with ESMTP id CF302E0D27 for ; Fri, 2 Jun 2017 17:33:04 +0000 (UTC) Received: from jira-lw-us.apache.org (localhost [127.0.0.1]) by jira-lw-us.apache.org (ASF Mail Server at jira-lw-us.apache.org) with ESMTP id 319E021B5B for ; Fri, 2 Jun 2017 17:33:04 +0000 (UTC) Date: Fri, 2 Jun 2017 17:33:04 +0000 (UTC) From: "Wei-Chiu Chuang (JIRA)" To: hdfs-issues@hadoop.apache.org Message-ID: In-Reply-To: References: Subject: [jira] [Commented] (HDFS-11914) Add more diagnosis info for fsimage transfer failure. MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 archived-at: Fri, 02 Jun 2017 17:33:09 -0000 [ https://issues.apache.org/jira/browse/HDFS-11914?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16035093#comment-16035093 ] Wei-Chiu Chuang commented on HDFS-11914: ---------------------------------------- Hi [~yzhangal] I reviewed the patch and mostly good. I suggest improve the readability of the log a little bit: in {{copyFileToStream}}: "Connection closed by client. Sent total=". add " bytes." at the end. " Size of last segment possibly sent=" can be rephrased as " Size of last segment intended to send=" Could you also explain why use String.valueOf() to print fsImageName, a String? Thanks! > Add more diagnosis info for fsimage transfer failure. > ----------------------------------------------------- > > Key: HDFS-11914 > URL: https://issues.apache.org/jira/browse/HDFS-11914 > Project: Hadoop HDFS > Issue Type: Improvement > Reporter: Yongjun Zhang > Assignee: Yongjun Zhang > Labels: supportability > Attachments: HDFS-11914.001.patch, HDFS-11914.002.patch > > > Hit a fsimage download problem: > Client tries to download fsimage, and got: > WARN org.apache.hadoop.security.UserGroupInformation: PriviledgedActionException as:hdfs (auth:SIMPLE) cause:java.io.IOException: File http://x.y.z:50070/imagetransfer?getimage=1&txid=latest received length xyz is not of the advertised size abc. > Basically client does not get enough fsimage data and finished prematurely without any exception thrown, until it finds the size of data received is smaller than expected. The client then closed the conenction to NN, that caused NN to report > INFO org.apache.hadoop.hdfs.server.namenode.TransferFsImage: Connection closed by client > This jira is to add some more information in logs to help debugging the sitaution. Specifically, report the stack trace when the connection is closed. And how much data has been sent at that point. etc. > -- This message was sent by Atlassian JIRA (v6.3.15#6346) --------------------------------------------------------------------- To unsubscribe, e-mail: hdfs-issues-unsubscribe@hadoop.apache.org For additional commands, e-mail: hdfs-issues-help@hadoop.apache.org