Date: Mon, 18 May 2015 09:22:00 +0000 (UTC)
From: "Walter Su (JIRA)"
To: hdfs-issues@hadoop.apache.org
Reply-To: hdfs-issues@hadoop.apache.org
Subject: [jira] [Resolved] (HDFS-8339) Erasure Coding: Badly treated when createBlockOutputStream failed in DataStreamer

     [ https://issues.apache.org/jira/browse/HDFS-8339?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Walter Su resolved HDFS-8339.
-----------------------------
    Resolution: Duplicate

> Erasure Coding: Badly treated when createBlockOutputStream failed in DataStreamer
> ----------------------------------------------------------------------------------
>
>                 Key: HDFS-8339
>                 URL: https://issues.apache.org/jira/browse/HDFS-8339
>             Project: Hadoop HDFS
>          Issue Type: Sub-task
>            Reporter: Walter Su
>
> h4. Issue 1:
> The leading streamer calls abandonBlock(..) and gets a new blockGroup from the NN. The leading (primary) streamer should sync with the non-leading streamers instead of throwing an exception (offering twice to the stripedBlock blockingQueue).
> {noformat}
> 2015-05-07 18:58:05,335 INFO  hdfs.DataStreamer (DataStreamer.java:nextBlockOutputStream(1386)) - Abandoning BP-172584615-9.96.1.34-1430996280714:blk_-9223372036854775792_1001
> ...
> 2015-05-07 18:58:05,373 WARN  hdfs.DataStreamer (DataStreamer.java:run(572)) - DataStreamer Exception
> java.io.IOException: Failed: LocatedBlock{BP-172584615-9.96.1.34-1430996280714:blk_-9223372036854775770_1002; getBlockSize()=0; corrupt=false; offset=1572864; locs=[DatanodeInfoWithStorage[127.0.0.1:52490,DS-6080b76f-adf7-45a8-aa0e-e0e82c2c1569,DISK]]}, i=6
>     at org.apache.hadoop.hdfs.DFSStripedOutputStream$Coordinator.putStripedBlock(DFSStripedOutputStream.java:117)
>     at org.apache.hadoop.hdfs.StripedDataStreamer.locateFollowingBlock(StripedDataStreamer.java:120)
>     at org.apache.hadoop.hdfs.DataStreamer.nextBlockOutputStream(DataStreamer.java:1364)
>     at org.apache.hadoop.hdfs.DataStreamer.run(DataStreamer.java:461)
>     at org.apache.hadoop.hdfs.StripedDataStreamer.run(StripedDataStreamer.java:48)
> {noformat}
> h4. Issue 2:
> A non-leading streamer calls abandonBlock(..) and gets a new locatedBlock from the coordinator. Actually, it is the last blockGroup, so no more locatedBlocks can be polled from the stripedBlocks blockingQueue. The other 8 streamers finished and closed, but this streamer hangs for about 90 seconds.
> {noformat}
> 2015-05-07 19:21:25,357 INFO  BlockStateChange (BlockManager.java:logAddStoredBlock(2768)) - BLOCK* addStoredBlock: blockMap updated: 127.0.0.1:51998 is added to ...
> 2015-05-07 19:22:55,250 WARN  hdfs.DataStreamer (DataStreamer.java:run(572)) - DataStreamer Exception
> java.io.IOException: Failed: i=1
>     at org.apache.hadoop.hdfs.DFSStripedOutputStream$Coordinator.getStripedBlock(DFSStripedOutputStream.java:130)
>     at org.apache.hadoop.hdfs.StripedDataStreamer.locateFollowingBlock(StripedDataStreamer.java:124)
>     at org.apache.hadoop.hdfs.DataStreamer.nextBlockOutputStream(DataStreamer.java:1364)
>     at org.apache.hadoop.hdfs.DataStreamer.run(DataStreamer.java:461)
>     at org.apache.hadoop.hdfs.StripedDataStreamer.run(StripedDataStreamer.java:48)
> {noformat}
> h4. Issue 3:
> Remove the abandonBlock(..) RPC call for non-leading streamers.

--
This message was sent by Atlassian JIRA
(v6.3.4#6332)