Return-Path: Delivered-To: apmail-hadoop-hdfs-issues-archive@minotaur.apache.org Received: (qmail 33771 invoked from network); 4 Aug 2009 23:22:34 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (140.211.11.3) by minotaur.apache.org with SMTP; 4 Aug 2009 23:22:34 -0000 Received: (qmail 81200 invoked by uid 500); 4 Aug 2009 23:22:39 -0000 Delivered-To: apmail-hadoop-hdfs-issues-archive@hadoop.apache.org Received: (qmail 81154 invoked by uid 500); 4 Aug 2009 23:22:39 -0000 Mailing-List: contact hdfs-issues-help@hadoop.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: hdfs-issues@hadoop.apache.org Delivered-To: mailing list hdfs-issues@hadoop.apache.org Received: (qmail 81144 invoked by uid 99); 4 Aug 2009 23:22:39 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 04 Aug 2009 23:22:39 +0000 X-ASF-Spam-Status: No, hits=-2000.0 required=10.0 tests=ALL_TRUSTED X-Spam-Check-By: apache.org Received: from [140.211.11.140] (HELO brutus.apache.org) (140.211.11.140) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 04 Aug 2009 23:22:36 +0000 Received: from brutus (localhost [127.0.0.1]) by brutus.apache.org (Postfix) with ESMTP id D1E7329A0013 for ; Tue, 4 Aug 2009 16:22:14 -0700 (PDT) Message-ID: <658979050.1249428134844.JavaMail.jira@brutus> Date: Tue, 4 Aug 2009 16:22:14 -0700 (PDT) From: "Jakob Homan (JIRA)" To: hdfs-issues@hadoop.apache.org Subject: [jira] Commented: (HDFS-167) DFSClient continues to retry indefinitely MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 X-Virus-Checked: Checked by ClamAV on apache.org [ https://issues.apache.org/jira/browse/HDFS-167?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12739240#action_12739240 ] Jakob Homan commented on HDFS-167: ---------------------------------- I tend to agree with Bill. I don't see any harm in having the extra ctor. Moreover, since this patch has already been applied to trunk, the new change would only be introduced to 20, which seems like not the best idea. > DFSClient continues to retry indefinitely > ----------------------------------------- > > Key: HDFS-167 > URL: https://issues.apache.org/jira/browse/HDFS-167 > Project: Hadoop HDFS > Issue Type: Bug > Components: hdfs client > Reporter: Derek Wollenstein > Assignee: Bill Zeller > Priority: Minor > Fix For: 0.21.0 > > Attachments: hdfs-167-4.patch, hdfs-167-5.patch, hdfs-167-6.patch, hdfs-167-for-20-1.patch > > > I encountered a bug when trying to upload data using the Hadoop DFS Client. > After receiving a NotReplicatedYetException, the DFSClient will normally retry its upload up to some limited number of times. In this case, I found that this retry loop continued indefinitely, to the point that the number of tries remaining was negative: > 2009-03-25 16:20:02 [INFO] > 2009-03-25 16:20:02 [INFO] 09/03/25 16:20:02 INFO hdfs.DFSClient: Waiting for replication for 21 seconds > 2009-03-25 16:20:03 [INFO] 09/03/25 16:20:02 WARN hdfs.DFSClient: NotReplicatedYetException sleeping /apollo/env/SummaryMySQL/var/logstore/fiorello_logs_2009 > 0325_us/logs_20090325_us_13 retries left -1 > The stack trace for the failure that's retrying is: > 2009-03-25 16:20:02 [INFO] 09/03/25 16:20:02 INFO hdfs.DFSClient: org.apache.hadoop.ipc.RemoteException: org.apache.hadoop.hdfs.server.namenode.NotReplicated > YetException: Not replicated yet: > 2009-03-25 16:20:02 [INFO] at org.apache.hadoop.hdfs.server.namenode.FSNamesystem.getAdditionalBlock(FSNamesystem.java:1266) > 2009-03-25 16:20:02 [INFO] at org.apache.hadoop.hdfs.server.namenode.NameNode.addBlock(NameNode.java:351) > 2009-03-25 16:20:02 [INFO] at sun.reflect.GeneratedMethodAccessor19.invoke(Unknown Source) > 2009-03-25 16:20:02 [INFO] at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25) > 2009-03-25 16:20:02 [INFO] at java.lang.reflect.Method.invoke(Method.java:597) > 2009-03-25 16:20:02 [INFO] at org.apache.hadoop.ipc.RPC$Server.call(RPC.java:481) > 2009-03-25 16:20:02 [INFO] at org.apache.hadoop.ipc.Server$Handler.run(Server.java:894) > 2009-03-25 16:20:02 [INFO] > 2009-03-25 16:20:02 [INFO] at org.apache.hadoop.ipc.Client.call(Client.java:697) > 2009-03-25 16:20:02 [INFO] at org.apache.hadoop.ipc.RPC$Invoker.invoke(RPC.java:216) > 2009-03-25 16:20:02 [INFO] at $Proxy0.addBlock(Unknown Source) > 2009-03-25 16:20:02 [INFO] at sun.reflect.GeneratedMethodAccessor2.invoke(Unknown Source) > 2009-03-25 16:20:02 [INFO] at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25) > 2009-03-25 16:20:02 [INFO] at java.lang.reflect.Method.invoke(Method.java:597) > 2009-03-25 16:20:02 [INFO] at org.apache.hadoop.io.retry.RetryInvocationHandler.invokeMethod(RetryInvocationHandler.java:82) > 2009-03-25 16:20:02 [INFO] at org.apache.hadoop.io.retry.RetryInvocationHandler.invoke(RetryInvocationHandler.java:59) > 2009-03-25 16:20:02 [INFO] at $Proxy0.addBlock(Unknown Source) > 2009-03-25 16:20:02 [INFO] at org.apache.hadoop.hdfs.DFSClient$DFSOutputStream.locateFollowingBlock(DFSClient.java:2814) > 2009-03-25 16:20:02 [INFO] at org.apache.hadoop.hdfs.DFSClient$DFSOutputStream.nextBlockOutputStream(DFSClient.java:2696) > 2009-03-25 16:20:02 [INFO] at org.apache.hadoop.hdfs.DFSClient$DFSOutputStream.access$2000(DFSClient.java:1996) > 2009-03-25 16:20:02 [INFO] at org.apache.hadoop.hdfs.DFSClient$DFSOutputStream$DataStreamer.run(DFSClient.java:2183) -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online.