Return-Path: Delivered-To: apmail-lucene-hadoop-dev-archive@locus.apache.org Received: (qmail 59204 invoked from network); 30 Aug 2007 04:40:54 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (140.211.11.2) by minotaur.apache.org with SMTP; 30 Aug 2007 04:40:54 -0000 Received: (qmail 53694 invoked by uid 500); 30 Aug 2007 04:40:49 -0000 Delivered-To: apmail-lucene-hadoop-dev-archive@lucene.apache.org Received: (qmail 53374 invoked by uid 500); 30 Aug 2007 04:40:48 -0000 Mailing-List: contact hadoop-dev-help@lucene.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: hadoop-dev@lucene.apache.org Delivered-To: mailing list hadoop-dev@lucene.apache.org Received: (qmail 53365 invoked by uid 99); 30 Aug 2007 04:40:48 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 29 Aug 2007 21:40:48 -0700 X-ASF-Spam-Status: No, hits=-100.0 required=10.0 tests=ALL_TRUSTED X-Spam-Check-By: apache.org Received: from [140.211.11.4] (HELO brutus.apache.org) (140.211.11.4) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 30 Aug 2007 04:41:50 +0000 Received: from brutus (localhost [127.0.0.1]) by brutus.apache.org (Postfix) with ESMTP id 8C4C3714204 for ; Wed, 29 Aug 2007 21:40:30 -0700 (PDT) Message-ID: <7874814.1188448830561.JavaMail.jira@brutus> Date: Wed, 29 Aug 2007 21:40:30 -0700 (PDT) From: "stack (JIRA)" To: hadoop-dev@lucene.apache.org Subject: [jira] Updated: (HADOOP-1816) [hbase] Scan of .META. does socket timeout over and over again (rather than In-Reply-To: <14057506.1188448711071.JavaMail.jira@brutus> MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-Virus-Checked: Checked by ClamAV on apache.org [ https://issues.apache.org/jira/browse/HADOOP-1816?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] stack updated HADOOP-1816: -------------------------- Attachment: excerpt.txt Log excerpt illustrating the problem. > [hbase] Scan of .META. does socket timeout over and over again (rather than > ---------------------------------------------------------------------------- > > Key: HADOOP-1816 > URL: https://issues.apache.org/jira/browse/HADOOP-1816 > Project: Hadoop > Issue Type: Bug > Components: contrib/hbase > Reporter: stack > Priority: Trivial > Attachments: excerpt.txt > > > A mismatch in the code on the cluster revealed an infinite loop. The .META. scanner is doing a socket timeout trying to contact a borked region server (The borked server was having trouble contacting hdfs because of of code version mismatch -- it was sort-of-working). We retry the timeout up to the retry limit but then rather than try and redeploy the unreachable .META. we just drop back into scanning at the old location.... I'll attach a log that illustrates the goings-on. > I think this likely a trivial issue since it shouldn't really ever happen.... -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online.