Return-Path: Delivered-To: apmail-hbase-issues-archive@www.apache.org Received: (qmail 51243 invoked from network); 2 Dec 2010 10:30:37 -0000 Received: from unknown (HELO mail.apache.org) (140.211.11.3) by 140.211.11.9 with SMTP; 2 Dec 2010 10:30:37 -0000 Received: (qmail 28984 invoked by uid 500); 2 Dec 2010 10:30:37 -0000 Delivered-To: apmail-hbase-issues-archive@hbase.apache.org Received: (qmail 28952 invoked by uid 500); 2 Dec 2010 10:30:37 -0000 Mailing-List: contact issues-help@hbase.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Delivered-To: mailing list issues@hbase.apache.org Received: (qmail 28740 invoked by uid 99); 2 Dec 2010 10:30:36 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 02 Dec 2010 10:30:36 +0000 X-ASF-Spam-Status: No, hits=-2000.0 required=10.0 tests=ALL_TRUSTED X-Spam-Check-By: apache.org Received: from [140.211.11.22] (HELO thor.apache.org) (140.211.11.22) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 02 Dec 2010 10:30:34 +0000 Received: from thor (localhost [127.0.0.1]) by thor.apache.org (8.13.8+Sun/8.13.8) with ESMTP id oB2AUC5H000991 for ; Thu, 2 Dec 2010 10:30:12 GMT Message-ID: <19646270.65471291285812037.JavaMail.jira@thor> Date: Thu, 2 Dec 2010 05:30:12 -0500 (EST) From: "Lars George (JIRA)" To: issues@hbase.apache.org Subject: [jira] Commented: (HBASE-3295) Dropping a 1k+ regions table likely ends in a client socket timeout and it's very confusing In-Reply-To: <10361581.53201291231751236.JavaMail.jira@thor> MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 X-Virus-Checked: Checked by ClamAV on apache.org [ https://issues.apache.org/jira/browse/HBASE-3295?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12966053#action_12966053 ] Lars George commented on HBASE-3295: ------------------------------------ Sorry Stack, did not see your earlier comment. If that client side spin works then that is what I meant. > Dropping a 1k+ regions table likely ends in a client socket timeout and it's very confusing > ------------------------------------------------------------------------------------------- > > Key: HBASE-3295 > URL: https://issues.apache.org/jira/browse/HBASE-3295 > Project: HBase > Issue Type: Bug > Reporter: Jean-Daniel Cryans > Fix For: 0.90.0 > > Attachments: 3295-v2.txt, 3295.txt > > > I tried truncating a 1.6k regions table from the shell and, after the usual disabling timeout, I then got a socket timeout on the second invocation while it was dropping. It looked like this: > {noformat} > ERROR: java.net.SocketTimeoutException: Call to sv2borg180/10.20.20.180:61000 failed on socket timeout exception: > java.net.SocketTimeoutException: 60000 millis timeout while waiting for channel to be ready for read. ch : > java.nio.channels.SocketChannel[connected local=/10.20.20.180:59153 remote=sv2borg180/10.20.20.180:61000] > {noformat} > At first I thought that was coming from the master because HDFS was somehow slow, but then understood that it was my socket that timed out meaning that the master was still dropping the table. Calling truncate again, I got: > {noformat} > ERROR: Unknown table TestTable! > {noformat} > Which means that the table would be deleted... I learned later that it wasn't totally deleted after I shut down the cluster. So it leaves me in a situation where I have to manually delete the files on the FS and the remaining .META. entries. > Since I expect a few people will hit this issue rather soon, for 0.90.0, I propose we just set the socket timeout really high in the shell. For 0.90.1, or 0.92, we should do for drop what we do for disabling. -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online.