From: "Espinoza,Carlos"
Reply-To: user@hbase.apache.org
Subject: Getting scans to timeout
Date: Thu, 27 Sep 2012 14:45:43 -0400
Message-ID: <6548F17059905B48B2A6F28CE3692BAA0CE2973A@OAEXCH4SERVER.oa.oclc.org>

Hi,

Thanks for your help. I've been doing this in a pseudo-distributed hbase-0.92.1 environment with one region server. I'm trying to scan a table and see it time out, to recreate a scenario where the RS is not responding (for instance, due to a NIC failure). So I've been issuing a 'kill -STOP' to the region server. I expected the client to time out, but instead it just blocks at HTable.getScanner(). There is no output, no retries, nothing. I understand that I'm pausing execution on the region server, but from the client's perspective I'd think this should not matter.

My question is: is this a fair test? And if it is, any idea how I can get the client not to block? I've been playing around with client-side settings, with no success. I've tried these settings (10 sec):

    conf.setInt("hbase.rpc.timeout", 10000);
    conf.setInt("hbase.client.operation.timeout", 10000);

I've also tried these:

    HBaseClient.setSocketTimeout(this.conf, 10000);
    HBaseClient.setPingInterval(this.conf, 10000);

This is the jstack output of my application after I "STOP" the region server:

"main" prio=10 tid=0x000000005c812000 nid=0x594e in Object.wait() [0x00000000410c4000]
   java.lang.Thread.State: WAITING (on object monitor)
    at java.lang.Object.wait(Native Method)
    - waiting on <0x00002aaae205ee80> (a org.apache.hadoop.hbase.ipc.HBaseClient$Call)
    at java.lang.Object.wait(Object.java:485)
    at org.apache.hadoop.hbase.ipc.HBaseClient.call(HBaseClient.java:904)
    - locked <0x00002aaae205ee80> (a org.apache.hadoop.hbase.ipc.HBaseClient$Call)
    at org.apache.hadoop.hbase.ipc.WritableRpcEngine$Invoker.invoke(WritableRpcEngine.java:150)
    at $Proxy4.openScanner(Unknown Source)
    at org.apache.hadoop.hbase.client.ScannerCallable.openScanner(ScannerCallable.java:120)
    at org.apache.hadoop.hbase.client.ScannerCallable.call(ScannerCallable.java:76)
    at org.apache.hadoop.hbase.client.ScannerCallable.call(ScannerCallable.java:39)
    at org.apache.hadoop.hbase.client.HConnectionManager$HConnectionImplementation.getRegionServerWithRetries(HConnectionManager.java:1325)
    at org.apache.hadoop.hbase.client.HTable$ClientScanner.nextScanner(HTable.java:1246)
    at org.apache.hadoop.hbase.client.HTable$ClientScanner.initialize(HTable.java:1169)
    at org.apache.hadoop.hbase.client.HTable.getScanner(HTable.java:670)
    at org.apache.hadoop.hbase.client.HTablePool$PooledHTable.getScanner(HTablePool.java:381)
    at org.oclc.higgins.hbase.util.HBaseUtils.getHBaseRegions(HBaseUtils.java:95)
    at org.oclc.higgins.hbase.snoop.Snoop.getCatalogRowsGroupedByRegionServer(Snoop.java:392)
    at org.oclc.higgins.hbase.snoop.Snoop.watch(Snoop.java:318)
    at org.oclc.higgins.hbase.snoop.Snoop.main(Snoop.java:278)
    at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
    at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39)
    at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
    at java.lang.reflect.Method.invoke(Method.java:597)
    at org.apache.hadoop.util.RunJar.main(RunJar.java:197)
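
For reference, here is a minimal sketch of a purely client-side guard: run the blocking call on a worker thread and stop waiting after a deadline. It does not use any HBase API. The class name ScanTimeoutGuard and the method callWithTimeout are hypothetical names chosen for illustration, and the sleeping Callable in main() only stands in for a call such as HTable.getScanner() against a stopped region server. It bounds how long the caller waits, but it does not establish whether the interrupt actually unblocks the underlying RPC thread, so treat it as a workaround under those assumptions rather than a fix.

```java
import java.util.concurrent.Callable;
import java.util.concurrent.ExecutionException;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.Future;
import java.util.concurrent.TimeUnit;
import java.util.concurrent.TimeoutException;

// Hypothetical helper, not part of HBase: bounds how long the caller waits
// for a blocking call by running it on a separate daemon thread.
final class ScanTimeoutGuard {

    public static <T> T callWithTimeout(Callable<T> call, long timeoutMs) throws Exception {
        ExecutorService executor = Executors.newSingleThreadExecutor(r -> {
            Thread t = new Thread(r, "scan-guard");
            t.setDaemon(true); // a stuck worker must not keep the JVM alive
            return t;
        });
        Future<T> future = executor.submit(call);
        try {
            // Wait at most timeoutMs for the result.
            return future.get(timeoutMs, TimeUnit.MILLISECONDS);
        } catch (TimeoutException e) {
            // Ask the worker to stop; this only helps if the call honors interrupts.
            future.cancel(true);
            throw e;
        } catch (ExecutionException e) {
            // Surface the original failure from the wrapped call where possible.
            Throwable cause = e.getCause();
            if (cause instanceof Exception) {
                throw (Exception) cause;
            }
            throw e;
        } finally {
            executor.shutdownNow();
        }
    }

    public static void main(String[] args) throws Exception {
        // Stand-in for a blocking call such as table.getScanner(scan).
        Callable<String> stuckCall = () -> {
            Thread.sleep(60_000);
            return "scanner";
        };
        try {
            callWithTimeout(stuckCall, 500);
        } catch (TimeoutException e) {
            System.out.println("gave up waiting after 500 ms");
        }
    }
}
```

In the real application the Callable would wrap the getScanner() call, and the caller would have to decide what to do with the abandoned worker thread if the interrupt has no effect.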