Return-Path: X-Original-To: archive-asf-public-internal@cust-asf2.ponee.io Delivered-To: archive-asf-public-internal@cust-asf2.ponee.io Received: from cust-asf.ponee.io (cust-asf.ponee.io [163.172.22.183]) by cust-asf2.ponee.io (Postfix) with ESMTP id EC272200C39 for ; Thu, 2 Mar 2017 00:18:52 +0100 (CET) Received: by cust-asf.ponee.io (Postfix) id EAB7C160B78; Wed, 1 Mar 2017 23:18:52 +0000 (UTC) Delivered-To: archive-asf-public@cust-asf.ponee.io Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by cust-asf.ponee.io (Postfix) with SMTP id 3EC05160B70 for ; Thu, 2 Mar 2017 00:18:52 +0100 (CET) Received: (qmail 5034 invoked by uid 500); 1 Mar 2017 23:18:51 -0000 Mailing-List: contact issues-help@hbase.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Delivered-To: mailing list issues@hbase.apache.org Received: (qmail 5023 invoked by uid 99); 1 Mar 2017 23:18:51 -0000 Received: from pnap-us-west-generic-nat.apache.org (HELO spamd3-us-west.apache.org) (209.188.14.142) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 01 Mar 2017 23:18:51 +0000 Received: from localhost (localhost [127.0.0.1]) by spamd3-us-west.apache.org (ASF Mail Server at spamd3-us-west.apache.org) with ESMTP id D6F021895B9 for ; Wed, 1 Mar 2017 23:18:50 +0000 (UTC) X-Virus-Scanned: Debian amavisd-new at spamd3-us-west.apache.org X-Spam-Flag: NO X-Spam-Score: -1.547 X-Spam-Level: X-Spam-Status: No, score=-1.547 tagged_above=-999 required=6.31 tests=[KAM_ASCII_DIVIDERS=0.8, RP_MATCHES_RCVD=-2.999, SPF_NEUTRAL=0.652] autolearn=disabled Received: from mx1-lw-eu.apache.org ([10.40.0.8]) by localhost (spamd3-us-west.apache.org [10.40.0.10]) (amavisd-new, port 10024) with ESMTP id JWUzP4HyJRWX for ; Wed, 1 Mar 2017 23:18:49 +0000 (UTC) Received: from mailrelay1-us-west.apache.org (mailrelay1-us-west.apache.org [209.188.14.139]) by mx1-lw-eu.apache.org (ASF Mail Server at mx1-lw-eu.apache.org) with ESMTP id EBD7F5F36B for ; Wed, 1 Mar 2017 23:18:48 +0000 (UTC) Received: from jira-lw-us.apache.org (unknown [207.244.88.139]) by mailrelay1-us-west.apache.org (ASF Mail Server at mailrelay1-us-west.apache.org) with ESMTP id 341DAE0A0D for ; Wed, 1 Mar 2017 23:18:46 +0000 (UTC) Received: from jira-lw-us.apache.org (localhost [127.0.0.1]) by jira-lw-us.apache.org (ASF Mail Server at jira-lw-us.apache.org) with ESMTP id 735502416A for ; Wed, 1 Mar 2017 23:18:45 +0000 (UTC) Date: Wed, 1 Mar 2017 23:18:45 +0000 (UTC) From: "Andrew Purtell (JIRA)" To: issues@hbase.apache.org Message-ID: In-Reply-To: References: Subject: [jira] [Comment Edited] (HBASE-17714) Client heartbeats seems to be broken MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 archived-at: Wed, 01 Mar 2017 23:18:53 -0000 [ https://issues.apache.org/jira/browse/HBASE-17714?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15891268#comment-15891268 ] Andrew Purtell edited comment on HBASE-17714 at 3/1/17 11:18 PM: ----------------------------------------------------------------- e5c1a80 (HBASE-15645 hbase.rpc.timeout is not used in operations of HTable) is the commit that causes RenewLeaseIT to start failing. You have to apply HBASE-16420 (Fix source incompatibility of Table interface) after checking out e5c1a80 to get something that 4.x-HBase-1.1 will compile against. HBASE-15645 was shipped in 1.1.5, 1.2.2, and 1.3.0. What this change does is fix where the client was not actually honoring RPC timeouts prior to the change. [~samarthjain] are you sure RenewLeaseIT actually renews the lease or allows for a client heartbeat to happen before the RPC times out? The test sets a very short RPC timeout (2000ms) but makes no other configuration changes was (Author: apurtell): e5c1a80 (HBASE-15645 hbase.rpc.timeout is not used in operations of HTable) is the commit that causes RenewLeaseIT to start failing. You have to apply HBASE-16420 (Fix source incompatibility of Table interface) after checking out e5c1a80 to get something that 4.x-HBase-1.1 will compile against. HBASE-15645 was shipped in 1.1.5, 1.2.2, and 1.3.0. What this change does is fix where the client was not actually honoring RPC timeouts prior to the change. [~samarthjain] are you sure RenewLeaseIT actually renews the lease before the RPC times out? The test sets a very short RPC timeout (2000ms) > Client heartbeats seems to be broken > ------------------------------------ > > Key: HBASE-17714 > URL: https://issues.apache.org/jira/browse/HBASE-17714 > Project: HBase > Issue Type: Bug > Reporter: Samarth Jain > > We have a test in Phoenix where we introduce an artificial sleep of 2 times the RPC timeout in preScannerNext() hook of a co-processor. > {code} > public static class SleepingRegionObserver extends SimpleRegionObserver { > public SleepingRegionObserver() {} > > @Override > public boolean preScannerNext(final ObserverContext c, > final InternalScanner s, final List results, > final int limit, final boolean hasMore) throws IOException { > try { > if (SLEEP_NOW && c.getEnvironment().getRegion().getRegionInfo().getTable().getNameAsString().equals(TABLE_NAME)) { > Thread.sleep(RPC_TIMEOUT * 2); > } > } catch (InterruptedException e) { > throw new IOException(e); > } > return super.preScannerNext(c, s, results, limit, hasMore); > } > } > {code} > This test was passing fine till 1.1.3 but started failing sometime before 1.1.9 with an OutOfOrderScannerException. See PHOENIX-3702. [~lhofhansl] mentioned that we have client heartbeats enabled and that should prevent us from running into issues like this. FYI, this test fails with 1.2.3 version of HBase too. > CC [~apurtell], [~jamestaylor] -- This message was sent by Atlassian JIRA (v6.3.15#6346)