Return-Path: X-Original-To: archive-asf-public-internal@cust-asf2.ponee.io Delivered-To: archive-asf-public-internal@cust-asf2.ponee.io Received: from cust-asf.ponee.io (cust-asf.ponee.io [163.172.22.183]) by cust-asf2.ponee.io (Postfix) with ESMTP id EDF9E200B9B for ; Wed, 28 Sep 2016 05:58:05 +0200 (CEST) Received: by cust-asf.ponee.io (Postfix) id EC70C160AE5; Wed, 28 Sep 2016 03:58:05 +0000 (UTC) Delivered-To: archive-asf-public@cust-asf.ponee.io Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by cust-asf.ponee.io (Postfix) with SMTP id 175AC160AD2 for ; Wed, 28 Sep 2016 05:58:04 +0200 (CEST) Received: (qmail 60119 invoked by uid 500); 28 Sep 2016 03:58:04 -0000 Mailing-List: contact dev-help@tephra.incubator.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: dev@tephra.incubator.apache.org Delivered-To: mailing list dev@tephra.incubator.apache.org Received: (qmail 60107 invoked by uid 99); 28 Sep 2016 03:58:04 -0000 Received: from pnap-us-west-generic-nat.apache.org (HELO spamd1-us-west.apache.org) (209.188.14.142) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 28 Sep 2016 03:58:03 +0000 Received: from localhost (localhost [127.0.0.1]) by spamd1-us-west.apache.org (ASF Mail Server at spamd1-us-west.apache.org) with ESMTP id 8C082CA0E2 for ; Wed, 28 Sep 2016 03:58:03 +0000 (UTC) X-Virus-Scanned: Debian amavisd-new at spamd1-us-west.apache.org X-Spam-Flag: NO X-Spam-Score: 0.898 X-Spam-Level: X-Spam-Status: No, score=0.898 tagged_above=-999 required=6.31 tests=[DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, FREEMAIL_REPLY=1, RCVD_IN_DNSWL_NONE=-0.0001, RCVD_IN_MSPIKE_H2=-0.001, SPF_PASS=-0.001] autolearn=disabled Authentication-Results: spamd1-us-west.apache.org (amavisd-new); dkim=pass (2048-bit key) header.d=gmail.com Received: from mx2-lw-eu.apache.org ([10.40.0.8]) by localhost (spamd1-us-west.apache.org [10.40.0.7]) (amavisd-new, port 10024) with ESMTP id 8VdOzp3Cw2kr for ; Wed, 28 Sep 2016 03:58:00 +0000 (UTC) Received: from mail-pf0-f171.google.com (mail-pf0-f171.google.com [209.85.192.171]) by mx2-lw-eu.apache.org (ASF Mail Server at mx2-lw-eu.apache.org) with ESMTPS id DED9A5F56F for ; Wed, 28 Sep 2016 03:57:59 +0000 (UTC) Received: by mail-pf0-f171.google.com with SMTP id 21so12703518pfy.0 for ; Tue, 27 Sep 2016 20:57:59 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=subject:to:references:from:message-id:date:user-agent:mime-version :in-reply-to:content-transfer-encoding; bh=b8bNojQjzW+ja0Wv0sHFSJNgEpG99UtnzdbDnvPzMKA=; b=IPMfA7kcXWK9pY49nu82jgFXRYxvA6E2c7Hg50cv3n7yUoRBfvNyFtwxD2bDwlCBJ0 hBd0G6b8oZQNI26PbglICqQtzO99T5RnBIxEHV8IbjDp3aFpXwVZlT1PYRtKX/TIR52+ LLHsoYQq2XIBLLWLy99eeWzt48LVtoh7pPAY4Hi77d5i348L/hUBC0KgBYy1Ystcft3A TwypfctzPCrIg2amm3eX4PTHrxHwEiaKNxNPwpGS+rctz+5yT7fISMeerseSZt/F7UaE 0hsaUHE8snNf2+3SDpMCieQewC0NFPoUWXabALZs4AZZq+v9+scoJa/JgAGiP31leQqj JcVg== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20130820; h=x-gm-message-state:subject:to:references:from:message-id:date :user-agent:mime-version:in-reply-to:content-transfer-encoding; bh=b8bNojQjzW+ja0Wv0sHFSJNgEpG99UtnzdbDnvPzMKA=; b=j8YUiUCnsOvOYm8/YlgCzBP/6VV6ZMVCZPLQdSv6WNW8OT3/6+HME9TViPlKGxdYzf A9bACrN9y94nJH0WHaYgINAKDTNpN3+SexsJbd4+qOJ/j/Mm4yxVjm6+H57JA/WGps9O 6a0MTjVMtsHqBgn2mK/oSbFFUGoPVYX88FjtgTZx+MPXPLPt4Z6Hx1c6aMkMkep46xfV P4TtyJ5ChQ3xQK+MgKhqjPLFgoULQ6GvyHMQRvn1NY4Q3rWxCGO472RLX02+aFHdVUS4 L9ca6FXey7AC8/2b0oKetb4mJp00285kYReW+1Lwkxxd6KOnNYNk+VqT6h8GqlIpNBph FLzw== X-Gm-Message-State: AE9vXwPmq25dsmoiu/F9b75kMax6EwPPgkhfLuwkDRylcOstTmDMiWyZph86AsDLl2Zafw== X-Received: by 10.98.62.4 with SMTP id l4mr52886004pfa.13.1475035071691; Tue, 27 Sep 2016 20:57:51 -0700 (PDT) Received: from [192.168.1.102] (115-64-177-97.static.tpgi.com.au. [115.64.177.97]) by smtp.googlemail.com with ESMTPSA id o5sm8107670pfb.9.2016.09.27.20.57.50 for (version=TLS1_2 cipher=ECDHE-RSA-AES128-GCM-SHA256 bits=128/128); Tue, 27 Sep 2016 20:57:51 -0700 (PDT) Subject: Re: Tephra unreliable in CI environments To: dev@tephra.incubator.apache.org References: <564cf774-cb6d-5a71-2bb2-e2488745d0a2@gmail.com> <10542db7-26fc-83cd-8126-9a34644bc484@gmail.com> From: F21 Message-ID: Date: Wed, 28 Sep 2016 13:57:46 +1000 User-Agent: Mozilla/5.0 (Windows NT 10.0; WOW64; rv:45.0) Gecko/20100101 Thunderbird/45.3.0 MIME-Version: 1.0 In-Reply-To: Content-Type: text/plain; charset=utf-8; format=flowed Content-Transfer-Encoding: 7bit archived-at: Wed, 28 Sep 2016 03:58:06 -0000 Hi Poorna, Thanks, that narrows down the problem. I was spinning up a few VMs with various versions of Ubuntu and their kernels, but that didn't shed any light on the problem. You mentioned that adding the guava-13 jar to the classpath before the guava-12 jar would be a workaround. I am using Phoenix 4.8.1-rc0, so what would be the best way to do this? Cheers, Francis On 28/09/2016 1:43 PM, Poorna Chandra wrote: > Hi Francis, > > This is due to guava-12 vs guava-13 incompatibility [1]. Tephra depends on > guava-13 and HBase depends on guava-12. Depending on how the OS orders the > jars in the classpath, sometimes guava-12 may appear earlier in the > classpath. This leads to the NoSuchMethodError exception. We are planning > on removing Tephra's dependency on guava-13 in the next release. Until then > a workaround is to add guava-13 jar into the classpath before guava-12 jar. > > Thanks, > Poorna. > > [1] - https://issues.apache.org/jira/browse/TEPHRA-181 > > > On Tue, Sep 27, 2016 at 10:09 PM, F21 wrote: > >> Hi Poorna, >> >> That would be very helpful! Unfortunately, I ran into the same issue where >> the image no longer works correct on my dev environment, but works properly >> on travis. >> >> I am not receiving this error: >> >> java.lang.NoSuchMethodError: >> co.cask.tephra.TransactionManager.addListener(Lcom/google/co >> mmon/util/concurrent/Service$Listener;Ljava/util/concurrent/Executor;)V >> at >> co.cask.tephra.distributed.TransactionService$1.leader(Trans >> actionService.java:83) >> at >> org.apache.twill.internal.zookeeper.LeaderElection.becomeLea >> der(LeaderElection.java:229) >> at >> org.apache.twill.internal.zookeeper.LeaderElection.access$ >> 1800(LeaderElection.java:53) >> at >> org.apache.twill.internal.zookeeper.LeaderElection$5.onSucce >> ss(LeaderElection.java:207) >> at >> org.apache.twill.internal.zookeeper.LeaderElection$5.onSucce >> ss(LeaderElection.java:186) >> at >> com.google.common.util.concurrent.Futures$5.run(Futures.java:768) >> at >> java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPool >> Executor.java:1142) >> at >> java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoo >> lExecutor.java:617) >> at java.lang.Thread.run(Thread.java:745) >> >> This is the exact same problem I encountered and posted on the list about >> previously: https://mail-archives.apache.org/mod_mbox/phoenix-user/20160 >> 3.mbox/%3C56FC727E.30906@gmail.com%3E >> >> It's puzzling that an identical image will behave differently on different >> systems. Does tephra use any kernel functionalities directly? >> >> Cheers, >> Francis >> >> >> On 28/09/2016 12:04 PM, Poorna Chandra wrote: >> >>> Hi Francis, >>> >>> Tephra startup script redirects all logs to a file by default. To help >>> debug such issues, you could update the Tephra startup script [1] to log >>> everything to stdout instead of a file when running inside docker. We are >>> also planning on adding a --foreground option to Tephra startup script, in >>> which case the logs will be written to stdout directly. This will help in >>> debugging in future. >>> >>> Thanks, >>> Poorna. >>> >>> [1] - https://github.com/apache/incubator-tephra/blob/master/bin/ >>> tephra#L175 >>> >>> >>> On Tue, Sep 27, 2016 at 12:59 AM, F21 wrote: >>> >>> I was able to get it to run reliably with Phoenix 4.8.1-rc0. Another part >>>> of the equation was forcing travis to use their Ubuntu 14.04 environment >>>> rather than the default 12.04 environment. I am assuming 12.04 had an >>>> older >>>> kernel which prevent docker images from working correctly. >>>> >>>> On 27/09/2016 10:15 AM, F21 wrote: >>>> >>>> Hi all, >>>>> I have created a docker image containing HBase 1.2.3 and Phoenix 4.8.0. >>>>> See: https://github.com/Boostport/hbase-phoenix-all-in-one >>>>> >>>>> When running tests against the image on my machine, tephra works >>>>> perfectly. >>>>> >>>>> However, tephra seems to be unreliable in CI environments. It seems that >>>>> the tx service is not discovered: >>>>> >>>>> RuntimeException: java.lang.Exception: Thrift error for >>>>> org.apache.tephra.distributed.TransactionServiceClient$2@3e9e291e: >>>>> Unable to discover tx service. -> Exception: Thrift error for >>>>> org.apache.tephra.distributed.TransactionServiceClient$2@3e9e291e: >>>>> Unable to discover tx service. -> TException: Unable to discover tx >>>>> service. >>>>> >>>>> Here's a build on wercker which shows tephra failing: >>>>> https://app.wercker.com/boostport/avatica/runs/build/57e9b5d >>>>> 170a35501008402b4?step=57e9b5f72c15ad000127a534 >>>>> >>>>> I also tried travis, but the same issue occurs: >>>>> https://travis-ci.org/Boostport/avatica/builds/162952367 >>>>> >>>>> Since I am unable to ssh into those docker container on wercker or >>>>> travis, it is hard to debug what's causing tephra to fail. I am hoping >>>>> that >>>>> the issue is related to TEPHRA-179 (and a few other JIRAs related to it) >>>>> which I reported a few weeks ago and has since been fixed. >>>>> >>>>> Has anyone else ran into similar problems? I would love to hear your >>>>> thoughts. >>>>> >>>>> Cheers, >>>>> >>>>> Francis >>>>> >>>>> >>>>>