Return-Path: X-Original-To: archive-asf-public-internal@cust-asf2.ponee.io Delivered-To: archive-asf-public-internal@cust-asf2.ponee.io Received: from cust-asf.ponee.io (cust-asf.ponee.io [163.172.22.183]) by cust-asf2.ponee.io (Postfix) with ESMTP id BFA58200CDE for ; Tue, 25 Jul 2017 01:50:18 +0200 (CEST) Received: by cust-asf.ponee.io (Postfix) id BDE5A163AAB; Mon, 24 Jul 2017 23:50:18 +0000 (UTC) Delivered-To: archive-asf-public@cust-asf.ponee.io Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by cust-asf.ponee.io (Postfix) with SMTP id DCCFC163AA9 for ; Tue, 25 Jul 2017 01:50:17 +0200 (CEST) Received: (qmail 85362 invoked by uid 500); 24 Jul 2017 23:50:17 -0000 Mailing-List: contact dev-help@impala.incubator.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: dev@impala.incubator.apache.org Delivered-To: mailing list dev@impala.incubator.apache.org Received: (qmail 85350 invoked by uid 99); 24 Jul 2017 23:50:16 -0000 Received: from pnap-us-west-generic-nat.apache.org (HELO spamd4-us-west.apache.org) (209.188.14.142) by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 24 Jul 2017 23:50:16 +0000 Received: from localhost (localhost [127.0.0.1]) by spamd4-us-west.apache.org (ASF Mail Server at spamd4-us-west.apache.org) with ESMTP id 57F8EC006E for ; Mon, 24 Jul 2017 23:50:16 +0000 (UTC) X-Virus-Scanned: Debian amavisd-new at spamd4-us-west.apache.org X-Spam-Flag: NO X-Spam-Score: 1.879 X-Spam-Level: * X-Spam-Status: No, score=1.879 tagged_above=-999 required=6.31 tests=[DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, HTML_MESSAGE=2, RCVD_IN_DNSWL_NONE=-0.0001, RCVD_IN_MSPIKE_H3=-0.01, RCVD_IN_MSPIKE_WL=-0.01, SPF_PASS=-0.001] autolearn=disabled Authentication-Results: spamd4-us-west.apache.org (amavisd-new); dkim=pass (2048-bit key) header.d=cloudera.com Received: from mx1-lw-us.apache.org ([10.40.0.8]) by localhost (spamd4-us-west.apache.org [10.40.0.11]) (amavisd-new, port 10024) with ESMTP id 6EBQtemVWLf0 for ; Mon, 24 Jul 2017 23:50:14 +0000 (UTC) Received: from mail-wm0-f42.google.com (mail-wm0-f42.google.com [74.125.82.42]) by mx1-lw-us.apache.org (ASF Mail Server at mx1-lw-us.apache.org) with ESMTPS id CD81B5F340 for ; Mon, 24 Jul 2017 23:50:13 +0000 (UTC) Received: by mail-wm0-f42.google.com with SMTP id t201so31859465wmt.1 for ; Mon, 24 Jul 2017 16:50:13 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=cloudera.com; s=google; h=mime-version:in-reply-to:references:from:date:message-id:subject:to; bh=iK0LDiO2YqflfI4ZsCw3tmTgFo4BzTkHoSYAsrL0Lj8=; b=c+hDoWcjskVnW8YdrdMO6bMMAZQ4+2BNOnd/4+iS3NJf/Mx4n6xE/7hj6AD6E+kgjn qVYWAKxnOEHkUxgJbosKnIitJsP5mDew6fLODrYWw4R24AxwNfZEbjHYIdzoSOJVNykU aDP0dw6U9XKmZvA3RFBMt9+EBjBDedU+12CtkWRd0c0m0mTOq81eRVElQdYk+1iS5+pn LkcE0MVkCwH2YFe9OQQ8bkZ/mZwosIWsRKxyjerPsXRWfhGxc3+7LecI7Nvrv7N3jPxG CCl3+Q8xa3hkftScZw2tv1aK63vODtFCSGezR6Y/x/ClA/DMI7LbqPDj0akbxWaodwop 8FvA== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:mime-version:in-reply-to:references:from:date :message-id:subject:to; bh=iK0LDiO2YqflfI4ZsCw3tmTgFo4BzTkHoSYAsrL0Lj8=; b=OE9SsxYGjLEqlXE8cAxpgw2KOCuL6DChy4TfU0voqTkON3TqDjyMU88IdDdQWX3CWB bbxYAOeriqn7T3le1DP8+a33WXuSEU/eb/z4Btm92IPRmFYDMsDhYEdXzpGQKuVu2bwp x/HAKeJNQnzsagYH6vLBB+Xh9XE0M73P61+nM0UeUAzlkJVsF7QKpxSnOVfzYVaX2qtK PSvIFrcAFrnpcwyxuCdjz+IA4jYbzenMxibYIiMHRWAMQHVDZQ+xkOgYY+yte3mBol/8 pOO1yUDuXBDN7rlJELoN4Az5vziReSxpPpfHJ+oBjlEb0JxjB0qeA1dCuABj9SFTPBBf icNg== X-Gm-Message-State: AIVw113S+GdYK/xtDkf/iRpMOHlZI2X58QlA8/tre76v62ZiLk8k8bQ/ IC+39sFGQWDOaKB2k/KDeSl6kGA3qnQF X-Received: by 10.28.189.68 with SMTP id n65mr3746689wmf.15.1500940212534; Mon, 24 Jul 2017 16:50:12 -0700 (PDT) MIME-Version: 1.0 Received: by 10.223.139.3 with HTTP; Mon, 24 Jul 2017 16:49:42 -0700 (PDT) In-Reply-To: References: From: Tim Armstrong Date: Mon, 24 Jul 2017 16:49:42 -0700 Message-ID: Subject: Re: IMPALA-5702 - disable shared linking on jenkins? To: "dev@impala" Content-Type: multipart/alternative; boundary="001a114b0e6032abe4055518e09b" archived-at: Mon, 24 Jul 2017 23:50:18 -0000 --001a114b0e6032abe4055518e09b Content-Type: text/plain; charset="UTF-8" I vote for changing Jenkins' linking strategy now and not changing it back :). Static linking is the blessed configuration so I think we should be running tests with that primarily. On Mon, Jul 24, 2017 at 4:34 PM, Henry Robinson wrote: > On 24 July 2017 at 13:58, Todd Lipcon wrote: > > > On Mon, Jul 24, 2017 at 1:47 PM, Henry Robinson > wrote: > > > > > Thanks for the asan pointer - I'll give it a go. > > > > > > My understanding of linking isn't deep, but my working theory has been > > that > > > the complications have been caused by glog getting linked twice - once > > > statically (possibly into libkudu.so), and once dynamically (via > everyone > > > else). > > > > > > > In libkudu_client.so, we use a linker script to ensure that we don't leak > > glog/gflags/etc symbols. Those are all listed as 'local' in > > src/kudu/client/symbols.map. We also have a unit test > > 'client_symbol-test.sh' which uses nm to dump the list of symbols and > make > > sure that they all non-local non-weak symbols are under the 'kudu::' > > namespace. > > > > So it's possible that something's getting linked twice but I'd be > somewhat > > surprised if it's from the Kudu client. > > > > > Good to know, thanks. > > ASAN hasn't turned up anything yet - so does anyone have an opinion about > changing Jenkins' linking strategy for now? > > > > -Todd > > > > > > > > > > I would think that could lead to one or both of the issues you linked > to. > > > > > > > > > On 24 July 2017 at 13:39, Todd Lipcon wrote: > > > > > > > Is it possible that the issue here is due to a "one definition rule" > > > > violation? eg something like > > > > https://github.com/google/sanitizers/wiki/AddressSanitizerOn > > > > eDefinitionRuleViolation > > > > Another similar thing is described here: > > > > https://github.com/google/sanitizers/wiki/AddressSanitizerIn > > > > itializationOrderFiasco > > > > > > > > ASAN with the appropriate flags might help expose if one of the above > > is > > > > related. > > > > > > > > I wonder whether it is a kind of coincidence that it is fine in a > > static > > > > build but causes problems in dynamic, and at some point the static > link > > > > order may slightly shift, causing another new subtle bug. > > > > > > > > > > > > > > > > On Mon, Jul 24, 2017 at 1:22 PM, Henry Robinson > > > wrote: > > > > > > > > > We've started seeing isolated incidences of IMPALA-5702 during > GVOs, > > > > where > > > > > a custom cluster test fails by throwing an exception during locale > > > > > handling. > > > > > > > > > > I've been able to reproduce this locally, but only with shared > > linking > > > > > enabled (which makes sense since the issue is symptomatic of a > global > > > > c'tor > > > > > not getting called the right number of times). > > > > > > > > > > It's probable that my patch for IMPALA-5659 exposed this (since it > > > > forced a > > > > > more correct linking strategy for thirdparty libraries when dynamic > > > > linking > > > > > was enabled), but it looks to me at first glance like there were > > latent > > > > > dynamic linking bugs that we weren't getting hit by. Fixing > > IMPALA-5702 > > > > > will probably take a while, and I don't think we should hold up > GVOs > > or > > > > put > > > > > them at risk. > > > > > > > > > > So there are two options: > > > > > > > > > > 1. Revert IMPALA-5659 > > > > > > > > > > 2. Switch GVO to static linking > > > > > > > > > > IMPALA-5659 is important to commit the kudu util library, which is > > > needed > > > > > for the KRPC work. Without it, shared linking doesn't work *at all* > > > when > > > > > the kudu util library is committed. > > > > > > > > > > Static linking doesn't take much longer in my unscientific > > > measurements, > > > > > and is closer to how Impala is actually used. In the interest of > > > forward > > > > > progress I'd like to try switching ubuntu-14.04-from-scratch to use > > > > static > > > > > linking while I work on IMPALA-5702. > > > > > > > > > > What does everyone else think? > > > > > > > > > > Henry > > > > > > > > > > > > > > > > > > > > > -- > > > > Todd Lipcon > > > > Software Engineer, Cloudera > > > > > > > > > > > -- > > Todd Lipcon > > Software Engineer, Cloudera > > > --001a114b0e6032abe4055518e09b--