Subject: Re: Intermittent join-response hang
To: dev@geode.apache.org
From: Bruce Schuchardt
Message-ID: <167fc4e2-07da-1d3f-c08e-b91c2f7bd614@pivotal.io>
Date: Thu, 15 Feb 2018 13:44:18 -0800

If it happens again, send comms a full log and maybe we can diagnose the
problem. I haven't had this happen on my Mac.

On 2/15/18 1:35 PM, Kirk Lund wrote:
> I'm intermittently seeing dunit tests hang (they actually fail after
> timing out) at this point in joining a cluster:
>
> [locator] [info 2018/02/15 12:56:25.583 PST tid=71] This member is becoming coordinator
>
> [vm0] [info 2018/02/15 12:56:25.584 PST Connection(1)-10.118.20.39> tid=20] Probable coordinator is still 10.118.33.216(17575:locator):32769 - waiting for a join-response
>
> This happens when running any dunit test that has a cluster in IntelliJ
> on my Mac.
>
> It's like the code just intermittently gets stuck waiting for the
> join-response from the coordinator.
>
> Any ideas if this is specific to Mac or something? I don't see this
> happening in precheckin on Linux.
>