From: Sudheesh Katkam
Subject: Re: epoll disconnections?
Date: Fri, 25 Mar 2016 12:02:12 -0700
To: dev@drill.apache.org

@Jacques, let me check the performance test [1] results from the past
with epoll disabled (or we have to re-run).

@Hanifi, with higher concurrency in performance testing, queries failed
with this error:

java.io.IOException: syscall:read(...)() failed: Connection reset by peer

This is a known issue [2] with the version of Netty that Drill uses. I
don’t think there is a specific ticket, but a search [3] shows a few
relevant tickets.

Thank you,
Sudheesh

[1] https://github.com/mapr/drill-perf-test-framework
[2] https://github.com/netty/netty/issues/3539
[3] https://issues.apache.org/jira/browse/DRILL-3119?jql=project%20%3D%20DRILL%20AND%20text%20~%20%22syscall%3Aread%22

> On Mar 25, 2016, at 11:44 AM, Hanifi Gunes wrote:
>
> I am wondering what the issue and its manifestation was back then. Do we
> have any JIRAs created for this before?
>
>
> Thanks.
> -Hanifi
>
> On Fri, Mar 25, 2016 at 9:57 AM, Jacques Nadeau wrote:
>
>> Hey All,
>>
>> If I recall correctly, many months ago Sudheesh discovered that we were
>> having instability in RPC connections in some situations due to bugs in the
>> epoll implementation that are fixed in a later version of Netty (~4.0.31?).
>> At the time, we shelved switching Netty because it also changed the memory
>> caching behavior (same thread to all thread) which seemed like a high risk
>> change. I thought that as part of this we decided the safest change was to
>> disable epoll RPC in our distribution. However, reviewing drill-env, it
>> doesn't look like we do this. See here [1].
>>
>> Thoughts?
>>
>> [1]
>>
>> https://github.com/apache/drill/blob/master/distribution/src/resources/drill-env.sh#L19
>> --
>> Jacques Nadeau
>> CTO and Co-Founder, Dremio
>>
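
For readers of the archive: disabling epoll in the distribution, as discussed in the thread above, would amount to a startup flag that makes the RPC layer fall back to the NIO transport. Below is a minimal sketch of that selection logic, assuming a boolean JVM system property. The property name `drill.exec.enable-epoll` and the class `TransportChoice` are assumptions for illustration and are not taken from this thread. The sketch does not depend on Netty and only reports which transport would be picked.

```java
import java.util.Locale;

public class TransportChoice {
    // Assumed property name, for illustration only; not confirmed by the thread.
    static final String EPOLL_PROPERTY = "drill.exec.enable-epoll";

    /**
     * Returns "epoll" only when the OS name starts with "linux" and the flag
     * is the string "true" (case-insensitive); returns "nio" in every other
     * case, including when the flag is unset (null).
     */
    static String choose(String osName, String epollFlag) {
        boolean linux = osName != null
                && osName.toLowerCase(Locale.ROOT).startsWith("linux");
        boolean enabled = Boolean.parseBoolean(epollFlag); // null -> false
        return (linux && enabled) ? "epoll" : "nio";
    }

    public static void main(String[] args) {
        String os = System.getProperty("os.name");
        String flag = System.getProperty(EPOLL_PROPERTY); // null when unset
        System.out.println("transport = " + choose(os, flag));
    }
}
```

With this logic, starting the JVM with `-Ddrill.exec.enable-epoll=false`, or without the flag at all, selects NIO on every platform, so a distribution could ship with epoll off and let operators opt in explicitly.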