Return-Path: X-Original-To: apmail-couchdb-user-archive@www.apache.org Delivered-To: apmail-couchdb-user-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 0E5251026B for ; Sun, 15 Sep 2013 18:04:59 +0000 (UTC) Received: (qmail 70642 invoked by uid 500); 15 Sep 2013 18:04:55 -0000 Delivered-To: apmail-couchdb-user-archive@couchdb.apache.org Received: (qmail 70603 invoked by uid 500); 15 Sep 2013 18:04:55 -0000 Mailing-List: contact user-help@couchdb.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@couchdb.apache.org Delivered-To: mailing list user@couchdb.apache.org Received: (qmail 70595 invoked by uid 99); 15 Sep 2013 18:04:54 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Sun, 15 Sep 2013 18:04:54 +0000 X-ASF-Spam-Status: No, hits=-0.7 required=5.0 tests=RCVD_IN_DNSWL_LOW,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (nike.apache.org: domain of dch@jsonified.com designates 209.85.215.49 as permitted sender) Received: from [209.85.215.49] (HELO mail-la0-f49.google.com) (209.85.215.49) by apache.org (qpsmtpd/0.29) with ESMTP; Sun, 15 Sep 2013 18:04:48 +0000 Received: by mail-la0-f49.google.com with SMTP id ev20so2389949lab.36 for ; Sun, 15 Sep 2013 11:04:27 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=jsonified.com; s=google; h=mime-version:in-reply-to:references:date:message-id:subject:from:to :content-type; bh=0YVM3ljIlE5valINlKAWcPwIoV0IisesVdELN7ar1F8=; b=RtQ8cY7r65oa4hCSQ2fFUct/NupavMy1OS0JCRrOkmDZIMjVPx0sUcj2fAKJeL65Vl e0GaUQFgE/bmAw3Frjgo/4sii06EVtHFYN/kRRi6+2+abg5PT/GyBlqQhyDe6k7tE6Qs D+OF36YZpvHOm9itNaynye5SFzFokKQxLFiVM= X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20130820; h=x-gm-message-state:mime-version:in-reply-to:references:date :message-id:subject:from:to:content-type; bh=0YVM3ljIlE5valINlKAWcPwIoV0IisesVdELN7ar1F8=; b=Jszo9VH6cuYYEzSZULJjUYQN2B1zGXBxeoQuK8ds05PTwLfj0j6/bAK6oSj0b6sj5w 4j9/hSwum/cl1WQ0ZJNIpJ/bjmb85Zbg6DN7iAlbODA3rlmg296oJIYCy8b2HDXrKCz6 MmmbyJLfFHEdKVHF0SjLBnuibUr81gEfr1cwzpRrLhTB/EtDcgtcl+NDqiEb0tv1rhAt n7rzeE66D6uBRJL7oIQTajEDqZ/TfUHj7BgQCJuYVtBTDcOQQ0ejoROxlJ+nw3hZP6Oj bgdpXg0lIhJPBt3cqib/3QgKnM9pYD2gWsK15KDT63tcIbRP+Kg4rIgQb5clHtqZCTNA kKqA== X-Gm-Message-State: ALoCoQnDtWrchZiTbIQotCUZOwzg7kv3lMzD4J6KtQqP/G2Ux4X8HEQBS+2YT1vhoGpWuh6gcTA4 MIME-Version: 1.0 X-Received: by 10.112.0.242 with SMTP id 18mr21275053lbh.18.1379268267156; Sun, 15 Sep 2013 11:04:27 -0700 (PDT) Received: by 10.112.4.232 with HTTP; Sun, 15 Sep 2013 11:04:27 -0700 (PDT) X-Originating-IP: [84.112.19.176] In-Reply-To: References: <20130913222006.GD2125@translab.its.uci.edu> <20130915031459.GF2125@translab.its.uci.edu> Date: Sun, 15 Sep 2013 20:04:27 +0200 Message-ID: Subject: Re: couchdb crashes silently From: Dave Cottlehuber To: user@couchdb.apache.org Content-Type: text/plain; charset=UTF-8 X-Virus-Checked: Checked by ClamAV on apache.org NIF scheduler issues could be a reasonable suspect; heart: Fri Sep 13 20:59:36 2013: heart-beat time-out, no activity for 15 seconds 15 seconds is a *long* time however. 1.4.0 needs 14B04 or higher I think due to one of our dependencies, so I'd suggest reverting back to that & seeing if you are having any other issues. Also, probably unrelated, why is kernel polling disabled? And also likely unrelated, what sort of boxes are these running on, and and are your baseline performance / throughput metrics holding up? On 15 September 2013 15:59, Robert Newson wrote: > But, again, R15 is also new enough to have scheduler problems, if that turns out to be your problem then this change should also fail the same way. I trust R14B01 through extensive punishment, and recommend it. > > B. > > On 15 Sep 2013, at 04:14, James Marca wrote: > >> eacce >