Return-Path: Delivered-To: apmail-couchdb-dev-archive@www.apache.org Received: (qmail 85010 invoked from network); 26 Feb 2010 09:52:51 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (140.211.11.3) by minotaur.apache.org with SMTP; 26 Feb 2010 09:52:51 -0000 Received: (qmail 24375 invoked by uid 500); 26 Feb 2010 09:52:50 -0000 Delivered-To: apmail-couchdb-dev-archive@couchdb.apache.org Received: (qmail 24292 invoked by uid 500); 26 Feb 2010 09:52:50 -0000 Mailing-List: contact dev-help@couchdb.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: dev@couchdb.apache.org Delivered-To: mailing list dev@couchdb.apache.org Received: (qmail 24276 invoked by uid 99); 26 Feb 2010 09:52:50 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 26 Feb 2010 09:52:50 +0000 X-ASF-Spam-Status: No, hits=-1997.3 required=10.0 tests=ALL_TRUSTED,FS_REPLICA,WEIRD_PORT X-Spam-Check-By: apache.org Received: from [140.211.11.140] (HELO brutus.apache.org) (140.211.11.140) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 26 Feb 2010 09:52:49 +0000 Received: from brutus.apache.org (localhost [127.0.0.1]) by brutus.apache.org (Postfix) with ESMTP id 6B79A29A0018 for ; Fri, 26 Feb 2010 01:52:29 -0800 (PST) Message-ID: <2004382181.551391267177949438.JavaMail.jira@brutus.apache.org> Date: Fri, 26 Feb 2010 09:52:29 +0000 (UTC) From: "Randall Leeds (JIRA)" To: dev@couchdb.apache.org Subject: [jira] Updated: (COUCHDB-597) Replication tasks crash. In-Reply-To: <1826646313.1260661158168.JavaMail.jira@brutus> MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 [ https://issues.apache.org/jira/browse/COUCHDB-597?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Randall Leeds updated COUCHDB-597: ---------------------------------- Attachment: 0001-changes-replication-timeouts-and-att.-fixes-COUCHDB-.patch I went back and made this patch SUPER simple and straightforward. Applies to the very most current trunk. This should take no more than a minute to review; it's super simple now. > Replication tasks crash. > ------------------------ > > Key: COUCHDB-597 > URL: https://issues.apache.org/jira/browse/COUCHDB-597 > Project: CouchDB > Issue Type: Bug > Components: Database Core > Affects Versions: 0.11 > Reporter: Robert Newson > Attachments: couchdb_597.patch > > > If I kick off 10 replication tasks in quick succession, occasionally one or two of the replication tasks will die and not be resumed. It seems that the stat tracking is a little buggy, and under stress can eventually cause a permanent failure of the supervised replication task; > [Fri, 11 Dec 2009 19:00:08 GMT] [error] [<0.80.0>] {error_report,<0.30.0>, > {<0.80.0>,supervisor_report, > [{supervisor,{local,couch_rep_sup}}, > {errorContext,shutdown_error}, > {reason,killed}, > {offender, > [{pid,<0.6700.11>}, > {name,"fcbb13200a1618cf983b347f4d2c9835+create_target"}, > {mfa, > {gen_server,start_link, > [couch_rep, > ["fcbb13200a1618cf983b347f4d2c9835", > {[{<<"create_target">>,true}, > {<<"source">>,<<"http://node:5984/perf-p2">>}, > {<<"target">>,<<"perf-p2">>}]}, > {user_ctx,null,[<<"_admin">>]}], > []]}}, > {restart_type,temporary}, > {shutdown,1}, > {child_type,worker}]}]}} > [Fri, 11 Dec 2009 19:00:08 GMT] [error] [emulator] Error in process <0.6705.11> with exit value: {badarg,[{ets,insert,[stats_hit_table,{{couchdb,open_os_files},-1}]},{couch_stats_collector,decrement,1}]} -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online.