Subject: Re: Couchdb trunk purge_docs timeout
From: Jan Lehnardt
Date: Sat, 16 Apr 2011 15:18:01 +0200
In-Reply-To: <1302794092.5706.23.camel@mike.loop.com.br>
References: <1302794092.5706.23.camel@mike.loop.com.br>
Reply-To: user@couchdb.apache.org
Cc: dev@couchdb.apache.org
To:
 user@couchdb.apache.org

Hi Mike,

we did a fix in this area recently that affected purging of docs in
conflict:

  http://svn.apache.org/viewvc?rev=1086241&view=rev

A couple of reviewers deemed the patch safe, but this is a seldom
exercised part of the code, so we may have introduced your issue.

Can you provide us with a reproducing script that maybe doesn't depend
on 56m docs? :)

Also, can you paste the full error stack trace?

A few more questions:

 - Is replication involved here?
 - Do you have more I/O than before on the system?

CCing dev@.

Cheers
Jan
-- 

On 14 Apr 2011, at 17:14, Mike Leddy wrote:

> Hi,
> 
> I have a couch node currently using version 1.2.0abaa0e30-git. I decided
> to try a database maintenance task that I formerly used on
> couchdb 1.0.2 to purge documents in batches of 500 on a database that
> contains some 56 million documents.
> 
> From what I can gather from the logs, the call to purge_docs is timing
> out after 5 seconds and terminating.
> 
> [Wed, 13 Apr 2011 20:25:57 GMT] [info] [<0.5192.19>] 172.17.17.3 - - GET /iris/_design/tidy/_view/conflicts?limit=0 200
> [Wed, 13 Apr 2011 20:26:02 GMT] [error] [<0.5194.19>] Uncaught error in HTTP request: {exit,
>     {timeout,
>         {gen_server,call,
>             [<0.150.0>,
>              {purge_docs,
>                  [{<<"1294099271F6261">>,
>                    [{1,
>                      <<181,64,95,54,247,104,56,34,109,228,7,108,250,72,57,190>>}]},
>                   {<<"1294099281F7327">>,
>                    [{1,
>                      <<80,246,15,155,182,61,43,238,207,43,159,136,178,134,137,214>>}]},
> ... removed for brevity ....
> [Wed, 13 Apr 2011 20:26:02 GMT] [info] [<0.5194.19>] Stacktrace: [{io_lib_pretty,cind_tag_tuple,7},
>     {io_lib_pretty,while_fail,3},
>     {io_lib_pretty,print,6},
>     {io_lib_format,build,3},
>     {io_lib_format,build,3},
>     {io_lib_format,build,3},
>     {io_lib_format,build,3},
>     {io_lib_format,build,3}]
> [Wed, 13 Apr 2011 20:26:02 GMT] [info] [<0.5194.19>] 172.17.17.3 - - POST /iris/_purge 500
> 
> I am pretty sure that this was not the case with 1.0.2. Does anyone
> have any insight regarding what is the root cause of the problem?
> 
> Meanwhile I'm digging through the code looking for clues....
> 
> Thanks,
> 
> Mike
> 
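
For anyone trying to reproduce the batched purge described in the quoted message, here is a minimal sketch in Python. It is not Mike's actual maintenance task, which the thread does not show. It assumes the documented request shape for POST /{db}/_purge (a JSON object mapping each document ID to a list of revision strings), and the helper names (batched, purge_body, purge), the base URL, and the source of the (doc_id, rev) pairs are all hypothetical.

```python
import json
from itertools import islice
from urllib import request


def batched(pairs, size):
    """Yield successive lists of at most `size` items from `pairs`."""
    it = iter(pairs)
    while True:
        chunk = list(islice(it, size))
        if not chunk:
            return
        yield chunk


def purge_body(chunk):
    """Group (doc_id, rev) pairs into the {doc_id: [rev, ...]} purge body."""
    body = {}
    for doc_id, rev in chunk:
        body.setdefault(doc_id, []).append(rev)
    return body


def purge(base_url, db, pairs, size=500, timeout=60):
    """POST one _purge request per batch and yield each decoded response.

    `base_url` and `db` are placeholders for your own server and database;
    `timeout` is the client-side HTTP timeout in seconds, not the
    server-side gen_server call timeout seen in the log.
    """
    for chunk in batched(pairs, size):
        req = request.Request(
            f"{base_url}/{db}/_purge",
            data=json.dumps(purge_body(chunk)).encode("utf-8"),
            headers={"Content-Type": "application/json"},
            method="POST",
        )
        with request.urlopen(req, timeout=timeout) as resp:
            yield json.load(resp)
```

Iterating over purge("http://127.0.0.1:5984", "iris", pairs) would send one request per 500 pairs; shrinking `size` is one way to check whether the timeout depends on the batch size rather than on the 56 million documents.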