From: Randall Leeds
To: dev@couchdb.apache.org
Date: Tue, 10 Aug 2010 03:04:28 -0700
Subject: Re: data recovery tool progress

I just pushed some more changes to my branch at
http://github.com/tilgovi/couchdb/tree/db_repair

At this point the algorithm is parallelized to a configurable degree (right now defined as ?WORKERS at the top). On testwritesdb it seems to stop at around 32M of written docs, but the process keeps working for a long time afterwards (I didn't let it finish). I suspect this is because we have already found all (or nearly all) of our documents by descending from the roots we've already seen. It could also be that Adam's term checking code doesn't quite catch all the cases where terms are split over block boundary markers.

Sleep time.

Randall
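
To illustrate the point about terms split over block boundary markers, here is a minimal Python sketch (not the Erlang code in the branch). It assumes a layout in which the file is divided into fixed-size blocks and the first byte of every block is a marker that is not part of the stored data. The 4096-byte block size and the name strip_block_markers are assumptions made for illustration only.

```python
BLOCK_SIZE = 4096  # assumed block size; one marker byte opens each block


def strip_block_markers(raw: bytes, start: int, block_size: int = BLOCK_SIZE) -> bytes:
    """Return `raw` with the marker byte at every block boundary removed.

    `raw` is a span of bytes read from file offset `start`. Any byte whose
    absolute file offset is a multiple of `block_size` is treated as a
    marker and skipped; everything else is copied through unchanged.
    """
    out = bytearray()
    pos = 0          # index into raw
    offset = start   # absolute offset in the file
    while pos < len(raw):
        if offset % block_size == 0:
            # Sitting exactly on a block boundary: skip the marker byte.
            pos += 1
            offset += 1
            continue
        # Copy everything up to (but not including) the next boundary.
        room = block_size - (offset % block_size)
        chunk = raw[pos:pos + room]
        out += chunk
        pos += len(chunk)
        offset += len(chunk)
    return bytes(out)
```

A scanner that tries to decode a term from an arbitrary offset would need to apply something like this before decoding. Otherwise any term that happens to straddle a boundary carries a stray byte in the middle and fails the check, even though the term itself is intact.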