Return-Path: X-Original-To: apmail-couchdb-dev-archive@www.apache.org Delivered-To: apmail-couchdb-dev-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 01052F7EA for ; Fri, 19 Apr 2013 05:04:28 +0000 (UTC) Received: (qmail 59448 invoked by uid 500); 19 Apr 2013 05:04:26 -0000 Delivered-To: apmail-couchdb-dev-archive@couchdb.apache.org Received: (qmail 59306 invoked by uid 500); 19 Apr 2013 05:04:24 -0000 Mailing-List: contact dev-help@couchdb.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: dev@couchdb.apache.org Delivered-To: mailing list dev@couchdb.apache.org Received: (qmail 59241 invoked by uid 99); 19 Apr 2013 05:04:21 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 19 Apr 2013 05:04:21 +0000 X-ASF-Spam-Status: No, hits=2.2 required=5.0 tests=HTML_MESSAGE,RCVD_IN_DNSWL_NONE,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (athena.apache.org: local policy) Received: from [98.138.91.176] (HELO nm16-vm4.bullet.mail.ne1.yahoo.com) (98.138.91.176) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 19 Apr 2013 05:04:15 +0000 Received: from [98.138.90.48] by nm16.bullet.mail.ne1.yahoo.com with NNFMP; 19 Apr 2013 05:03:54 -0000 Received: from [98.138.87.12] by tm1.bullet.mail.ne1.yahoo.com with NNFMP; 19 Apr 2013 05:03:54 -0000 Received: from [127.0.0.1] by omp1012.mail.ne1.yahoo.com with NNFMP; 19 Apr 2013 05:03:54 -0000 X-Yahoo-Newman-Property: ymail-3 X-Yahoo-Newman-Id: 779920.13260.bm@omp1012.mail.ne1.yahoo.com Received: (qmail 72658 invoked by uid 60001); 19 Apr 2013 05:03:54 -0000 DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=yahoo.com; s=s1024; t=1366347834; bh=JX0oQkCakaOlP88pYcT56DXusvyTe7Gi0HeoJT9/pJc=; h=X-YMail-OSG:Received:X-Rocket-MIMEInfo:X-Mailer:Message-ID:Date:From:Subject:To:In-Reply-To:MIME-Version:Content-Type; b=ei8cVCa45srYWu7npNEa5mkMVnzSxKTVeMRPDW5qa25vx+lvQGOcVvr1LjpKQUgY1bMUiGV+6NP58EHDxNzRcKfI7OubYXbrUaA0S5FdxH1RYH6MriyoHMRh8OTwqdq2NOtneZmJJtQv4m/NlCOMXqmXAmGhnQbKpOIOAP8MwTE= DomainKey-Signature: a=rsa-sha1; q=dns; c=nofws; s=s1024; d=yahoo.com; h=X-YMail-OSG:Received:X-Rocket-MIMEInfo:X-Mailer:Message-ID:Date:From:Subject:To:In-Reply-To:MIME-Version:Content-Type; b=JmrWuKOc/cOtlL8F3qiM0J1OeRwTRD+b9uzrqenMiCE1REawZDil4AAf93Zoj6e+IFt9qrO6CV5kVHtrYVAPQeb/8OSfTCvxmKdTWcvMRG/6hwbv8zxyJDISip8yhDA2E52l4Y2I0UT78aTXY5I8itzoclFd1r84X7xpTkadMSw=; X-YMail-OSG: 8xDvrsMVM1lXRanULmTTOJyR190mm_EurZbPY8bvgWLU_jt .pPhIDJBYpGjvuzWq9HtLn8CYb_7OaDMaIcOGSF02z4CPICQbJ9Dz.NNvZSd HCZ7W6O1J6Bj0tthKJTUshgOsjyOLZrV8q2g_VyW9jjzNpBssDXg4TfHVpKo 1tIGtXhXlZhDbwYn_OkdtGw5GyLsXaSiZQq028wgR7houhAAoRQPcZSuhfrx 7cIvAwHIe_pHD5wvXAfi3Z85XK67MiDhVJIzhJnOp28WbgvFhMcOWIZBwGzD 4xztTb8E8n2lpTjjqKtLDxaQV._dh2NXjccyOwie7Dn39wTx9hIIjiKDQrDV bCsLLEjFrTqXh6_R6O70MCzWurQXbSZ0Rd3DuY42oRWQaA0SZOXRBtRzGj3D ecvo.1jp0zF4SXxO4LUrTRoxLCpjeR3d44bhfJXg_Gxn1DPyf0apfaLL3w3D N5bES.IDPQNlT.74nyfP6FZWUm93TBbIiqzTEl2onOwaT9OYJOymn969Ih7w Jnx_TWo1OphlOnXxkSsirwtDvmgkQDtrAuWff2jGwyg_36SeNT5mz0B1C7g8 KqxTENAruvhYv7qZnSEVtQ_Upx8SiJZcLIm7xU9fI_7tsVfrvdmmh_gSWsHm _QdDlBGbkGCq_RoJMN_ofi038vt8grJns9l.wrtsPBMHrzNu9uB3ZX0gW0YZ hTJtt Received: from [83.251.146.35] by web121006.mail.ne1.yahoo.com via HTTP; Thu, 18 Apr 2013 22:03:54 PDT X-Rocket-MIMEInfo: 002.001,DQpUaGFuayB5b3UgdmVyeSBtdWNoIFZpY3Rvcixob3cgY2FuIGkgb3BlbiB0aGlzIGZvbGRlciwgd2hpY2ggcHJvZ3JhbSBpIG11c3QgdXNlZD8NCi0tLSBPbiBGcmksIDQvMTkvMTMsIFZpY3RvciBOaWNvbGxldCA8dm5pY29sbGV0QHJ1bm9yZy5jb20.IHdyb3RlOg0KDQpGcm9tOiBWaWN0b3IgTmljb2xsZXQgPHZuaWNvbGxldEBydW5vcmcuY29tPg0KU3ViamVjdDogUmU6IENvcnJ1cHRlZCBkYXRhYmFzZSBleGFtcGxlIGZpbGUNClRvOiBkZXZAY291Y2hkYi5hcGFjaGUub3JnDQpEYXRlOiBGcmlkYXksIEEBMAEBAQE- X-Mailer: YahooMailClassic/15.1.7 YahooMailWebService/0.8.141.536 Message-ID: <1366347834.72418.YahooMailClassic@web121006.mail.ne1.yahoo.com> Date: Thu, 18 Apr 2013 22:03:54 -0700 (PDT) From: Ashraf Janan Subject: Re: Corrupted database example file To: dev@couchdb.apache.org In-Reply-To: MIME-Version: 1.0 Content-Type: multipart/alternative; boundary="285087016-811432512-1366347834=:72418" X-Virus-Checked: Checked by ClamAV on apache.org --285087016-811432512-1366347834=:72418 Content-Type: text/plain; charset=iso-8859-1 Content-Transfer-Encoding: quoted-printable Thank you very much Victor,how can i open this folder, which program i must= used? --- On Fri, 4/19/13, Victor Nicollet wrote: From: Victor Nicollet Subject: Re: Corrupted database example file To: dev@couchdb.apache.org Date: Friday, April 19, 2013, 12:17 AM It had happened once on a critical production database (the user database...) so I wrote some code to repair it. And I never throw away any code. If you're interested (but I doubt it : it's pretty useless), I could share the repair code. More info on the logs : apparently, the first compact-related crash happened Wed, 17 Apr 2013 11:54:08 GMT : since I have hourly compacts, it means the corruption happened Wed, 17 Apr 2013 10:54:08 GMT at the earliest. Sifting through that period right now... On 19 April 2013 00:13, Robert Newson wrote: > You say this happens often? Clearly often enough that you have a > routine to repair it. > > B. > > On 18 April 2013 23:12, Robert Newson wrote: > > Hi Victor, > > > > Thanks for the information, we appreciate it. > > > > B. > > > > On 18 April 2013 23:07, Victor Nicollet wrote: > >> Replying to my own mail, hoping it will end up in the same thread (I w= as > >> not fully subscribed when I posted this, but I still read the archives= ). > >> > >> Answers to the questions you asked : > >> > >>=A0 - I have no idea when the issue happened. I will try to track it do= wn > in > >> the logs. I'm afraid I don't have time to filter out all customer > >> information from the logs and provide them to you, though I can > certainly > >> grep for error dumps if you want me to. I have never seen disk-related > >> errors in the log. > >>=A0 - I am running Debian x86_64 GNU/Linux, with erlang 1:15.b.1-d > >>=A0 - There are no unusual CouchDB configuration options ; the only cha= nge > I > >> performed was to disable reduce_limit. A perhaps notable usage aspect = : > all > >> the databases are compacted hourly. > >>=A0 - It's not NFS. From /etc/fstab : > >> > >> /dev/sda1=A0 =A0 =A0=A0=A0/=A0 =A0 =A0=A0=A0ext4=A0 =A0 errors=3Dremou= nt-ro=A0 =A0 =A0=A0=A00=A0 =A0 =A0=A0=A01 > >> /dev/sda2=A0 =A0 =A0=A0=A0/home=A0=A0=A0ext4=A0 =A0 defaults=A0 =A0 = =A0 =A0 =A0 =A0 =A0 =A0 0=A0 =A0 =A0=A0=A02 > >> > >> The dual-partition setup is a silly default from OVH (my dedicated > server > >> host), so I have /var/lib/couchdb as a symlink to /home/couchdb/lib, > from > >> sda1 to sda2. > >> > >> - I can't rule out a disk issue, because I don't have a lot of > experience > >> with those... any obvious diagnosis command you would like me to run ? > I am > >> certain that I have not run out of disk space, though (still around 1T= B > >> free on that drive). > >> > >> Thank you for your patience. > >> > >> On 18 April 2013 14:17, Victor Nicollet wrote: > >> > >>> Hello, > >>> > >>> The @CouchDB twitter account thought you might find this information > >>> helpful. > >>> > >>> My SaaS start-up uses CouchDB as its primary database. Lately, I have > been > >>> having database corruption issues with version 1.2.0 : every few > weeks, one > >>> of our databases becomes corrupted, which has several negative > consequences > >>> (among others) : > >>> > >>>=A0 =A0 - Replication of that database fails (it does not even start). > >>>=A0 =A0 - Compaction of that database fails and *freezes* the server. > >>>=A0 =A0 - Several documents in the database become inaccessible throug= h > either > >>>=A0 =A0 direct access or through _all_docs. > >>> > >>>=A0 The latest affected database does not contain any information abou= t > our > >>> customers, so I am allowed to release it publicly : > >>> > >>> http://nicollet.net/public/2013-04-18.couchdb/prod-folder.couch > >>> > >>> This database contains 325 irretrievable documents between identifier= s > >>> 2xFEY0pU2Eb and 3Fn6l04G6Oa. > >>> I hope this helps, > >>> > >>> -- > >>> Victor Nicollet, CTO, www.runorg.com > >>> > >> > >> > >> > >> -- > >> Victor Nicollet, Directeur Technique, www.runorg.com > --=20 Victor Nicollet, Directeur Technique, www.runorg.com --285087016-811432512-1366347834=:72418--