Date: Mon, 8 Aug 2011 22:18:27 +0000 (UTC)
From: "Henrik Hofmeister (JIRA)"
To: dev@couchdb.apache.org
Reply-To: dev@couchdb.apache.org
Message-ID: <71906335.18072.1312841907152.JavaMail.tomcat@hel.zones.apache.org>
In-Reply-To: <572996361.17940.1312840107142.JavaMail.tomcat@hel.zones.apache.org>
Subject: [jira] [Updated] (COUCHDB-1243) Compact and copy feature that resets changes
MIME-Version: 1.0
Content-Type: text/plain; charset=utf-8

[
https://issues.apache.org/jira/browse/COUCHDB-1243?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Henrik Hofmeister updated COUCHDB-1243:
---------------------------------------

    Description:
After running db and view compaction on a 70K doc db with 6+ million changes, it takes up 0.8 GB.
If the same documents are copied to a new db (get and bulk insert), the same data with 70K changes (only the inserts) takes up 40 MB. That is a huge difference. It has been verified on 2 dbs that the difference is more than 65 times the size of the data.
A "Compact and copy" feature that copies only documents and resets the changes for a db would be very nice, to try and limit the disk usage a little bit. (Our current test environment takes up nearly 100 GB...)
I've attached the dump/load PHP script for your convenience.

  was:
After running db and view compaction on a 7K doc db with 6+ million changes, it takes up 0.8 GB.
If the same documents are copied to a new db (get and bulk insert), the same data with 7K changes takes up 40 MB. That is a huge difference. It has been verified on 2 dbs that the difference is more than 65 times the size of the data.
A "Compact and copy" feature that copies only documents and resets the changes for a db would be very nice, to try and limit the disk usage a little bit. (Our current test environment takes up nearly 100 GB...)
I've attached the dump/load PHP script for your convenience.


> Compact and copy feature that resets changes
> --------------------------------------------
>
>                 Key: COUCHDB-1243
>                 URL: https://issues.apache.org/jira/browse/COUCHDB-1243
>             Project: CouchDB
>          Issue Type: New Feature
>          Components: Database Core
>    Affects Versions: 1.0.1, 1.1
>         Environment: Ubuntu, but not important
>            Reporter: Henrik Hofmeister
>              Labels: cleanup, compaction
>         Attachments: dump_load.php
>
>
> After running db and view compaction on a 70K doc db with 6+ million changes, it takes up 0.8 GB.
> If the same documents are copied to a new db (get and bulk insert), the same data with 70K changes (only the inserts) takes up 40 MB. That is a huge difference. It has been verified on 2 dbs that the difference is more than 65 times the size of the data.
> A "Compact and copy" feature that copies only documents and resets the changes for a db would be very nice, to try and limit the disk usage a little bit. (Our current test environment takes up nearly 100 GB...)
> I've attached the dump/load PHP script for your convenience.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira
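
For readers without the attachment, the "get and bulk insert" copy described in the issue can be sketched roughly as below. This is not the attached dump_load.php; it is a minimal Python illustration that assumes the standard CouchDB HTTP endpoints `_all_docs?include_docs=true` and `_bulk_docs`, and that the target db already exists. The function names (`strip_revisions`, `bulk_payloads`, `copy_documents`) and the batch size are hypothetical. Dropping `_rev` is assumed to be what lets each document start over at a first revision in the new db. Attachments are not handled.

```python
import json
import urllib.request
from typing import Any, Dict, Iterator, List


def strip_revisions(all_docs_response: Dict[str, Any]) -> List[Dict[str, Any]]:
    """Pull the documents out of an _all_docs?include_docs=true response
    and drop each document's _rev so the target db assigns new revisions."""
    docs = []
    for row in all_docs_response.get("rows", []):
        doc = row.get("doc")
        if doc is None:
            # Row without an embedded document; nothing to copy.
            continue
        docs.append({key: value for key, value in doc.items() if key != "_rev"})
    return docs


def bulk_payloads(docs: List[Dict[str, Any]],
                  batch_size: int = 1000) -> Iterator[Dict[str, Any]]:
    """Yield request bodies for _bulk_docs, batch_size documents at a time."""
    for start in range(0, len(docs), batch_size):
        yield {"docs": docs[start:start + batch_size]}


def fetch_json(url: str) -> Dict[str, Any]:
    """GET a URL and decode the JSON response body."""
    with urllib.request.urlopen(url) as response:
        return json.load(response)


def post_json(url: str, payload: Dict[str, Any]) -> Any:
    """POST a JSON payload and decode the JSON response body."""
    request = urllib.request.Request(
        url,
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )
    with urllib.request.urlopen(request) as response:
        return json.load(response)


def copy_documents(source_db_url: str, target_db_url: str,
                   batch_size: int = 1000) -> int:
    """Copy the current documents (not their history) from source to target.
    Loads the whole source db into memory, so only suitable for small dbs."""
    response = fetch_json(source_db_url + "/_all_docs?include_docs=true")
    docs = strip_revisions(response)
    for payload in bulk_payloads(docs, batch_size):
        post_json(target_db_url + "/_bulk_docs", payload)
    return len(docs)
```

A call would look something like `copy_documents("http://localhost:5984/olddb", "http://localhost:5984/newdb")`, where both URLs are placeholders. Whether this reproduces the size reduction reported in the issue depends on the data and the CouchDB version.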