couchdb-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Henrik Hofmeister (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (COUCHDB-1243) Compact and copy feature that resets changes
Date Mon, 08 Aug 2011 23:06:27 GMT

    [ https://issues.apache.org/jira/browse/COUCHDB-1243?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13081284#comment-13081284
] 

Henrik Hofmeister commented on COUCHDB-1243:
--------------------------------------------

export/import would do the trick as well - or at least make it easier... However we are using
couchdb intensively for both moderate and huge size dbs... this forever growing changes size
will cause us to switch away from couch eventually - as we are rapidly growing into SAN size
requirements which makes couchdb a very expensive db :( Also making view changes and compaction
is getting to a point where it has to be done in weekends to allow for it to update. Our main
db has 2 changes for every document... with 7 mio documents -  we are facing a staggering
15 mio changes :)

I'd atleast consider that couchdb is - to my understanding - built for web scale - and we
are nowhere near our expected size and already growing out of it? 

> Compact and copy feature that resets changes
> --------------------------------------------
>
>                 Key: COUCHDB-1243
>                 URL: https://issues.apache.org/jira/browse/COUCHDB-1243
>             Project: CouchDB
>          Issue Type: New Feature
>          Components: Database Core
>    Affects Versions: 1.0.1, 1.1
>         Environment: Ubuntu, but not important
>            Reporter: Henrik Hofmeister
>              Labels: cleanup, compaction
>         Attachments: dump_load.php
>
>
> After running db and view compaction on a 70K doc db with 6+ mio. changes - it takes
up 0.8 GB. If copying the same documents to a new db (get and bulk insert) - the same date
with 70K changes (only the inserts) takes up 40 mb. That is a huge difference. Has been verified
on 2 db's that the difference is more than 65 times the size of data.
> A "Compact and copy" feature that copies only documents, and resets the changes for at
db would be very nice to try and limit the disk usage a little bit. (Our current test environment
takes up nearly 100 GB... )
> I've attached the dump load php script for your convenience.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

Mime
View raw message