Return-Path: Delivered-To: apmail-couchdb-user-archive@www.apache.org Received: (qmail 68359 invoked from network); 29 Apr 2009 22:12:12 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (140.211.11.3) by minotaur.apache.org with SMTP; 29 Apr 2009 22:12:12 -0000 Received: (qmail 80072 invoked by uid 500); 29 Apr 2009 22:12:11 -0000 Delivered-To: apmail-couchdb-user-archive@couchdb.apache.org Received: (qmail 79995 invoked by uid 500); 29 Apr 2009 22:12:11 -0000 Mailing-List: contact user-help@couchdb.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@couchdb.apache.org Delivered-To: mailing list user@couchdb.apache.org Received: (qmail 79985 invoked by uid 99); 29 Apr 2009 22:12:11 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 29 Apr 2009 22:12:11 +0000 X-ASF-Spam-Status: No, hits=-0.0 required=10.0 tests=SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (athena.apache.org: local policy) Received: from [128.200.36.30] (HELO translab.its.uci.edu) (128.200.36.30) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 29 Apr 2009 22:12:04 +0000 Received: from translab.its.uci.edu (localhost.localdomain [127.0.0.1]) by translab.its.uci.edu (8.13.1/8.12.10) with ESMTP id n3TMBcsR030570 for ; Wed, 29 Apr 2009 15:11:38 -0700 Received: (from jmarca@localhost) by translab.its.uci.edu (8.13.1/8.13.1/Submit) id n3TMBc8V030569 for user@couchdb.apache.org; Wed, 29 Apr 2009 15:11:38 -0700 Date: Wed, 29 Apr 2009 15:11:38 -0700 From: James Marca To: user@couchdb.apache.org Subject: how much stuff can one stuff into a CouchDB? Message-ID: <20090429221138.GA22995@translab.its.uci.edu> Mail-Followup-To: user@couchdb.apache.org Mime-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline User-Agent: Mutt/1.4.1i X-ITS-MailScanner: Found to be clean X-ITS-MailScanner-From: jmarca@translab.its.uci.edu X-ITS-Spam-Status: No X-Virus-Checked: Checked by ClamAV on apache.org Hi All, On the Wiki, the FAQ says: Q: How Much Stuff can I Store in CouchDB? A: With node partitioning, virtually unlimited. For a single database instance, the practical scaling limits aren't yet known. Is there some more recent guidance on this? I read the wiki pages "Configuring distributed systems", "Partitioning proposal", and "HTTP Bulk Document API", but as far as I tell, node partitioning isn't implemented yet (right? things are moving really fast around here!). I have about 70G of gzipped files (about 3,000 files) that I need to unzip. convert to json, and store. Unzipping explodes each file by about a factor of 7. I expect that adding the JSON structure will increase the data size even more. I read in an earlier posting that compacting the database will compress it back down significantly, but still, that's a big database file. I also have the option to break up the data into 9 logical chunks, but if I don't have to do it, I'd rather not. Anybody have any advice or experience with really big databases? Regards, James -- James E. Marca Researcher Institute of Transportation Studies AIRB Suite 4000 University of California Irvine, CA 92697-3600 jmarca@translab.its.uci.edu -- This message has been scanned for viruses and dangerous content by MailScanner, and is believed to be clean.