Return-Path: X-Original-To: apmail-couchdb-user-archive@www.apache.org Delivered-To: apmail-couchdb-user-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 2DB529181 for ; Fri, 16 Mar 2012 09:53:40 +0000 (UTC) Received: (qmail 77611 invoked by uid 500); 16 Mar 2012 09:53:38 -0000 Delivered-To: apmail-couchdb-user-archive@couchdb.apache.org Received: (qmail 77563 invoked by uid 500); 16 Mar 2012 09:53:38 -0000 Mailing-List: contact user-help@couchdb.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@couchdb.apache.org Delivered-To: mailing list user@couchdb.apache.org Received: (qmail 77554 invoked by uid 99); 16 Mar 2012 09:53:38 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 16 Mar 2012 09:53:38 +0000 X-ASF-Spam-Status: No, hits=-0.0 required=5.0 tests=RCVD_IN_DNSWL_LOW,SPF_NEUTRAL X-Spam-Check-By: apache.org Received-SPF: neutral (athena.apache.org: local policy) Received: from [209.85.213.52] (HELO mail-yw0-f52.google.com) (209.85.213.52) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 16 Mar 2012 09:53:33 +0000 Received: by yhpp61 with SMTP id p61so4915792yhp.11 for ; Fri, 16 Mar 2012 02:53:12 -0700 (PDT) X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=20120113; h=mime-version:in-reply-to:references:from:date:message-id:subject:to :content-type:x-gm-message-state; bh=CSAFk7/Yo0QiBsWkNoIm1nCBnV5sgWdLXAamgbHWfKs=; b=R/9qkbNgHScuT/7JYPpH7QHEU/2LXYuv6RarOWrRr0c3UwCgHlKp9q2VC4XSXB3DmO FsnD8nadPmKyOOBBZWcsKrOpXjJpHcsg82gMfQfANcYYjzGnkztBwaVp5OP4uTo/Ppob HZGXxJ8Ie4zFa9rwCBbeh2S3Nm7w60JGOmN1GKYNi+ZmnxMeA4F3VBOL+/cjpXsNzvRq +zz4YqaULK83OOtRUcx81uPpeToILAwd6fdmVaTO1tyhg7hpddA45cUgFizytP5YLVRJ p8Yiq+vluKUtor/yAqJTDgflNEyz8xlQ58YkAU0/m/GJjDt3dGV8ri8jKbwV/ugepBL7 SVpQ== Received: by 10.60.18.197 with SMTP id y5mr2126976oed.58.1331891592175; Fri, 16 Mar 2012 02:53:12 -0700 (PDT) MIME-Version: 1.0 Received: by 10.60.19.198 with HTTP; Fri, 16 Mar 2012 02:52:52 -0700 (PDT) In-Reply-To: References: From: Jason Smith Date: Fri, 16 Mar 2012 09:52:52 +0000 Message-ID: Subject: Re: Size of couchdb documents To: user@couchdb.apache.org Content-Type: text/plain; charset=UTF-8 X-Gm-Message-State: ALoCoQlIwFFvdsoBhr2SkeQxd83S4NFq9viBQHZdFjMaF9qlxwm7u7/C2LSj6NXx7F+33XEBzLoF X-Virus-Checked: Checked by ClamAV on apache.org On Fri, Mar 16, 2012 at 9:10 AM, Daniel Gonzalez wrote: >> >> Hi, Daniel. That's great news! Also, I have an update from a CouchDB 1.2.0 >> test. >> >> I have a database here with 10 million documents, most several KB of >> English text. upgrade to version 1.2 changed the database size from >> 38GB to is 9.2GB, or now 0.94 KB per document. >> > > That is interesting. Is CouchDB reducing the size of your stored data? > Compression? Or is the average size of your input data smaller than 0.94KB? > (I am not sure what "most several KB" means) Well, you busted me. I do not know the average size of the documents offhand, but I suspect it is much greater than 400 bytes, because many of the documents are a few KB (maybe 1kb-5kb) of text strings. But, yes, CouchDB 1.2 stores data compressed on the disk. I am using the Snappy option for the minimal CPU hit. http://code.google.com/p/snappy/ -- Iris Couch