Date: Thu, 17 Jun 2010 16:00:01 -0600
Subject: using gzip for db and view indexes
From: Norman Barker
To: user@couchdb.apache.org

Hi,

I am looking at the CouchDB database and view index directories, and I see that the files are saved as binary. My indexes and database are getting fairly large, so I tried gzipping them by hand, and it made a big difference (at least for my data).

Looking at http://www.erlang.org/doc/man/file.html I see that compressed is an option when reading or writing a file. Is it worth trying this out? Could it be an option in the ini file, so that we could trade off database size against a possible lag in access?

I can look into this. Does everything go through the couch_file module, and is there a suitable test dataset that we can analyse performance with?

thanks,

Norman
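
To put numbers on the size versus access-lag trade-off before touching any Erlang code, a rough sketch like the one below might help. It is plain Python, not part of CouchDB, and the function name measure_gzip is made up for illustration. It streams a file through gzip, counts the compressed bytes without writing anything to disk, and times the run. Compressing a whole file in one pass is only an approximation of what a compressed option inside CouchDB would achieve, so treat the figures as a first estimate.

```python
import os
import time
import zlib


def measure_gzip(path, level=6, chunk_size=1 << 20):
    """Return (original_bytes, compressed_bytes, seconds) for gzipping `path`.

    Hypothetical helper: the file is read in chunks and the compressed
    output is only counted, so the original file is never modified.
    """
    # wbits=31 asks zlib for a gzip container (header and trailer included).
    compressor = zlib.compressobj(level, zlib.DEFLATED, 31)
    compressed = 0
    start = time.perf_counter()
    with open(path, "rb") as f:
        while True:
            chunk = f.read(chunk_size)
            if not chunk:
                break
            compressed += len(compressor.compress(chunk))
    # flush() emits whatever is still buffered plus the gzip trailer.
    compressed += len(compressor.flush())
    elapsed = time.perf_counter() - start
    return os.path.getsize(path), compressed, elapsed


if __name__ == "__main__":
    import sys

    # Pass database or view index files on the command line.
    for path in sys.argv[1:]:
        original, compressed, elapsed = measure_gzip(path)
        ratio = compressed / original if original else float("nan")
        print(f"{path}: {original} -> {compressed} bytes "
              f"({ratio:.1%}) in {elapsed:.2f}s")
```

Running it against a copy of a database file and a view index file (the paths depend on the local installation) gives a size ratio and a compression time per file, which could serve as a baseline when comparing candidate test datasets.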