Return-Path: X-Original-To: apmail-couchdb-user-archive@www.apache.org Delivered-To: apmail-couchdb-user-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id AF1CA9099 for ; Mon, 14 May 2012 23:59:15 +0000 (UTC) Received: (qmail 77608 invoked by uid 500); 14 May 2012 23:59:14 -0000 Delivered-To: apmail-couchdb-user-archive@couchdb.apache.org Received: (qmail 77561 invoked by uid 500); 14 May 2012 23:59:14 -0000 Mailing-List: contact user-help@couchdb.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@couchdb.apache.org Delivered-To: mailing list user@couchdb.apache.org Received: (qmail 77552 invoked by uid 99); 14 May 2012 23:59:14 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 14 May 2012 23:59:14 +0000 X-ASF-Spam-Status: No, hits=0.0 required=5.0 tests=SPF_PASS,T_TVD_MIME_EPI X-Spam-Check-By: apache.org Received-SPF: pass (athena.apache.org: local policy) Received: from [128.200.36.30] (HELO translab.its.uci.edu) (128.200.36.30) by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 14 May 2012 23:59:07 +0000 Received: from translab.its.uci.edu (localhost.localdomain [127.0.0.1]) by translab.its.uci.edu (8.13.1/8.12.10) with ESMTP id q4ENwev7019420 for ; Mon, 14 May 2012 16:58:40 -0700 Received: (from jmarca@localhost) by translab.its.uci.edu (8.13.1/8.13.1/Submit) id q4ENwedC019419 for user@couchdb.apache.org; Mon, 14 May 2012 16:58:40 -0700 Date: Mon, 14 May 2012 16:58:40 -0700 From: James Marca To: "user@couchdb.apache.org" Subject: Re: reducing db size Message-ID: <20120514235840.GE2665@translab.its.uci.edu> Mail-Followup-To: "user@couchdb.apache.org" References: <20120514200824.GB2665@translab.its.uci.edu> Mime-Version: 1.0 Content-Type: multipart/signed; micalg=pgp-sha1; protocol="application/pgp-signature"; boundary="ffoCPvUAPMgSXi6H" Content-Disposition: inline In-Reply-To: User-Agent: Mutt/1.4.1i X-ITS-MailScanner-Information: Please contact the ISP for more information X-ITS-MailScanner: Found to be clean X-ITS-MailScanner-From: jmarca@translab.its.uci.edu X-ITS-Spam-Status: No X-Virus-Checked: Checked by ClamAV on apache.org --ffoCPvUAPMgSXi6H Content-Type: text/plain; charset=utf-8 Content-Disposition: inline Content-Transfer-Encoding: quoted-printable On Mon, May 14, 2012 at 01:42:00PM -0700, Jens Alfke wrote: >=20 > On May 14, 2012, at 1:08 PM, James Marca wrote: >=20 > For example, I have detector data with one record per 30 seconds. If > I combine data into daily docs and save, after compaction the > resulting database is much smaller than if I keep one document per > observation. >=20 > Isn=E2=80=99t that just because there are a lot fewer nodes in the b-tree? >=20 > The disadvantage of large documents is that they=E2=80=99re expensive to > update, and they don=E2=80=99t play well with sync (if there are multiple > writers) as they become very prone to conflicts. yes, and if they get too big they get slow to open up with JSON parsers (which generally aren't streaming parsers). There are always trade-offs James --ffoCPvUAPMgSXi6H Content-Type: application/pgp-signature Content-Disposition: inline -----BEGIN PGP SIGNATURE----- Version: GnuPG v1.2.6 (GNU/Linux) iD8DBQFPsZww+t/6L/9qydcRAmvEAJ920UWgMK7g0QXJlTg+VEiw6yPHlgCfajmH Egx1YpqR0036DCoAODnjYcg= =WZsj -----END PGP SIGNATURE----- --ffoCPvUAPMgSXi6H-- -- This message has been scanned for viruses and dangerous content by MailScanner, and is believed to be clean.