Return-Path: X-Original-To: apmail-couchdb-user-archive@www.apache.org Delivered-To: apmail-couchdb-user-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id EBB4810C25 for ; Fri, 12 Jul 2013 19:03:43 +0000 (UTC) Received: (qmail 92187 invoked by uid 500); 12 Jul 2013 19:03:42 -0000 Delivered-To: apmail-couchdb-user-archive@couchdb.apache.org Received: (qmail 91972 invoked by uid 500); 12 Jul 2013 19:03:42 -0000 Mailing-List: contact user-help@couchdb.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@couchdb.apache.org Delivered-To: mailing list user@couchdb.apache.org Received: (qmail 91964 invoked by uid 99); 12 Jul 2013 19:03:42 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 12 Jul 2013 19:03:42 +0000 X-ASF-Spam-Status: No, hits=-0.0 required=5.0 tests=RCVD_IN_DNSWL_NONE,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (athena.apache.org: domain of jens@couchbase.com designates 206.225.164.30 as permitted sender) Received: from [206.225.164.30] (HELO EXHUB020-3.exch020.serverdata.net) (206.225.164.30) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 12 Jul 2013 19:03:37 +0000 Received: from EXVMBX020-1.exch020.serverdata.net ([169.254.4.191]) by EXHUB020-3.exch020.serverdata.net ([206.225.164.30]) with mapi; Fri, 12 Jul 2013 12:03:15 -0700 From: Jens Alfke To: "user@couchdb.apache.org" , "matthias.eck@gmail.com" Date: Fri, 12 Jul 2013 12:03:15 -0700 Subject: Re: Database too large for partition Thread-Topic: Database too large for partition Thread-Index: Ac5/MnZZCPI4k4qMRp+w2mrJ0btDkQ== Message-ID: <2E279AEA-D6E5-4DBB-A599-288E2FBCB2A8@couchbase.com> References: In-Reply-To: Accept-Language: en-US Content-Language: en-US X-MS-Has-Attach: X-MS-TNEF-Correlator: acceptlanguage: en-US Content-Type: text/plain; charset="Windows-1252" Content-Transfer-Encoding: quoted-printable MIME-Version: 1.0 X-Virus-Checked: Checked by ClamAV on apache.org On Jul 11, 2013, at 12:20 PM, Matthias Eck wrote: > The database file itself has about 500GB, the views have another 500GB. Is the database 500GB of pure JSON, or are there attachments? (The idea of = 500GB of JSON boggles my mind, but then, I=92m not a big-data guy.) I ask because if a lot of that size is attachments, you can save space by s= toring those elsewhere, like in S3, and leaving just URLs or other IDs in t= he docs. This will also speed up compaction because the attachment data doe= sn=92t have to be copied. The drawback is that you=92ll have to manually de= lete attachments after their corresponding doc[s] are deleted or updated. =97Jens=