Return-Path: Delivered-To: apmail-couchdb-user-archive@www.apache.org Received: (qmail 34005 invoked from network); 18 Jun 2009 02:46:58 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (140.211.11.3) by minotaur.apache.org with SMTP; 18 Jun 2009 02:46:58 -0000 Received: (qmail 73085 invoked by uid 500); 18 Jun 2009 02:47:08 -0000 Delivered-To: apmail-couchdb-user-archive@couchdb.apache.org Received: (qmail 73026 invoked by uid 500); 18 Jun 2009 02:47:08 -0000 Mailing-List: contact user-help@couchdb.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@couchdb.apache.org Delivered-To: mailing list user@couchdb.apache.org Received: (qmail 73016 invoked by uid 99); 18 Jun 2009 02:47:08 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 18 Jun 2009 02:47:08 +0000 X-ASF-Spam-Status: No, hits=2.2 required=10.0 tests=HTML_MESSAGE,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (nike.apache.org: local policy) Received: from [208.53.66.14] (HELO locke.effortlessis.com) (208.53.66.14) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 18 Jun 2009 02:46:57 +0000 Received: from turing.schoolpathways.com (adsl-76-193-72-245.dsl.chi2ca.sbcglobal.net [76.193.72.245]) (authenticated bits=0) by locke.effortlessis.com (8.13.1/8.13.1) with ESMTP id n5I2kUvp032548 (version=TLSv1/SSLv3 cipher=DHE-RSA-AES256-SHA bits=256 verify=NO) for ; Wed, 17 Jun 2009 19:46:32 -0700 From: Benjamin Smith To: user@couchdb.apache.org Subject: High volume couchDB? Date: Wed, 17 Jun 2009 19:46:29 -0700 User-Agent: KMail/1.11.3 (Linux/2.6.27.24-170.2.68.fc10.i686; KDE/4.2.3; i686; ; ) MIME-Version: 1.0 Content-Type: multipart/alternative; boundary="Boundary-00=_FqaOKtmxbqGSEDp" Content-Transfer-Encoding: 7bit Content-Disposition: inline Message-Id: <200906171946.29601.lists@benjamindsmith.com> X-Greylist: Sender succeeded SMTP AUTH authentication, not delayed by milter-greylist-2.0.2 (locke.effortlessis.com [208.53.66.14]); Wed, 17 Jun 2009 19:46:33 -0700 (PDT) X-EffortlessIS-MailScanner-Information: Please contact the ISP for more information X-EffortlessIS-MailScanner: Found to be clean X-EffortlessIS-MailScanner-From: lists@benjamindsmith.com X-Virus-Checked: Checked by ClamAV on apache.org X-Old-Spam-Status: No --Boundary-00=_FqaOKtmxbqGSEDp Content-Type: text/plain; charset="us-ascii" Content-Transfer-Encoding: 7bit I recently ran across this project while doing research into Erlang for High Availability, from what I can see, this project may be exactly what I've been looking for. We've been using a clustered filesystem API for our large php application to keep files and related, semi-structured data. It was developed in-house. We see perhaps 250,000 file operations per day, with 3 nodes to store data. Think: Server-level RAID 1. Total data size is ~ 1 TB, with about 50% growth per year. What I'm looking for: 1) Ability to store misc files. (Mixed: PDFs, JPGs, iso images, text files, etc) 2) Ability to store related metadata close by (time stamp, ownership data, application-specific data, etc) We do this now by having a "sister" file with the data in PHP format serialized with a ".mdt" extension. 3) Redundancy: zero data loss in the event of a server failure. We achieve this now by having our own, developed-in-house file server daemon running under xinet.d. Conceptually, it's similar to WebDAV, but lighter weight. 4) Failover: ability to keep working even with partial cluster failure. 5) Healing: ability to get "back together" when downed servers are restored. 6) Performance that degrades gracefully: What happens when the screws get put to CouchDB? What kinds of loads can it sustain given mid-range hardware? 7) Backups off-site: Disaster Recovery plans, currently we're using rsync run nightly. 8) Reliability: It should "just work" without needing regular babysitting. Am I right in reading that CouchDB accomplishes all/most/many of these goals? If not all of them, which would need watching? Thanks! -- This message has been scanned for viruses and dangerous content by MailScanner, and is believed to be clean. --Boundary-00=_FqaOKtmxbqGSEDp--