Return-Path: Delivered-To: apmail-couchdb-user-archive@www.apache.org Received: (qmail 70738 invoked from network); 6 Apr 2009 15:01:30 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (140.211.11.3) by minotaur.apache.org with SMTP; 6 Apr 2009 15:01:30 -0000 Received: (qmail 42340 invoked by uid 500); 6 Apr 2009 15:01:29 -0000 Delivered-To: apmail-couchdb-user-archive@couchdb.apache.org Received: (qmail 42265 invoked by uid 500); 6 Apr 2009 15:01:29 -0000 Mailing-List: contact user-help@couchdb.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@couchdb.apache.org Delivered-To: mailing list user@couchdb.apache.org Received: (qmail 42254 invoked by uid 99); 6 Apr 2009 15:01:29 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 06 Apr 2009 15:01:29 +0000 X-ASF-Spam-Status: No, hits=-0.0 required=10.0 tests=SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (athena.apache.org: domain of b.candler@pobox.com designates 208.72.237.25 as permitted sender) Received: from [208.72.237.25] (HELO sasl.smtp.pobox.com) (208.72.237.25) by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 06 Apr 2009 15:01:21 +0000 Received: from localhost.localdomain (unknown [127.0.0.1]) by a-sasl-quonix.sasl.smtp.pobox.com (Postfix) with ESMTP id 33267CA43; Mon, 6 Apr 2009 11:00:59 -0400 (EDT) Received: from mappit (unknown [80.45.95.114]) (using TLSv1 with cipher AES256-SHA (256/256 bits)) (No client certificate requested) by a-sasl-quonix.sasl.smtp.pobox.com (Postfix) with ESMTPSA id BA003CA40; Mon, 6 Apr 2009 11:00:56 -0400 (EDT) Received: from brian by mappit with local (Exim 4.69) (envelope-from ) id 1LqqJj-00057E-2J; Mon, 06 Apr 2009 16:00:55 +0100 Date: Mon, 6 Apr 2009 16:00:55 +0100 From: Brian Candler To: Adam Wolff Cc: user@couchdb.apache.org, Tom McNulty Subject: Re: Reduce Assumptions Message-ID: <20090406150054.GA19604@uk.tiscali.com> References: <20090406085100.GA14371@uk.tiscali.com> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: User-Agent: Mutt/1.5.17+20080114 (2008-01-14) X-Pobox-Relay-ID: BCF37274-22BB-11DE-A184-DC76898A30C1-28021239!a-sasl-quonix.pobox.com X-Virus-Checked: Checked by ClamAV on apache.org On Mon, Apr 06, 2009 at 07:46:37AM -0700, Adam Wolff wrote: > I'm still not quite clear on > the meaning of this: > "reduce functions should not grow its output larger than log(n) where n is > the number of input rows" > > How is the size of the output measured? length of the JSON string? are > the input rows > the size of the value emitted by the map? I don't know the details - I just copy-pasted that from the wiki. > Does this mean it's wrong to > have an entry in > your reduce output for every map key? I believe that's the case. The reduce output forms a tree overlaid onto the document Btree: each Btree node contains N documents plus a single reduce value from those N documents. The Btree could be multiple levels deep, and you don't want to end up with a single enormous value in the root node. But I don't know what that actual size limit is. Regards, Brian.