Return-Path: X-Original-To: apmail-jackrabbit-oak-dev-archive@minotaur.apache.org Delivered-To: apmail-jackrabbit-oak-dev-archive@minotaur.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 58BAC1003B for ; Tue, 15 Oct 2013 07:33:12 +0000 (UTC) Received: (qmail 82514 invoked by uid 500); 15 Oct 2013 07:33:11 -0000 Delivered-To: apmail-jackrabbit-oak-dev-archive@jackrabbit.apache.org Received: (qmail 82379 invoked by uid 500); 15 Oct 2013 07:33:11 -0000 Mailing-List: contact oak-dev-help@jackrabbit.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: oak-dev@jackrabbit.apache.org Delivered-To: mailing list oak-dev@jackrabbit.apache.org Received: (qmail 82371 invoked by uid 99); 15 Oct 2013 07:33:10 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 15 Oct 2013 07:33:10 +0000 X-ASF-Spam-Status: No, hits=0.3 required=5.0 tests=FRT_ADOBE2,RCVD_IN_DNSWL_LOW,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (athena.apache.org: domain of ianboston@gmail.com designates 74.125.82.174 as permitted sender) Received: from [74.125.82.174] (HELO mail-we0-f174.google.com) (74.125.82.174) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 15 Oct 2013 07:33:06 +0000 Received: by mail-we0-f174.google.com with SMTP id u56so8003192wes.19 for ; Tue, 15 Oct 2013 00:32:45 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=mime-version:sender:in-reply-to:references:date:message-id:subject :from:to:content-type; bh=cJbQoqa4qodw1iFonAa16uQMi44/dWxdLcalktq5wLo=; b=w9rTQL5Kfp8QzWF6rbujFr2xG+pPUemdhEpexwen08KCMI37uWU/Kmid5qwbidNoag 2rdPKoCOxInxV8mUbUlWf8zhwDKIYTIfGVUZJKy1F9npD7f91UtnlPDBiDfcyYEVoU3/ L07DM/ZP6IVwlS4Otx0ShNnpuCZiqqRXhl+hEbBZ/T8CQTWd6wDO9pfsVRAmm+CZGMSx 8gggK6BWcmVFoZMq928rjcn+IKU+dOMwBdNK9UGcN/F4INRHFIqS2gogXXSHM9rcCbmw zIvXpNUyqr9RGTzlw0NCzDKlNfkUsIN+jmJ7MdTABzyF17eZP201sQFr+c06Dk6N75qT 1Fvg== MIME-Version: 1.0 X-Received: by 10.194.173.163 with SMTP id bl3mr33641333wjc.10.1381822364971; Tue, 15 Oct 2013 00:32:44 -0700 (PDT) Sender: ianboston@gmail.com Received: by 10.217.43.200 with HTTP; Tue, 15 Oct 2013 00:32:44 -0700 (PDT) In-Reply-To: References: Date: Tue, 15 Oct 2013 08:32:44 +0100 X-Google-Sender-Auth: QYP4vRQPOifLoTFTVylzhWRK4hM Message-ID: Subject: Re: content hash of a tree From: Ian Boston To: "oak-dev@jackrabbit.apache.org" Content-Type: text/plain; charset=ISO-8859-1 X-Virus-Checked: Checked by ClamAV on apache.org On 15 October 2013 08:13, Thomas Mueller wrote: > Hi, > >>I thought this was one of the fundamental principles that >>allow for quick diff/merge and help identify a commit within the MVCC >>timeline. > > I'm not sure where you heard or read that, but a content hash is not > needed for this; a revision number that is changed whenever there is a > modification is enough. (For each change in this node or any direct or > indirect child node.) For the MongoMK, the revision is a combination of > the timestamp, the cluster node id, and a counter; very similar to the > MongoDB object id: http://docs.mongodb.org/manual/reference/object-id/ > >>what was the reason to abandon this idea? > > As for the MongoDB: performance and scalability. A node lookup by content > hash would be bad for performance, as it would require an index on a > randomly distributed data. See also > http://fr.slideshare.net/daumdna/mongodb-scaling-write-performance - page > 9 (the red line is with an index on the content hash, the green line > without). But even without such an index: maintaining the content hash > would be prohibitively expensive and would prevent scalable writes. With the current state of Oak, is Oak on Cassandra possible ? Slide 45. Ian > > Regards, > Thomas >