From: Chetan Mehrotra
To: oak-dev@jackrabbit.apache.org
Date: Mon, 15 Jul 2013 16:49:54 +0530
Subject: Re: Very large blobid with MongoMK

> This is not just one blob id, but a list of blob ids. It's 796 blob ids,
> each blob id is 8 KB. Total size is about 6 MB.

OK, so it's a multi-valued property. In OakDirectory for Lucene the
block size is set to 4092, while the minBlockSize in AbstractBlobStore
is 4096, so effectively all the Lucene index blocks are getting inlined.

Should we tweak either of these limits so that these blobs are not
inlined and are saved separately instead? Otherwise I think this array
would soon outgrow the 16 MB MongoDB document limit.

Chetan Mehrotra


On Mon, Jul 15, 2013 at 4:02 PM, Thomas Mueller wrote:
> Hi,
>
> This is not just one blob id, but a list of blob ids. It's 796 blob ids,
> each blob id is 8 KB. Total size is about 6 MB.
>
> Regards,
> Thomas
>
> On 7/15/13 11:46 AM, "Chetan Mehrotra" wrote:
>
>>> The blob id shouldn't be larger than a few KB at most (it can be
>>> configured).
>>
>>Yup, that's what I was thinking after looking into
>>org.apache.jackrabbit.mk.blobs.AbstractBlobStore#convertBlobToId.
>>I'll have a closer look.
>>
>>Chetan Mehrotra
>>
>>
>>On Mon, Jul 15, 2013 at 2:53 PM, Thomas Mueller wrote:
>>> Hi,
>>>
>>> The blob id shouldn't be larger than a few KB at most (it can be
>>> configured).
>>>
>>> Regards,
>>> Thomas
>>>
>>>
>>>
>>> On 7/15/13 11:13 AM, "Chetan Mehrotra" wrote:
>>>
>>>>Hi,
>>>>
>>>>I was trying to get an estimate of the size [1] of the various node
>>>>documents in MongoDB for a fresh CQ installation. The largest node was
>>>>for path "4:/oak:index/lucene/:data/_5_Lucene41_0.tim", weighing
>>>>upward of 6 MB. It has one binary property, jcr:data:
>>>>
>>>>"jcr:data" : {
>>>>    "r13fd2c82e10-0-1" :
>>>>"[\":blobId:00fc1f3fd76c1715424c4f4.....00031aea1\"]"
>>>>}
>>>>
>>>>The value stored above is very large. Before digging in further, I
>>>>wanted to check whether this is expected or whether the blob id
>>>>should be something smaller.
>>>>
>>>>Regards
>>>>Chetan
>>>>
>>>>[1] // Find the largest document in the nodes collection
>>>>var max = 0;
>>>>var maxObj;
>>>>db.nodes.find().forEach(function(obj) {
>>>>    var curr = Object.bsonsize(obj); // BSON size in bytes
>>>>    if (max < curr) {
>>>>        max = curr;
>>>>        maxObj = obj;
>>>>    }
>>>>});
>>>>print(max);
>>>>printjson(maxObj);
>>>>
>>>>Chetan Mehrotra
>>>
>
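
A rough back-of-the-envelope check of the 16 MB concern raised at the top of this thread, using only the figures quoted here (796 ids, about 8 KB per inlined id, a 4092-byte OakDirectory block size). The constant and variable names are made up for illustration, and the estimate ignores the other fields and BSON overhead in the node document, so treat it as a sketch rather than an exact limit:

```javascript
// Figures quoted in the thread; names are hypothetical, chosen for this sketch.
const ID_SIZE_BYTES = 8 * 1024;            // "each blob id is 8 KB"
const ID_COUNT = 796;                      // "It's 796 blob ids"
const BLOCK_SIZE_BYTES = 4092;             // OakDirectory block size
const DOC_LIMIT_BYTES = 16 * 1024 * 1024;  // MongoDB document size limit

// Size of the current id list (ignores other document fields).
const currentBytes = ID_COUNT * ID_SIZE_BYTES;

// How many inlined ids fit before the document limit is reached.
const maxIds = Math.floor(DOC_LIMIT_BYTES / ID_SIZE_BYTES);

// Approximate Lucene file size at which that happens,
// assuming one inlined id per 4092-byte block.
const maxFileBytes = maxIds * BLOCK_SIZE_BYTES;

const toMB = (bytes) => (bytes / (1024 * 1024)).toFixed(2);
console.log("current id list: " + toMB(currentBytes) + " MB");
console.log("ids until limit: " + maxIds);
console.log("file size at limit: ~" + toMB(maxFileBytes) + " MB");
```

Under these assumptions the id list hits the limit at roughly 2048 blocks, that is, once a single Lucene index file grows to about 8 MB.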