Return-Path: Delivered-To: apmail-hbase-user-archive@www.apache.org Received: (qmail 62457 invoked from network); 22 Nov 2010 10:01:38 -0000 Received: from unknown (HELO mail.apache.org) (140.211.11.3) by 140.211.11.9 with SMTP; 22 Nov 2010 10:01:38 -0000 Received: (qmail 4337 invoked by uid 500); 22 Nov 2010 10:02:09 -0000 Delivered-To: apmail-hbase-user-archive@hbase.apache.org Received: (qmail 4309 invoked by uid 500); 22 Nov 2010 10:02:08 -0000 Mailing-List: contact user-help@hbase.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@hbase.apache.org Delivered-To: mailing list user@hbase.apache.org Received: (qmail 4301 invoked by uid 99); 22 Nov 2010 10:02:08 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 22 Nov 2010 10:02:08 +0000 X-ASF-Spam-Status: No, hits=4.4 required=10.0 tests=FREEMAIL_ENVFROM_END_DIGIT,FREEMAIL_FROM,HTML_MESSAGE,RCVD_IN_DNSWL_NONE,RFC_ABUSE_POST,SPF_PASS,T_TO_NO_BRKTS_FREEMAIL X-Spam-Check-By: apache.org Received-SPF: pass (nike.apache.org: domain of fnord999@googlemail.com designates 209.85.216.41 as permitted sender) Received: from [209.85.216.41] (HELO mail-qw0-f41.google.com) (209.85.216.41) by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 22 Nov 2010 10:02:01 +0000 Received: by qwi2 with SMTP id 2so2315654qwi.14 for ; Mon, 22 Nov 2010 02:01:40 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=googlemail.com; s=gamma; h=domainkey-signature:mime-version:received:received:date:message-id :subject:from:to:content-type; bh=gy3E1AQZ2R1V/LxzxTzhgvhQtKECK1Q4qUo5QPTKfqY=; b=YtzT9W280c8Y+IFjTk6UME3F9Ten2ZfUePSn1LaEP/+z/cIdO830VR+y0WD7uwq7XE XVvj/hKmJyKiPxhtdQRtxGgakL0URH8t0W0qdkVH7mY3zKReLJK4TXf9yBsngYPQJJyU y/VvA9UbdkMAPgD93q7izGr6Pk7rDcTTEHwBI= DomainKey-Signature: a=rsa-sha1; c=nofws; d=googlemail.com; s=gamma; h=mime-version:date:message-id:subject:from:to:content-type; b=IC88XCqgc+YbnHY0ydWeB9asJLgioVVgGwqAeL/11UUkId2SBippg9mCuhhmMkvYq7 pUgN/1KtBzsOJ/5LSauNYX3KjJMvphzSw3YYEdhTYL+JMITwg7ACxseFQp1XdVHl5W8q dWDvNiLYn8w/jVILww+0SsZOpfXPsOCfumqfM= MIME-Version: 1.0 Received: by 10.224.200.2 with SMTP id eu2mr4091906qab.258.1290420100699; Mon, 22 Nov 2010 02:01:40 -0800 (PST) Received: by 10.224.11.74 with HTTP; Mon, 22 Nov 2010 02:01:40 -0800 (PST) Date: Mon, 22 Nov 2010 11:01:40 +0100 Message-ID: Subject: LazyFetching of Row Results in MapReduce From: fnord 99 To: user@hbase.apache.org Content-Type: multipart/alternative; boundary=20cf300fb4c1b694ac0495a157fc X-Virus-Checked: Checked by ClamAV on apache.org --20cf300fb4c1b694ac0495a157fc Content-Type: text/plain; charset=ISO-8859-1 Hi all, I recently filled an hbase table with many millions of columns in each row (!). The problem that now occured was that I always get a Heap Space Error from the JVM with a subsequent shutdown of all regionservers in which the error occurs. Since the error isn't thrown in any of my own classes, I think that the problem is the following: * a row is always completely read into memory upon access (at least all column families that I'm interested in) * the Result object holds the complete family-qualifier-value pairs in a KeyValue[] * this is sometimes too much to be handled by the physical memory each map can get, therefore a heap space error is thrown My question is now: is there any lazy fetching technique implemented within the single key-values within one row? In my opinion it should be but I couldn't find anything in the source code or wiki that hints to that. Any ideas on how to go around this problem? I had the idea to rebuild the table schema to store more data in the row key and less data in the column families which would make the tables "thinner" and "longer". It would work in the current setup, however, it wouldn't solve the original problem... Thanks already in advance for any input on that, fnord999 --20cf300fb4c1b694ac0495a157fc--