Mailing-List: contact user-help@hbase.apache.org; run by ezmlm
Reply-To: user@hbase.apache.org
MIME-Version: 1.0
In-Reply-To: <84A93B85-3833-45C4-9EC1-388E7DB4EFDB@gmail.com>
References: <84A93B85-3833-45C4-9EC1-388E7DB4EFDB@gmail.com>
Date: Sat, 2 Nov 2013 16:29:02 +0100
Subject: Re: OutOfMemoryError in MapReduce Job
From: John
To: user@hbase.apache.org
Content-Type: multipart/alternative; boundary=001a11c3145caed3e004ea33571c

--001a11c3145caed3e004ea33571c
Content-Type: text/plain; charset=ISO-8859-1

I already use LZO compression in HBase. Or do you mean a compressed Java
object? Do you know an implementation?

kind regards


2013/11/2 Asaf Mesika

> I would try to compress this bit set.
>
> On Nov 2, 2013, at 2:43 PM, John wrote:
>
> > Hi,
> >
> > thanks for your answer! I increased the "Map Task Maximum Heap Size" to 2gb
> > and it seems to work. The OutOfMemoryError is gone. But the HBase region
> > servers are now crashing all the time :-/ I try to store the bitvector
> > (120mb in size) for some rows. This seems to be very memory intensive:
> > usedHeapMB increases very fast (up to 2gb). I'm not sure if it is the
> > reading or the writing task which causes this, but I think it's the writing
> > task. Any idea how to minimize the memory usage?
> > My mapper looks like this:
> >
> > public class MyMapper extends TableMapper<ImmutableBytesWritable, Put> {
> >
> >     private void storeBitvectorToHBase(byte[] name, BitSet bitvector,
> >             Context context) throws IOException, InterruptedException {
> >         Put row = new Put(name);
> >         // skip the write-ahead log for this Put
> >         row.setWriteToWAL(false);
> >         row.add(cf, Bytes.toBytes("columname"), toByteArray(bitvector));
> >         ImmutableBytesWritable key = new ImmutableBytesWritable(name);
> >         context.write(key, row);
> >     }
> > }
> >
> >
> > kind regards
> >
> >
> > 2013/11/1 Jean-Marc Spaggiari
> >
> >> Hi John,
> >>
> >> You might be better off asking this on the CDH mailing list since it's more
> >> related to Cloudera Manager than HBase.
> >>
> >> In the meantime, can you try to update the "Map Task Maximum Heap Size"
> >> parameter too?
> >>
> >> JM
> >>
> >>
> >> 2013/11/1 John
> >>
> >>> Hi,
> >>>
> >>> I have a problem with the memory. My use case is the following: I've created
> >>> a MapReduce job that iterates over every row. If the row has more
> >>> than, for example, 10k columns, I create a bloom filter (a BitSet) for
> >>> this row and store it in the HBase structure. This worked fine so far.
> >>>
> >>> BUT now I try to store a BitSet with 1000000000 elements = ~120mb in size.
> >>> In every map() function there exist 2 BitSets. If I try to execute the
> >>> MR job I get this error: http://pastebin.com/DxFYNuBG
> >>>
> >>> Obviously, the tasktracker does not have enough memory. I tried to adjust
> >>> the configuration for the memory, but I'm not sure which is the right one.
> >>> I tried to change the "MapReduce Child Java Maximum Heap Size" value from 1GB
> >>> to 2GB, but still got the same error.
> >>>
> >>> Which parameters do I have to adjust? BTW, I'm using CDH 4.4.0 with
> >>> Cloudera Manager.
> >>>
> >>> kind regards
> >>>
> >>
> >
--001a11c3145caed3e004ea33571c--
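
On the open question of an implementation for compressing the bit set: what follows is only a minimal sketch, not something posted in this thread. It deflates the bytes of a java.util.BitSet with the JDK's java.util.zip classes before they would be handed to a Put. The class and method names (BitSetCompressionSketch, compress, decompress) are hypothetical, and the sketch assumes Java 7 or later for BitSet.toByteArray() and BitSet.valueOf(). A well-filled bloom filter has roughly half of its bits set in a random-looking pattern, so deflate will probably gain little there; the approach mainly pays off for sparse bit sets.

```java
import java.io.ByteArrayOutputStream;
import java.util.BitSet;
import java.util.zip.DataFormatException;
import java.util.zip.Deflater;
import java.util.zip.Inflater;

// Hypothetical helper: not part of HBase or of the mapper quoted above.
class BitSetCompressionSketch {

    /** Serializes the BitSet to bytes and deflates them. */
    static byte[] compress(BitSet bits) {
        byte[] raw = bits.toByteArray();
        Deflater deflater = new Deflater(Deflater.BEST_SPEED);
        deflater.setInput(raw);
        deflater.finish();
        ByteArrayOutputStream out = new ByteArrayOutputStream();
        byte[] buffer = new byte[64 * 1024];
        while (!deflater.finished()) {
            int n = deflater.deflate(buffer);
            out.write(buffer, 0, n);
        }
        deflater.end();
        return out.toByteArray();
    }

    /** Inflates bytes produced by compress() and rebuilds the BitSet. */
    static BitSet decompress(byte[] compressed) {
        Inflater inflater = new Inflater();
        inflater.setInput(compressed);
        ByteArrayOutputStream out = new ByteArrayOutputStream();
        byte[] buffer = new byte[64 * 1024];
        try {
            while (!inflater.finished()) {
                int n = inflater.inflate(buffer);
                // No progress and not finished means the input is incomplete.
                if (n == 0 && !inflater.finished()
                        && (inflater.needsInput() || inflater.needsDictionary())) {
                    throw new IllegalArgumentException("truncated or invalid input");
                }
                out.write(buffer, 0, n);
            }
        } catch (DataFormatException e) {
            throw new IllegalArgumentException("not valid deflate data", e);
        } finally {
            inflater.end();
        }
        return BitSet.valueOf(out.toByteArray());
    }

    public static void main(String[] args) {
        BitSet bits = new BitSet(1000000);
        bits.set(3);
        bits.set(999999);
        byte[] packed = compress(bits);
        System.out.println("raw bytes:    " + bits.toByteArray().length);
        System.out.println("packed bytes: " + packed.length);
        System.out.println("round trip ok: " + decompress(packed).equals(bits));
    }
}
```

In the mapper, the result of compress(bitvector) would take the place of toByteArray(bitvector) in the row.add(...) call, and readers would call decompress() on the stored cell value. The uncompressed byte array still has to fit into the map task's heap while it is being compressed, so this reduces what is sent to the region servers rather than the mapper's own peak memory.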