From: Brian Jeltema
To: user@cassandra.apache.org
Subject: hadoop inserts blow out heap
Date: Wed, 12 Sep 2012 07:52:39 -0400

I'm a fairly novice Cassandra/Hadoop guy. I have written a Hadoop job (using the Cassandra/Hadoop integration API) that performs a full table scan and attempts to populate a new table from the results of the map/reduce. The read works fine and is fast, but the table insertion is failing with OOM errors (in the Cassandra VM). The resulting heap dump from one node shows that 2.9G of the heap is consumed by a JMXConfigurableThreadPoolExecutor that appears to be full of batch mutations.

I'm using a 6-node cluster, 32G per node, 8G heap, RF=3, if any of that matters.

Any suggestions would be appreciated regarding configuration changes or additional information I might capture to understand this problem.

Thanks

Brian J