Return-Path: X-Original-To: apmail-accumulo-dev-archive@www.apache.org Delivered-To: apmail-accumulo-dev-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 70CD4DF33 for ; Mon, 23 Jul 2012 15:34:42 +0000 (UTC) Received: (qmail 46317 invoked by uid 500); 23 Jul 2012 15:34:42 -0000 Delivered-To: apmail-accumulo-dev-archive@accumulo.apache.org Received: (qmail 46292 invoked by uid 500); 23 Jul 2012 15:34:42 -0000 Mailing-List: contact dev-help@accumulo.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: dev@accumulo.apache.org Delivered-To: mailing list dev@accumulo.apache.org Received: (qmail 46275 invoked by uid 99); 23 Jul 2012 15:34:41 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 23 Jul 2012 15:34:41 +0000 X-ASF-Spam-Status: No, hits=1.5 required=5.0 tests=HTML_MESSAGE,RCVD_IN_DNSWL_LOW,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (nike.apache.org: local policy) Received: from [206.112.75.238] (HELO iron-ugovb-out.osis.gov) (206.112.75.238) by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 23 Jul 2012 15:34:34 +0000 X-IronPort-Anti-Spam-Filtered: true X-IronPort-Anti-Spam-Result: AuUEAINuDVCsEAbx/2dsb2JhbABFuTCBD4IgAQEEAX4LCwQHOyEBEwUdGYd+AwazJw2JTopmZ4ZTA5N1gVSLCod+ X-IronPort-AV: E=Sophos;i="4.77,639,1336363200"; d="scan'208";a="247892" Received: from ghost-a.center.osis.gov (HELO mail-vb0-f41.google.com) ([172.16.6.241]) by iron-ugovb-in.osis.gov with ESMTP/TLS/RC4-SHA; 23 Jul 2012 11:31:30 -0400 Received: by vbkv13 with SMTP id v13so5790046vbk.0 for ; Mon, 23 Jul 2012 08:34:08 -0700 (PDT) Received: by 10.52.94.36 with SMTP id cz4mr11356675vdb.10.1343057648243; Mon, 23 Jul 2012 08:34:08 -0700 (PDT) MIME-Version: 1.0 Received: by 10.220.200.76 with HTTP; Mon, 23 Jul 2012 08:33:47 -0700 (PDT) In-Reply-To: <1821814328.145869.1343056892991.JavaMail.root@linzimmb04o.imo.intelink.gov> References: <1821814328.145869.1343056892991.JavaMail.root@linzimmb04o.imo.intelink.gov> From: John Vines Date: Mon, 23 Jul 2012 11:33:47 -0400 Message-ID: Subject: Re: memory usage & process distribution To: dev@accumulo.apache.org Content-Type: multipart/alternative; boundary=20cf307f31660954a904c580fabd --20cf307f31660954a904c580fabd Content-Type: text/plain; charset=ISO-8859-1 On Mon, Jul 23, 2012 at 11:21 AM, Miguel Pereira wrote: > Hey guys, > > I want to set up a realistic production cluster on Amazon's EC2 and I am > trying to decide 2 things. > > > - Memory usage > > If I use one of the example configuration files, say the 512MB does that > mean that all Accumulo processes will use up a total of 512MB? At least > this appears to be the case when looking at the accumulo-env.sh > This will determine weather I use a small or large instance. > > > Yes, it sets it up so all of the Accumulo processes have a footprint no bigger than 512MB. Mind you, we only have one configuration that is set up for things in a distributed fashion, which is 3GB. So if you're running multiple nodes, you can up some of the configurations for a larger footprint because you won't be running every process on every node. > - Process Distribution > > Is this a standard configuration? I will start off with a small # of worker > nodes ( 3-4 ) & hope to use my local machine as a "monitor" for the > accumulo & ganglia web UI's in order to avoid ssh -X latency. > > [ Name Node ] Name Node, Gmond > [ Secondary NN ] Secondary Name Node, Gmond > [ Job Tracker ] JobTracker, Gmond > [ Zookeeper ] Zookeeper > [ Accumulo Master ] Master, Tracer, Garbage Collector, Gmond, Jmxtrans > [ Monitor ] Monitor, Gmetad, Gweb > [ Worker Node ] DataNode, Tasktracker, TabletServer, Logger, Gmond, > Jmxtrans > > That looks good to me. Just make sure you configure your map reduce to that child memory * (reduce slots + map slots) aren't enough to cause swapping. > > Thanks, > > Miguel > John --20cf307f31660954a904c580fabd--