From: Grant Ingersoll
Subject: Re: Re : Good starting instance for AMI
Date: Mon, 18 Jan 2010 19:42:54 -0500
To: mahout-user@lucene.apache.org
Message-Id: <5DEEA81F-7EA8-4024-AC85-88E5F4DBFA08@apache.org>

On Jan 18, 2010, at 3:15 PM, Ted Dunning wrote:

> Is there an important difference between creating an existing AMI or using
> an existing AMI with a startup script that populates everything from S3?
>
> Building an AMI takes a few hours of time and is a total pain in the butt.
> My eventual result was that I didn't need to do it at all.
>
> I found that I had roughly three levels of variation in my production
> systems:
>
> - the OS
> - the infrastructural components like java, hadoop and zookeeper
> - the application that I wanted to run
>
> My initial thought was that the AMI should cover the first two aspects of
> variability. But I also found that I wanted to change the version of the
> infrastructure stuff fairly often in development of the AMI and not
> infrequently in production.
>
> For Mahout customers, I would imagine that there is a reasonable amount of
> variability in desired OS (Ubuntu versus Redhat versus Centos at least), JDK
> and Hadoop versions.

I only see a need for two: the version in trunk and the one in the latest release.

This is all well and good, but I have yet to see anyone say: here's the AMI, the download script and the instructions. So I'm just going to go ahead with what I think is useful for my needs, document it, and put it up there for people to use or not. If anything, it will be useful for me to do it since I've never set up a Hadoop cluster on EC2 before.

-Grant
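
For illustration only, here is a rough sketch of the layering Ted describes: a stock AMI supplies the OS, and a startup script pulls the infrastructure and application layers from S3 at boot. Nothing here comes from an actual Mahout or Hadoop script. The bucket name, the key naming scheme, the /opt install path and the "fetch-from-s3" command are all placeholders for whatever is really on the image. The Python below only renders the shell script as text, so it can be inspected without touching EC2.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Layer:
    """One downloadable layer (infrastructure or application). Hypothetical."""
    name: str     # e.g. "hadoop" or "mahout"
    version: str  # e.g. "trunk" or "release"


def s3_key(layer: Layer) -> str:
    # Assumed naming convention: <name>/<name>-<version>.tar.gz
    return f"{layer.name}/{layer.name}-{layer.version}.tar.gz"


def startup_script(bucket: str, layers: list[Layer],
                   fetch_cmd: str = "fetch-from-s3") -> str:
    """Render a boot-time shell script that installs each layer from S3.

    fetch_cmd is a placeholder for whichever S3 download tool the image has.
    """
    lines = ["#!/bin/sh", "set -e", "mkdir -p /opt"]
    for layer in layers:
        archive = f"/tmp/{layer.name}.tar.gz"
        # Download the archive for this layer, then unpack it under /opt.
        lines.append(f"{fetch_cmd} s3://{bucket}/{s3_key(layer)} {archive}")
        lines.append(f"tar -xzf {archive} -C /opt")
    return "\n".join(lines) + "\n"


if __name__ == "__main__":
    # Two variants are enough for the "trunk" versus "latest release" case.
    layers = [Layer("hadoop", "release"), Layer("mahout", "trunk")]
    print(startup_script("example-bucket", layers))
```

Switching from the trunk build to the latest release then means changing one version string and rebooting an instance, rather than rebuilding the AMI.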