Return-Path: X-Original-To: apmail-hadoop-mapreduce-user-archive@minotaur.apache.org Delivered-To: apmail-hadoop-mapreduce-user-archive@minotaur.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 0E903EFC6 for ; Fri, 11 Jan 2013 10:43:39 +0000 (UTC) Received: (qmail 45530 invoked by uid 500); 11 Jan 2013 10:43:31 -0000 Delivered-To: apmail-hadoop-mapreduce-user-archive@hadoop.apache.org Received: (qmail 45392 invoked by uid 500); 11 Jan 2013 10:43:30 -0000 Mailing-List: contact user-help@hadoop.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@hadoop.apache.org Delivered-To: mailing list user@hadoop.apache.org Received: (qmail 45359 invoked by uid 99); 11 Jan 2013 10:43:30 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 11 Jan 2013 10:43:30 +0000 X-ASF-Spam-Status: No, hits=1.5 required=5.0 tests=HTML_MESSAGE,RCVD_IN_DNSWL_LOW,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (nike.apache.org: domain of monoliv@gmail.com designates 209.85.212.54 as permitted sender) Received: from [209.85.212.54] (HELO mail-vb0-f54.google.com) (209.85.212.54) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 11 Jan 2013 10:43:24 +0000 Received: by mail-vb0-f54.google.com with SMTP id l1so1289556vba.13 for ; Fri, 11 Jan 2013 02:43:03 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=mime-version:in-reply-to:references:date:message-id:subject:from:to :content-type; bh=mxWrbx5qiPYN2aNmNZG/BSvoi/H2sFPbDQOztcSsMBQ=; b=EdpBQ0eJonB/q8ZZ0wNsGHbDuWdQTFI40vTFbaP8Wn/9i5M3iaTVrUX7+sloIkbz4G z3MCt5Ug4E69WPLjeO4w5dU8nMuHT+czwcr6j/v2TBFzY9vZuLd5Ynvj0NOxq0MLB0lu SKO16fB31nMuDiPu3x50MFnlr2TrNfjQp7MgfC3oFjBT2RA4WnSdsZS9jZBCqrWF8nE5 U78gB1DtUJQwFvm9lYwWd6dUS3i6AK9O5nHfAVcRGMJ+f+QsbaV95ovmMMU1Guq6g2hJ OeNgk2/pT4OJEM4skq3r1AO/dtwpW1EDSXlG2n1oY3SW4qRbRZy3/XKbFPX5zBw7x26V 3hYw== MIME-Version: 1.0 Received: by 10.58.162.130 with SMTP id ya2mr96620119veb.2.1357900983322; Fri, 11 Jan 2013 02:43:03 -0800 (PST) Received: by 10.58.247.195 with HTTP; Fri, 11 Jan 2013 02:43:03 -0800 (PST) In-Reply-To: <869970D71E26D7498BDAC4E1CA92226B3FCDB754@MBX021-E3-NJ-2.exch021.domain.local> References: <869970D71E26D7498BDAC4E1CA92226B3FCDB754@MBX021-E3-NJ-2.exch021.domain.local> Date: Fri, 11 Jan 2013 10:43:03 +0000 Message-ID: Subject: Re: Getting started recommendations From: Olivier Renault To: user@hadoop.apache.org Content-Type: multipart/alternative; boundary=047d7b6766dcc0472e04d300f58b X-Virus-Checked: Checked by ClamAV on apache.org --047d7b6766dcc0472e04d300f58b Content-Type: text/plain; charset=windows-1252 Content-Transfer-Encoding: quoted-printable Hi, Warning, I am a newby myself. Please find my answer inline. Good luck Olivier On 11 January 2013 10:29, John Lilley wrote: > We are somewhat new to Hadoop and are looking to run some experiments > with HDFS, Pig, and HBase. **** > > With that in mind, I have a few questions:**** > > What is the easiest (preferably free) Hadoop distro to get started with? > Cloudera? > Cloudera is probably easy. I've gone with the solution from Hortonworks. I've used their hmc ( Hortonworks Management Console ). It's a webui which installed all the components you desired on your behalf as well as installing monitoring ( ganglia + nagios ). HMC is based on Ambari ( apache project ). You can find some information on how to install it at : http://hortonworks.com/hdp11-hmc-quick-start-guide/ > **** > > What host OS distro/release is recommended? > CentOS6 / RHEL6 seems to be a good solution. > **** > > What is the easiest environment to get started with? Amazon EC2? Is > there anyone offering virtual/hosted prebuilt Hadoop instances? > I've installed it on EC2. It worked like a charm > **** > > Where would we find some =93big data=94 files that people have used for > testing purposes? > As part of the documentation, there is a map reduce tutorial. You can then use any files and use the wordcount examples. http://hadoop.apache.org/docs/r0.20.2/mapred_tutorial.html > **** > > Feel free to RTFM me to the right place ;-)**** > > Thanks, john**** > > ** ** > --047d7b6766dcc0472e04d300f58b Content-Type: text/html; charset=windows-1252 Content-Transfer-Encoding: quoted-printable
Hi,=A0

Warning, I am a newby myse= lf. Please find my answer inline.=A0

Go= od luck
Olivier
On 11 January 2013 10:29, John Lilley <john.lilley@redpoint.net<= /a>> wrote:

We are somewhat new to Hadoop and are looking to run = some experiments with HDFS, Pig, and HBase.=A0

With that in mind, I have a few questions:<= /u>

What is the easiest (preferably free) Hadoop distro t= o get started with?=A0 Cloudera?

What host OS distro/release is recommended?

CentOS6 / RHEL6 seems to be a good sol= ution.=A0
=A0

What is the easiest environment to get started with?= =A0 Amazon EC2? =A0Is there anyone offering virtual/hosted prebuilt Hadoop = instances?

I've installed it on EC2. It worked= like a charm
=A0

Where would we find some =93big data=94 files that pe= ople have used for testing purposes?

As part of the documentation, there is a map reduce tutorial. You can then = use any files and use the wordcount examples. http://hadoop.apache.org/docs/r0.= 20.2/mapred_tutorial.html

Feel free to RTFM me to the right place ;-)=

Thanks, john

=A0


--047d7b6766dcc0472e04d300f58b--