Return-Path: Delivered-To: apmail-hadoop-mapreduce-user-archive@minotaur.apache.org Received: (qmail 94443 invoked from network); 23 Feb 2011 09:18:49 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (140.211.11.3) by minotaur.apache.org with SMTP; 23 Feb 2011 09:18:49 -0000 Received: (qmail 58610 invoked by uid 500); 23 Feb 2011 09:18:49 -0000 Delivered-To: apmail-hadoop-mapreduce-user-archive@hadoop.apache.org Received: (qmail 58316 invoked by uid 500); 23 Feb 2011 09:18:46 -0000 Mailing-List: contact mapreduce-user-help@hadoop.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: mapreduce-user@hadoop.apache.org Delivered-To: mailing list mapreduce-user@hadoop.apache.org Received: (qmail 58307 invoked by uid 99); 23 Feb 2011 09:18:45 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 23 Feb 2011 09:18:45 +0000 X-ASF-Spam-Status: No, hits=-2.8 required=5.0 tests=HTML_MESSAGE,RCVD_IN_DNSWL_HI,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (athena.apache.org: local policy) Received: from [192.93.158.29] (HELO thsbbfxrt02p.thalesgroup.com) (192.93.158.29) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 23 Feb 2011 09:18:38 +0000 Received: from thsbbfxrt02p.thalesgroup.com (localhost.localdomain [127.0.0.1]) by localhost (Postfix) with SMTP id D625D37A833 for ; Wed, 23 Feb 2011 10:18:13 +0100 (CET) From: MONTMORY Alain To: "mapreduce-user@hadoop.apache.org" Date: Wed, 23 Feb 2011 10:18:14 +0100 Subject: RE: easiest way to install hadoop Thread-Topic: easiest way to install hadoop Thread-Index: AcvTC8bjPTMFcZFNSG6IuWGyFn5pzAAK3mwA Message-ID: <16031_1298452693_4D64D0D5_16031_14091_1_ECB646CBB62FD044927FDEDF0252D3E701DE120E41B5@THSONEA01CMS03P.one.grp> References: In-Reply-To: Accept-Language: fr-FR Content-Language: fr-FR X-MS-Has-Attach: X-MS-TNEF-Correlator: acceptlanguage: fr-FR x-pmwin-version: 3.1.0.0, Antivirus-Engine: 3.15.0, Antivirus-Data: 4.61E Content-Type: multipart/alternative; boundary="_000_ECB646CBB62FD044927FDEDF0252D3E701DE120E41B5THSONEA01CM_" MIME-Version: 1.0 --_000_ECB646CBB62FD044927FDEDF0252D3E701DE120E41B5THSONEA01CM_ Content-Type: text/plain; charset="iso-8859-1" Content-Transfer-Encoding: quoted-printable Hi, For my point of view it is not a trivial question... The latest "stable release" is 0.20.2 (embedded in cloudera CH3) (and not = 0.21)... When you start with hadoop recently (end 2010 for me) you are facing "old A= PI" depreceated, so you start with using new API... But in 0.20.2 not all the new API are available under mapreduce (example Mu= ltipleInput is not available), so you try 0.21 version where it is availabl= e... But the 0.21 seems for me not very stable (we are facing a "null pointer ex= eception" in framework logs without any idea to solve it), so we scope down= to 0.20.2 and we are using "Old API". search " Re: Which version to choose" in the mailing list and follow the ad= vice of Todd Lipton. The "old API" are not so depreceated, they will be supported for years beca= use there is thousand jobs running on them. The "new API" could be used whe= n a stable release will be up (0.22, 0.23..). It is the feeback of my personal experience where i lost time trying to use= the latest 0.21 version... Since i use cloudera 0.20.2+320 with old API an= d i don't have any problem (we are also using Cascading to simplify MR writ= ting with very little overhead on performance (6%) versus native hadoop MR = jobs. Overall we gain 4,65 factor versus traditionnal RDBMS approach.... Hopes this help you, regards Alain [@@THALES GROUP RESTRICTED@@] De : real great.. [mailto:greatness.hardness@gmail.com] Envoy=E9 : mercredi 23 f=E9vrier 2011 04:42 =C0 : mapreduce-user@hadoop.apache.org Objet : easiest way to install hadoop Hi, Very trivial question. Which is the easiest way to install hadoop? i mean which distribution should i go for?? apache or cloudera? n which is the easiest os for hadoop? -- Regards, R.V. --_000_ECB646CBB62FD044927FDEDF0252D3E701DE120E41B5THSONEA01CM_ Content-Type: text/html; charset="iso-8859-1" Content-Transfer-Encoding: quoted-printable

Hi,

 

For my point of view it is not a trivial question…

 

The latest "stable release" is 0.20.2 (embedded in= cloudera=A0 CH3) (and not 0.21)…

When you start with hadoop recently (end 2010 for me) you ar= e facing "old API" depreceated, so you start with using new APIR= 30;

But in 0.20.2 not all the new API are available under mapred= uce (example MultipleInput is not available), so you try 0.21 version where it = is available…

But the 0.21 seems for me not very stable (we are facing a "null pointer exeception" in framework logs without any idea to s= olve it), so we scope down to 0.20.2 and we are using "Old API".<= /o:p>

 

search " Re: Which version to choose" in = the mailing list and follow the advice of Todd Lipton.

The "old API" are not so depreceated, they will be supported for years because there is thousand jobs running on them. The "new API" could be used when a stable release will be up (0.22, 0= .23..).

 

It is the feeback of my personal experience where i lost tim= e trying to use the latest 0.21 version… Since i use cloudera 0.20.2+32= 0 with old API and i don't have any problem (we are also using Cascading to simpli= fy MR writting with very little overhead on performance (6%) versus native hadoop= MR jobs. Overall we gain 4,65 factor versus traditionnal RDBMS approach…= .

 

Hopes this help you,

 

regards

 

Alain

[@@THALES GROUP RESTRICTED@@]

 

De : real great.. [mailto:greatness.hardness@gmail.com]
Envoy=E9 : mercredi 23 f=E9vrier 2011 04:42
=C0 : mapreduce-user@hadoop.apache.org
Objet : easiest way to install hadoop

 

Hi,
Very trivial question.
Which is the easiest way to install hadoop?
i mean which distribution should i go for?? apache or cloudera?
n which is the easiest os for hadoop?

--
Regards,
R.V.

--_000_ECB646CBB62FD044927FDEDF0252D3E701DE120E41B5THSONEA01CM_--