hadoop-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Chris MacKenzie <stu...@chrismackenziephotography.co.uk>
Subject Re: Started learning Hadoop. Which distribution is best for native install in pseudo distributed mode?
Date Thu, 14 Aug 2014 09:58:26 GMT
Hi,

I have been using Hadoop since Christmas loosely and from May for an
Software engineering MSc at Heriot Watt University in Edinburgh, Scotland.
I have written a genetic sequence alignment algorithm.

I have installed Hadoop in various places including a 32 node cluster and
am using eclipse kepler sr 2 as an IDE.

My current Hadoop version is 2.4.1 which I download as a tar from the
apache mirror servers.

It¹s been a tough learning curve, but that has made the learning all the
more valuable.

I believe using the straight Hadoop version has given insights that
proprietary builds wouldn¹t have. There are so many confusing issues that
crop up, it¹s easy to attach importance to trying to fix the an error
which masks another. With the proprietary versions it would be easy to
attach blame where it¹s not that build or this builds fault.

Go with your heart but be prepared to work to solve the problems you
encounter.

Buy Tom Whites book, it isn¹t perfect and a couple of years out of date
but it gives you enough detail and structure to build an impression you
can work from. The downloadable source code is a great help when trying to
get started.

Good luck.


Regards,

Chris MacKenzie
telephone: 0131 332 6967
email: studio@chrismackenziephotography.co.uk
corporate: www.chrismackenziephotography.co.uk
<http://www.chrismackenziephotography.co.uk/>
<http://plus.google.com/+ChrismackenziephotographyCoUk/posts>
<http://www.linkedin.com/in/chrismackenziephotography/>






From:  "Adaryl \"Bob\" Wakefield, MBA" <adaryl.wakefield@hotmail.com>
Reply-To:  <user@hadoop.apache.org>
Date:  Thursday, 14 August 2014 01:13
To:  <user@hadoop.apache.org>
Subject:  Re: Started learning Hadoop. Which distribution is best for
native install in pseudo distributed mode?


He didn¹t ask for the best and nobody framed up their answer like that. He
asked what people were using. Out of the 10 responses only four of them
actually 
answered his question.
 
I¹ve been studying Hadoop for two months straight. Quite frankly, I wish
more people would ask for community input and what does what and how.
 
Adaryl 
"Bob" Wakefield, MBA
Principal
Mass Street 
Analytics
913.938.6685
www.linkedin.com/in/bobwakefieldmba
Twitter: 
@BobLovesData
 
From: Kilaru, Sambaiah <mailto:Sambaiah_Kilaru@intuit.com>
Sent: Wednesday, August 13, 2014 1:10 PM
To: user@hadoop.apache.org
Subject: Re: Started learning Hadoop. Which distribution is best for
native install in pseudo distributed mode?


 

Engough wars on going on which is best. You choose one of it and try to
learn and there is nothing that x is better or y is better.
It is upto your choice.
 
Thanks,
Sam
 
From: Sebastiano Di Paola <sebastiano.dipaola@gmail.com>
Reply-To: "user@hadoop.apache.org" <user@hadoop.apache.org>
Date: Wednesday, August 13, 2014 at 6:28
PM
To: "user@hadoop.apache.org" <user@hadoop.apache.org>
Subject: Re: Started learning Hadoop. Which
distribution is best for native install in pseudo distributed mode?

 
Hi,
I'm a newbie too and I'm not using any particular distribution. Just
download the component I need / want to try for my deploiment and use
them.

It's a slow process but allows me to better understand what I'm
doing under the hood.

Regards,
Seba



On Tue, Aug 12, 2014 at 10:12 PM, mani kandan <mankand007@gmail.com> wrote:

  Which distribution are you people using? Cloudera vs Hortonworks vs
  Biginsights? 



 



Mime
View raw message