Return-Path: X-Original-To: apmail-hadoop-common-user-archive@www.apache.org Delivered-To: apmail-hadoop-common-user-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id CCB7C1040B for ; Sat, 3 Jan 2015 12:46:04 +0000 (UTC) Received: (qmail 50349 invoked by uid 500); 3 Jan 2015 12:45:59 -0000 Delivered-To: apmail-hadoop-common-user-archive@hadoop.apache.org Received: (qmail 50238 invoked by uid 500); 3 Jan 2015 12:45:59 -0000 Mailing-List: contact user-help@hadoop.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@hadoop.apache.org Delivered-To: mailing list user@hadoop.apache.org Received: (qmail 50222 invoked by uid 99); 3 Jan 2015 12:45:56 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Sat, 03 Jan 2015 12:45:56 +0000 X-ASF-Spam-Status: No, hits=2.2 required=5.0 tests=HTML_MESSAGE,MIME_QP_LONG_LINE,RCVD_IN_DNSWL_NONE X-Spam-Check-By: apache.org Received-SPF: error (athena.apache.org: local policy) Received: from [79.99.40.135] (HELO auth-smtp-02.streamline.net) (79.99.40.135) by apache.org (qpsmtpd/0.29) with ESMTP; Sat, 03 Jan 2015 12:45:52 +0000 Received: by auth-smtp-02.streamline.net (Postfix, from userid 500) id 8033A17D7BA8; Sat, 3 Jan 2015 12:44:08 +0000 (GMT) X-Spam-Checker-Version: SpamAssassin 3.3.1 (2010-03-16) on auth-smtp-02.streamline.net Received: from [192.168.0.4] (unknown [94.5.24.146]) (Authenticated sender: studio@chrismackenziephotography.co.uk) by auth-smtp-02.streamline.net (Postfix) with ESMTP id 83CB117D7BA8 for ; Sat, 3 Jan 2015 12:44:04 +0000 (GMT) User-Agent: Microsoft-MacOutlook/14.4.7.141117 Date: Sat, 03 Jan 2015 12:44:03 +0000 Subject: Re: New to this group. From: Chris MacKenzie To: Message-ID: Thread-Topic: New to this group. References: <54A6DF57.6040302@gmail.com> <130F552B-8F6B-4501-B963-AC4B5B3267F2@gmail.com> In-Reply-To: Mime-version: 1.0 Content-type: multipart/alternative; boundary="B_3503133844_5464862" X-Virus-Checked: Checked by ClamAV on apache.org > This message is in MIME format. Since your mail reader does not understand this format, some or all of this message may not be legible. --B_3503133844_5464862 Content-type: text/plain; charset="ISO-8859-1" Content-transfer-encoding: quoted-printable Hi Krish, I completed an MSc project using Hadoop this summer from installation through to programming with the Java Api and then tuning. In all I did abou= t 14 weeks solid with limited unix, server experience and an academic knowledge of Java skills from my Masters course. I got an A ;O) Along the way I installed Eclipse, got Hadoop to work with it and built a genetic sequence alignment tool. It was hard work but I had a blast. I ran it on a 32 node cluster and got some good speedups. I=B9m also interested in developing my skills further and this BigPetStore application seems like a good way to go. Following my course I=B9m a trainee db admin for a global investment manager using Sybase. If you want to work on a collaborative project, I am sure I could share my Java skills and knowledge this far if you were happy to share your knowledg= e too. Why not connect on Linked In ;O) Regards, Chris MacKenzie telephone: 0131 332 6967 email: studio@chrismackenziephotography.co.uk From: Krish Donald Reply-To: Date: Friday, 2 January 2015 19:43 To: Subject: Re: New to this group. I would like to go towards administration side not in development side as I don't know java at all... On Fri, Jan 2, 2015 at 11:37 AM, Jay Vyas wrote: > Many demos out there are for the business community... >=20 > For a demonstration of hadoop at a finer grained level, how it's deployed= , > packaged, installed and used, for a developer who wants to learn hadoop "= the > hard way", =20 >=20 > I'd suggest : >=20 > 1 - Getting Apache bigtop stood up on VMs, and > 2 - running the BigPetStore application , which is meant to demonstrate e= nd to > end building testing and deployment of a hadoop batch analytics system wi= th > mapreduce, pig, and mahout. >=20 > This will also expose you to puppet, gradle, vagrant, all in a big data a= pp > which solves Real world problems like jar dependencies and multiple ecosy= stem > components. >=20 > Since BPS generates its own data, you don't waste time worrying about > external data sets, Twitter credentials, etc, and can test both on your l= aptop > and on a 100 node cluster (similar to teragen but for the whole ecosystem= ). >=20 > Since it features integration tests and tested on Bigtops hadoop distribu= tion, > (which is 100% pure Apache based), it's imo the purest learning source, n= ot > blurred with company specific downloads or branding. >=20 > Disclaimer : Of course I'm biased as I work on it... :) but we've been > working hard to make bigtop easily consumable as a gateway drug to bigdat= a > processing, and if you have solid linux and Java background, im sure othe= rs > would agree it's great place to get immersed in the hadoop ecosystem. >=20 > On Jan 2, 2015, at 1:05 PM, Krish Donald wrote: >=20 >> I would like to work on some kind of case studies like I have seen coupl= e on >> Horton works like twitter sentiment analysis, web log analysis etc. >>=20 >> But if somebody can give idea about other case studies which can be work= ed >> upon and can be put in resume later . >> As I don't have real time project experience. >>=20 >> On Fri, Jan 2, 2015 at 10:33 AM, Ted Yu wrote: >>> You can search for Open JIRAs which are related to admin. Here is an ex= ample >>> query: >>>=20 >>> https://issues.apache.org/jira/browse/HADOOP-9642?jql=3Dproject%20%3D%20H= ADOOP >>> %20AND%20status%20%3D%20Open%20AND%20text%20~%20%22admin%22 >>>=20 >>> FYI >>>=20 >>> On Fri, Jan 2, 2015 at 10:24 AM, Krish Donald wr= ote: >>>> I have fair understanding of hadoop eco system... >>>> I have setup multinode cluster using VMs in my personal laptop for Had= oop >>>> 2.0 . >>>> But beyond that i would like to work on some project to get a good hol= d on >>>> the subject. >>>>=20 >>>> I basically would like to go to into Hadoop Administartion side as my >>>> backgroud is RDBMS databases Admnistrator . >>>>=20 >>>> On Fri, Jan 2, 2015 at 10:11 AM, Wilm Schumacher >>>> wrote: >>>>> Hi, >>>>>=20 >>>>> the "standard" books may be a good start: >>>>>=20 >>>>> I liked the following >>>>>=20 >>>>> definitive guide: >>>>> http://www.amazon.de/Hadoop-Definitive-Guide-Tom-White/dp/1449311520 >>>>>=20 >>>>> hadoop in action: >>>>> http://www.manning.com/lam2/ >>>>>=20 >>>>> hadoop in practive: >>>>> http://www.manning.com/holmes2/ >>>>>=20 >>>>> A list is here: >>>>> http://wiki.apache.org/hadoop/Books >>>>>=20 >>>>> Hope this helps. >>>>>=20 >>>>> Best wishes, >>>>>=20 >>>>> Wilm >>>>>=20 >>>>> Am 02.01.2015 um 19:02 schrieb Krish Donald: >>>>>> > Hi, >>>>>> > >>>>>> > I am new to this group and hadoop. >>>>>> > Please help me to learn hadoop and suggest some self study project= . >>>>>> > >>>>>> > Thanks >>>>>> > Krish Donald >>>>>=20 >>>>=20 >>>=20 >>=20 --B_3503133844_5464862 Content-type: text/html; charset="ISO-8859-1" Content-transfer-encoding: quoted-printable
Hi Krish,

=
I completed an MSc project using Hadoop this summer from installa= tion through to programming with the Java Api and then tuning. In all I did = about 14 weeks solid with limited unix, server experience and an academic kn= owledge of Java skills from my Masters course. I got an A ;O)

=
Along the way I installed Eclipse, got Hadoop to work with it and= built a genetic sequence alignment tool. It was hard work but I had a blast= . I ran it on a 32 node cluster and got some good speedups.

I’m also interested in developing my skills further and this = BigPetStore application seems like a good way to go. Following my course I&#= 8217;m a trainee db admin for a global investment manager using Sybase.

If you want to work on a collaborative project, I am su= re I could share my Java skills and knowledge this far if you were happy to = share your knowledge too.

Why not connect on Linked= In ;O)


<= div style=3D"font-family:Calibri; font-size:11pt; text-align:left; color:black= ; BORDER-BOTTOM: medium none; BORDER-LEFT: medium none; PADDING-BOTTOM: 0in;= PADDING-LEFT: 0in; PADDING-RIGHT: 0in; BORDER-TOP: #b5c4df 1pt solid; BORDE= R-RIGHT: medium none; PADDING-TOP: 3pt">From:= Krish Donald <gotomypc27@g= mail.com>
Reply-To: <user@hadoop.apache.org>
Date:
Friday, 2 January 2015 19:43
To: <user@hadoop.apache.org>
Subj= ect: Re: New to this group.

I= would like to go towards administration side not in development side as I d= on't know java at all...

On Fri, Jan 2, 2015 at 11:37 AM, Jay Vyas <jayunit100.apache@gma= il.com> wrote:
Many demos out there are for the business community... 
For a demonstration of hadoop at a finer grained level, how it'= s deployed, packaged, installed and used, for a developer who wants to learn= hadoop "the hard way",  

I'd suggest :
<= div>
1 - Getting Apache bigtop stood up on VMs, and 
2 - running the BigPetStore application , which is meant to demonstrat= e end to end building testing and deployment of a hadoop batch analytics sys= tem with mapreduce, pig, and mahout.  

This wi= ll also expose you to puppet, gradle, vagrant, all in a big data app which s= olves Real world problems like jar dependencies and multiple ecosystem compo= nents.

Since BPS generates its own data, you don't =  waste time worrying about external data sets, Twitter credentials, etc= , and can test both on your laptop and on a 100 node cluster (similar to ter= agen but for the whole ecosystem).

Since it feature= s integration tests and tested on Bigtops hadoop distribution,  (which = is 100% pure Apache based), it's imo the purest learning source, not blurred= with company specific downloads or branding.

Discl= aimer : Of course I'm biased as I work on it... :)  but we've been work= ing hard to make bigtop easily consumable as a gateway drug to bigdata proce= ssing, and if you have solid linux and Java background, im sure others would= agree it's great place to get immersed in the hadoop ecosystem.
<= div class=3D"h5">

On Jan 2, 2015, at 1:05 PM, Krish Donald = <gotomypc27@gmail.c= om> wrote:

<= div>I would like to work on some kind of case studies like I have seen coupl= e on Horton works like twitter sentiment analysis, web log analysis etc.

But if somebody can give idea about other case studies= which can be worked upon and can be put in resume later .
As I do= n't have real time project experience.
<= br>
On Fri, Jan 2, 2015 at 10:33 AM, Ted Yu <yuzhihong@g= mail.com> wrote:

On Fri, Jan 2, 2015 at 10:24 AM, Kr= ish Donald <gotomypc27@gmail.com> wrote:
I have fair understa= nding of hadoop eco system...
I have setup multinode cluster using VMs in my personal laptop f= or Hadoop 2.0 .
But= beyond that i would like to work on some project to get a good hold on the = subject.

=
I basically would like t= o go to into Hadoop Administartion side as my backgroud is RDBMS databases A= dmnistrator .

On Fri, Jan 2, 2015 at 10:11 AM, Wilm Schumacher <wilm.schumacher@gmail= .com> wrote:
Hi,

the "standard" books may be a good start:

I liked the following

definitive guide:
http://www.amazon.de/Hadoop-Definit= ive-Guide-Tom-White/dp/1449311520

hadoop in action:
http://www.manning.com/lam2/

hadoop in practive:
http://www.manning.com/holmes2/

A list is here:
http://wiki.apache.org/hadoop/Books

Hope this helps.

Best wishes,

Wilm

Am 02.01.2015 um 19:02 schrieb Krish Donald:
> Hi,
>
> I am new to this group and hadoop.
> Please help me to learn hadoop and suggest some self study project . >
> Thanks
> Krish Donald





--B_3503133844_5464862--