hadoop-common-user mailing list archives

From Chris MacKenzie <stu...@chrismackenziephotography.co.uk>
Subject Re: New to this group.
Date Sat, 03 Jan 2015 12:44:03 GMT
Hi Krish,

I completed an MSc project using Hadoop this summer, from installation
through to programming with the Java API and then tuning. In all I did about
14 weeks solid, with limited Unix and server experience and an academic
knowledge of Java from my Masters course. I got an A ;O)

Along the way I installed Eclipse, got Hadoop to work with it and built a
genetic sequence alignment tool. It was hard work but I had a blast. I ran
it on a 32 node cluster and got some good speedups.

I'm also interested in developing my skills further and this BigPetStore
application seems like a good way to go. Following my course I'm a trainee
db admin for a global investment manager using Sybase.

If you want to work on a collaborative project, I am sure I could share my
Java skills and knowledge thus far if you were happy to share your knowledge.

Why not connect on LinkedIn ;O)


Chris MacKenzie
telephone: 0131 332 6967
email: studio@chrismackenziephotography.co.uk

From:  Krish Donald <gotomypc27@gmail.com>
Reply-To:  <user@hadoop.apache.org>
Date:  Friday, 2 January 2015 19:43
To:  <user@hadoop.apache.org>
Subject:  Re: New to this group.

I would like to go towards the administration side, not the development side,
as I don't know Java at all...

On Fri, Jan 2, 2015 at 11:37 AM, Jay Vyas <jayunit100.apache@gmail.com> wrote:
> Many demos out there are for the business community...
> For a demonstration of hadoop at a finer grained level, how it's deployed,
> packaged, installed and used, for a developer who wants to learn hadoop "the
> hard way",  
> I'd suggest :
> 1 - Getting Apache bigtop stood up on VMs, and
> 2 - running the BigPetStore application , which is meant to demonstrate end to
> end building testing and deployment of a hadoop batch analytics system with
> mapreduce, pig, and mahout.
> This will also expose you to puppet, gradle, vagrant, all in a big data app
> which solves Real world problems like jar dependencies and multiple ecosystem
> components.
> Since BPS generates its own data, you don't  waste time worrying about
> external data sets, Twitter credentials, etc, and can test both on your laptop
> and on a 100 node cluster (similar to teragen but for the whole ecosystem).
> Since it features integration tests and is tested on Bigtop's Hadoop
> distribution (which is 100% pure Apache based), it's imo the purest learning
> source, not blurred with company specific downloads or branding.
> Disclaimer: Of course I'm biased as I work on it... :)  but we've been
> working hard to make Bigtop easily consumable as a gateway drug to big data
> processing, and if you have a solid Linux and Java background, I'm sure others
> would agree it's a great place to get immersed in the Hadoop ecosystem.
> On Jan 2, 2015, at 1:05 PM, Krish Donald <gotomypc27@gmail.com> wrote:
>> I would like to work on some kind of case studies, like the couple I have
>> seen on Hortonworks, such as Twitter sentiment analysis, web log analysis etc.
>> Could somebody give an idea about other case studies which can be worked
>> upon and put on a resume later, as I don't have real-time project
>> experience?
>> On Fri, Jan 2, 2015 at 10:33 AM, Ted Yu <yuzhihong@gmail.com> wrote:
>>> You can search for Open JIRAs which are related to admin. Here is an example
>>> query:
>>> https://issues.apache.org/jira/browse/HADOOP-9642?jql=project%20%3D%20HADOOP%20AND%20status%20%3D%20Open%20AND%20text%20~%20%22admin%22
>>> FYI
>>> On Fri, Jan 2, 2015 at 10:24 AM, Krish Donald <gotomypc27@gmail.com> wrote:
>>>> I have a fair understanding of the Hadoop ecosystem...
>>>> I have set up a multinode cluster using VMs on my personal laptop for
>>>> Hadoop 2.0.
>>>> But beyond that I would like to work on some project to get a good hold on
>>>> the subject.
>>>> I basically would like to go into the Hadoop administration side, as my
>>>> background is RDBMS database administrator.
>>>> On Fri, Jan 2, 2015 at 10:11 AM, Wilm Schumacher
>>>> <wilm.schumacher@gmail.com> wrote:
>>>>> Hi,
>>>>> the "standard" books may be a good start:
>>>>> I liked the following
>>>>> definitive guide:
>>>>> http://www.amazon.de/Hadoop-Definitive-Guide-Tom-White/dp/1449311520
>>>>> hadoop in action:
>>>>> http://www.manning.com/lam2/
>>>>> hadoop in practice:
>>>>> http://www.manning.com/holmes2/
>>>>> A list is here:
>>>>> http://wiki.apache.org/hadoop/Books
>>>>> Hope this helps.
>>>>> Best wishes,
>>>>> Wilm
>>>>> Am 02.01.2015 um 19:02 schrieb Krish Donald:
>>>>>> > Hi,
>>>>>> >
>>>>>> > I am new to this group and hadoop.
>>>>>> > Please help me to learn hadoop and suggest some self study project
>>>>>> >
>>>>>> > Thanks
>>>>>> > Krish Donald
