hadoop-mapreduce-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From David Gkogkritsiani <davidg...@gmail.com>
Subject Hadoop MapReduce
Date Mon, 22 Apr 2013 23:01:59 GMT
Helllo,


I have undertaken my diploma thesis on Hadoop MapReduce and I have been
requested to I do an application written in MapReduce.
I found on internet this code and I ran the code :

http://paste.ubuntu.com/5591999/

How can I add the code to stores the pages somewhere locally (text only,
not Images) and then have to be processed . ie,I should a Mapreduce code,
which would download pages from the web and store on the local file system
and not the HDFS.
After ,I run the quest (program) in order to not depend on network speed.

Because ,my network is so slow.

I do this to improvement performance.

I am running Hadoop Version 0.20.2 .
I am new to Hadoop and am kinda lost and any help would be greatly
appreciated.

Thanks in advance for any assistance !

Mime
View raw message