hadoop-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Mahesh Balija <balijamahesh....@gmail.com>
Subject Re: Regarding Hadoop
Date Fri, 15 Feb 2013 07:28:25 GMT
Hadoop is a combination of frameworks each having its own purpose,
HDFS -> Distributed data storage, (i.e., if you have to manage huge amount
of data which may NOT fit into single machine then DFS (distributed file
system) is a way through which you can store and manage your data across
multiple machines)
Mapreduce -> Hadoop mapreduce is a parallel processing batch oriented
framework where you can run a single task in parallel across multiple
machines
Hbase -> is a distributed NOSQL data base (there are major differences b/w
a relational data base and Hbase)
and many other frameworks in the Hadoop eco-system includes Mahout, Hive,
Pig etc has their own applications.

One important note is that Hadoop run on a commodity hardware.

Best,
Mahesh Balija,
Calsoft Labs.

On Fri, Feb 15, 2013 at 12:08 PM, SrinivasaRao Kongar
<ksrinu471@gmail.com>wrote:

>
> Hi sir,
>
>   What is Hadoop technology? what is the main purpose of this Hadoop
> technology?
>
> --
>  Thanks&Regards,
> SrinivasaRao Kongara
>

Mime
View raw message