Hi all,

I have understood the Hadoop and Hadoop Ecosystem(Pig as ETL, Hive as DataWare house, Sqoop as importing tool). I worked and learned on single node cluster with demo data.

As Hadoop suits best on Unix platform. Please help me to understand the requirement form start to finish to use Hadoop in production.

What would be the things to use Hadoop on real time project.

like Hadoop automation on Unix, alert of failure process.

Please put some light on using Hadoop on real time and what objectives are recommended.


Thanks & Regards
Yogesh Kumar