Here's an extremely na´ve ballpark estimation: at theoretical hardware speed, for 3PB representing 1PB with 3x replication
Over a single 1Gbps connection (and I'm not sure, you can actually reach 1Gbps)
(3 petabytes) / (1 Gbps) = 291.271111 days
So you'd need at least 40,000 1Gbps network cards to get that in 10 minutes :) - (3PB/1Gbps)/40000
The actual number of nodes would depend a lot on the actual network architecture, the type of storage you use (SSD, HDD), etc.
Please clarify the below questions.
1. With in 10 minutes one petabyte of data load into HDFS/HIVE , how many slave (Data Nodes) machines required.
2. With in 10 minutes one petabyte of data load into HDFS/HIVE, what is the configuration setup for cloud computing.
Please suggest and help me on this.