hadoop-common-commits mailing list archives

From Apache Wiki <wikidi...@apache.org>
Subject [Hadoop Wiki] Update of "Virtual Hadoop" by SteveLoughran
Date Wed, 05 Oct 2011 20:56:39 GMT
Dear Wiki user,

You have subscribed to a wiki page or wiki category on "Hadoop Wiki" for change notification.

The "Virtual Hadoop" page has been changed by SteveLoughran:
http://wiki.apache.org/hadoop/Virtual%20Hadoop?action=diff&rev1=4&rev2=5

   1. If the cluster needs to be expanded for a longer duration, worker nodes acting as both
a DataNode and TaskTracker can be brought up.
   1. If the entire cluster goes down or restarts, all transient hard disks will be lost,
and all data stored within the HDFS cluster with it.
  
- The most significant implication is in storage. A core architectural design of both Google's
GFS and Hadoop's HDFS is that three-way replication onto local storage is the lowest cost way
of storing Petabytes of data.
+ The most significant implication is in storage. A core architectural design of both Google's
GFS and Hadoop's HDFS is that three-way replication onto local storage is ''a low-cost yet
reliable way of storing Petabytes of data.''
  
  In a cloud infrastructure, this design is no longer valid. If you assume that it is, you
will be disappointed when one day all your data disappears. Please do not complain if this
happens after reading this page: you have been warned.
  
