accumulo-notifications mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From GitBox <...@apache.org>
Subject [GitHub] [accumulo] belugabehr opened a new pull request #1328: ACCUMULO-1416: Remove FileSystemMonitor
Date Fri, 16 Aug 2019 17:46:17 GMT
belugabehr opened a new pull request #1328: ACCUMULO-1416: Remove FileSystemMonitor
URL: https://github.com/apache/accumulo/pull/1328
 
 
   Recently lost a TabletServer because a single data drive failed.  In a large cluster, failures
of data drives are assumed and the software must support this.  All of the file system interactions
should be through the Hadoop FileSystem API which works with HDFS in production, so there
should be no direct-disk access for the TabletServer to care about.  This PR removes this
FileSystemMonitor class that checks for dead drives and halts the TableServer if it finds
any.  Reading the JIRAs, it seems historically there are issues when the OS disk dies, but
that is usually not a common issue because the OS drive is often RAID-1.
   
   https://blog.cloudera.com/how-to-deploy-apache-hadoop-clusters-like-a-boss/
   
   Please bring into 1.x and 2.x branches

----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
users@infra.apache.org


With regards,
Apache Git Services

Mime
View raw message