accumulo-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Christopher <>
Subject Re: HDFS Replication of data
Date Thu, 10 Nov 2016 18:35:35 GMT
HDFS replication is transparent to Accumulo (though, the number of replicas
is configurable in Accumulo, on a per-table basis). Its primary purpose is
failure tolerance, but it *may* have an impact on read performance. I'm not
certain how significant that is, though.

There is no separate read-only and write-only copies of data on HDFS. HDFS
replication is at the block level, and files are updated by appending new
blocks to the files. All blocks are readable, and only new blocks are

On Thu, Nov 10, 2016 at 11:28 AM Yamini Joshi <> wrote:

> Hello all
> Does the HDFS replication improve performance of queries on Accumulo or is
> it transparent to the Accumulo system? If it does improve the performance
> by some notion of load balancing, is there is a Read Only or Write Only
> copy of data on HDFS for Accumulo?
> Best regards,
> Yamini Joshi

View raw message