hbase-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Shushant Arora <shushantaror...@gmail.com>
Subject bulk load doubts
Date Tue, 21 Jul 2015 14:20:54 GMT
1.Does bulk loaded HFile not  get replicated? Is it mean if a Regionserver
gets down , all Hfiles which were bulk loaded to this server are lost
irrespective of HDFS replication set to 3 ? if yes- Why bulk loaded HFiles
are not replicated.

2.Is there any issue in timestamp prefix as key of table- and used bulk
load for writing.

3.Does in bulk load MR job using HFileOutPutFormat2 as outputformat will
create single HFile per region ? Or it can be multiple Hfiles per region?
If multiple does loadIncrementalHFiles merges these Hfiles to 1 while
loading to same region or just do simple copy?

4.Is there any performance issue if I run bulk load every 5 sec -
containing ~20MB of data.Does it  creates frequent compactions and that
lead to performance issue?

  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message