hbase-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Allan Yang" <allan...@163.com>
Subject Re:Whether turning off assignSeqIds using hbase bulk load will cause adverse impact or not
Date Wed, 01 Mar 2017 11:25:15 GMT

Please refer to https://issues.apache.org/jira/browse/HBASE-10958
Assigning seqids to bulk loaded files is to avoid data loss when replay log.
If you are 1. all data are imported by bulkload other than insert or
 2. seqids in the loaded file are guaranteed small than the data in the memory
you can get rid of the assignSeqId process.

Allan Yang

At 2017-03-01 16:18:02, "沈国权" <shenguoquan1986@gmail.com> wrote:
>Hi community:
>    Currently our environment will import about 200G into HBase each day.
>These datas are cross six tables which table has about 100 Regions. We use
>HBase bulk tools completed these import
>tasks. We investigate HBase source code found when bulk loading task
>happen, HRegionServer will at firstly flush memstore according to each
>HRegion, We want to turn off assignSeqIds avoid each HRegion to force flush
>memstore. But We don't know these adverse impact if we do. Please help me
>to point out these impacts. Thanks
  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message