tephra-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Lars Hofhansl (JIRA)" <j...@apache.org>
Subject [jira] [Created] (TEPHRA-299) Executing a large batch delete is very slow
Date Sat, 13 Apr 2019 19:14:00 GMT
Lars Hofhansl created TEPHRA-299:

             Summary: Executing a large batch delete is very slow
                 Key: TEPHRA-299
                 URL: https://issues.apache.org/jira/browse/TEPHRA-299
             Project: Tephra
          Issue Type: Bug
    Affects Versions: 0.15.0-incubating
            Reporter: Lars Hofhansl
            Assignee: Poorna Chandra

I noticed that batch deletes are quire slow. In the profiler I found that almost all of the
time is spent in org.apache.hadoop.hbase.regionserver.wal.FSHLog.blockOnSync().

Looking at TransactionProcessor.preDelete it is obvious why:

The batch delete is translated into *single* puts that are added to the region one by one,
so each time the WAL is flushed.


This message was sent by Atlassian JIRA

View raw message