hbase-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From lars hofhansl <la...@apache.org>
Subject Re: HBase without compactions?
Date Tue, 19 Feb 2013 08:12:10 GMT
If you store data in LSM trees you need compactions.
The advantage is that your data files are immutable.
MapR has a mutable file system and they probably store their data in something more akin to
Or maybe they somehow avoid the expensive merge sorting of many small files. It seems that
is has to be one or the other.

(Maybe somebody from MapR reads this and can explain how it actually works.)

Compations let you trade random IO for sequential IO (just to state the obvious). It seems
that you can't have it both ways.

-- Lars

 From: Otis Gospodnetic <otis.gospodnetic@gmail.com>
To: user@hbase.apache.org 
Sent: Monday, February 18, 2013 7:30 PM
Subject: HBase without compactions?

It's kind of funny, we run SPM, which includes SPM for HBase (performance
monitoring service/tool for HBase essentially) and we currently store all
performance metrics in HBase.

I see a ton of HBase development activity, which is great, but it just
occurred to me that I don't think I recall seeing anything about getting
rid of compactions.  Yet, compactions are one thing that I know hurt us the
most and is one thing that MapR somehow got rid of in their implementation.

Have there been any discussions,attempts, or thoughts about finding a way
to avoid compactions?

HBASE Performance Monitoring - http://sematext.com/spm/index.html
  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message