hbase-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Rong-en Fan" <gra...@gmail.com>
Subject compression in HBase
Date Thu, 10 Jul 2008 14:51:58 GMT
I'm reading

http://jimbojw.com/wiki/index.php?title=Understanding_HBase_column-family_performance_options

but get confused about BLOCK and RECORD compression. In my
understanding, the these two options govern the underlying MapFile's
data file, which is a SequenceFile. In HBase, each key in the SequenceFile
is actually row/column/ts. So, specifying RECORD means each
value in *one* row/column/ts is compressed. With BLOCK, it
may cover the same row (since one row may have more than one
row/column/ts keys in the underlying MapFile). If this is correct,
then I don't get the point mentioned in the wiki above.

Any ideas?

Thanks,
Rong-En Fan

Mime
View raw message