subversion-dev mailing list archives

From Stefan Fuhrmann <stefan.fuhrm...@wandisco.com>
Subject Re: Numbers encoding in FSFS log addressing indexes
Date Wed, 25 Jun 2014 15:34:43 GMT
On Wed, Jun 25, 2014 at 5:09 PM, Ivan Zhakov <ivan@visualsvn.com> wrote:

> Subversion 1.8 and before in general uses human readable decimal
> format to store numbers in FSFS repositories on disk.


True. However, there are exceptions to that general rule.
The index data uses the same basic encoding as we
already use in txdelta. In both cases, encoding density
is critical to I/O performance.
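
To make the density point concrete, here is a minimal sketch of a 7-bits-per-byte variable-length integer encoding of the kind svndiff uses: the high bit of each byte is a continuation flag and the most significant 7-bit group comes first. The function names are hypothetical, and the byte order used by the index files may differ from this, so treat it as an illustration of the idea rather than the actual on-disk format.

```python
def encode_varint(value: int) -> bytes:
    """Encode a non-negative integer with 7 payload bits per byte.

    Assumption: most significant group first, high bit set on every
    byte except the last (svndiff-style). Names are illustrative only.
    """
    if value < 0:
        raise ValueError("only non-negative integers are supported")
    # Collect 7-bit groups, least significant first.
    groups = [value & 0x7F]
    value >>= 7
    while value:
        groups.append(value & 0x7F)
        value >>= 7
    # Emit the most significant group first; flag all but the last byte.
    groups.reverse()
    return bytes([g | 0x80 for g in groups[:-1]] + [groups[-1]])


def decode_varint(data: bytes, pos: int = 0) -> tuple[int, int]:
    """Decode one integer starting at pos; return (value, next position)."""
    value = 0
    while True:
        byte = data[pos]
        pos += 1
        value = (value << 7) | (byte & 0x7F)
        if not byte & 0x80:  # continuation bit clear: last byte
            return value, pos


if __name__ == "__main__":
    n = 1234567
    packed = encode_varint(n)
    # Compare against the human-readable decimal form plus a separator.
    print(len(packed), "bytes packed vs", len(str(n)) + 1, "bytes as text")
```

Small values, which dominate typical offset and size deltas, fit in one or two bytes this way, whereas the decimal form needs one byte per digit plus a separator.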

For instance, if you disable deltification in the ruby repo
(while keeping compression active), it explodes to 9.7GB,
a factor of 22.8. From that it should be obvious how
important space-efficient encoding is to Subversion.


> Log addressing
> implementation on trunk introduces new encoding for storing numbers in
> indexes. Quoting log addressing indexes format documentation [1]
>

I'm not even sure there is documentation for our txdelta
on-disk representation. So, FSFS indexes are doing a
better job in that department, ATM.

-- Stefan^2.
