Return-Path: X-Original-To: archive-asf-public-internal@cust-asf2.ponee.io Delivered-To: archive-asf-public-internal@cust-asf2.ponee.io Received: from cust-asf.ponee.io (cust-asf.ponee.io [163.172.22.183]) by cust-asf2.ponee.io (Postfix) with ESMTP id 833E5200CDA for ; Fri, 4 Aug 2017 16:36:24 +0200 (CEST) Received: by cust-asf.ponee.io (Postfix) id 8131916DA8D; Fri, 4 Aug 2017 14:36:24 +0000 (UTC) Delivered-To: archive-asf-public@cust-asf.ponee.io Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by cust-asf.ponee.io (Postfix) with SMTP id 9F12E16DA8A for ; Fri, 4 Aug 2017 16:36:23 +0200 (CEST) Received: (qmail 31667 invoked by uid 500); 4 Aug 2017 14:36:21 -0000 Mailing-List: contact dev-help@subversion.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Delivered-To: mailing list dev@subversion.apache.org Received: (qmail 31657 invoked by uid 99); 4 Aug 2017 14:36:21 -0000 Received: from pnap-us-west-generic-nat.apache.org (HELO spamd2-us-west.apache.org) (209.188.14.142) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 04 Aug 2017 14:36:21 +0000 Received: from localhost (localhost [127.0.0.1]) by spamd2-us-west.apache.org (ASF Mail Server at spamd2-us-west.apache.org) with ESMTP id F3AE61A0884 for ; Fri, 4 Aug 2017 14:36:20 +0000 (UTC) X-Virus-Scanned: Debian amavisd-new at spamd2-us-west.apache.org X-Spam-Flag: NO X-Spam-Score: 3.452 X-Spam-Level: *** X-Spam-Status: No, score=3.452 tagged_above=-999 required=6.31 tests=[DKIM_SIGNED=0.1, DKIM_VALID=-0.1, HTML_MESSAGE=2, RCVD_IN_DNSWL_NONE=-0.0001, RCVD_IN_MSPIKE_H3=-0.01, RCVD_IN_MSPIKE_WL=-0.01, RCVD_IN_SORBS_SPAM=0.5, SPF_SOFTFAIL=0.972] autolearn=disabled Authentication-Results: spamd2-us-west.apache.org (amavisd-new); dkim=pass (2048-bit key) header.d=assembla-com.20150623.gappssmtp.com Received: from mx1-lw-us.apache.org ([10.40.0.8]) by localhost (spamd2-us-west.apache.org [10.40.0.9]) (amavisd-new, port 10024) with ESMTP id l76TorGDV0za for ; Fri, 4 Aug 2017 14:36:19 +0000 (UTC) Received: from mail-oi0-f41.google.com (mail-oi0-f41.google.com [209.85.218.41]) by mx1-lw-us.apache.org (ASF Mail Server at mx1-lw-us.apache.org) with ESMTPS id 1A5835FC72 for ; Fri, 4 Aug 2017 14:36:19 +0000 (UTC) Received: by mail-oi0-f41.google.com with SMTP id x3so16655509oia.1 for ; Fri, 04 Aug 2017 07:36:19 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=assembla-com.20150623.gappssmtp.com; s=20150623; h=mime-version:in-reply-to:references:from:date:message-id:subject:to :cc; bh=e2NiNgrUVTVYpflISDgfD0ewKN+uga6EXbOVR+pnbYQ=; b=LlMmaJnO1m5AGWTWVpw8DRAijEdt/K6/IzcRsJvcGIoS2XCvOAzoFwn43KVuPDY6Gq +8dEn3q2Ghx07TdLUgS8lxVesblZ6vgXTWwtQmhgiFOcjeUJ1KAEwQtwkr1xM7jo4RqT KdGCR7DsdgDtHADtc1PfUrBPvnrCTUG932y6o956sfFhuWfHGFqJpd5W4Bw78Ovig+yi MKPxGvtsJyFzysKiDtyK2hmC7HG3acsmzaqMOuSbYPsbXqYPgaVb/4q9rZ+QDJzIXjD3 VwXSIyosrZ93nRD3m4GZSltiMcRbrsIKVYa12wcCrJ6bI6bt4thQ8wyl17cwyoJhOTCW YqGw== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:mime-version:in-reply-to:references:from:date :message-id:subject:to:cc; bh=e2NiNgrUVTVYpflISDgfD0ewKN+uga6EXbOVR+pnbYQ=; b=Veo8b5dezqbR75zzC015e8IA0GxkJSfdOVFzyulV+vY3sEn0muITPzBXq5xMyU4oLB p1Kgx5DrkhZ6miuIR8aLRSGmH6a0NCQLreqvW0rJJtsqkNVFvYNx2yUNuANdGz574D3R gijXGkH2yz8epdBOrmgSHd3QuNR/Ykwrfw8+H4gTpiMJBEK1OajjDbKX20YYRPBG133x JQGEiCEiPw5vzBHJ3+7B/n5ULt5Hj+VgESvJzTSebIjp0FjHoVQLEsDSkWfm2mLCXWl5 N77fAj+TUc5KmjCsAz/5k5/muyp+qzgUkgAqygACbIbBAmGPM0J8ijeTAzBcybWKcjKE /Y1g== X-Gm-Message-State: AHYfb5gJYXT5ksrn7wZOwLu6zptEkd8YVefbf7NuBw6VUUgYEuFlmalC b3RhcSO2p0qDc86eLB7LI8xxS5WBpgX6 X-Received: by 10.202.235.216 with SMTP id j207mr2189369oih.282.1501857372128; Fri, 04 Aug 2017 07:36:12 -0700 (PDT) MIME-Version: 1.0 Received: by 10.182.223.97 with HTTP; Fri, 4 Aug 2017 07:35:31 -0700 (PDT) In-Reply-To: <2D4384B1-34FF-42CB-9DD1-1AA6F3A2F6A4@gmail.com> References: <2D4384B1-34FF-42CB-9DD1-1AA6F3A2F6A4@gmail.com> From: Jacek Materna Date: Fri, 4 Aug 2017 09:35:31 -0500 Message-ID: Subject: Re: [RFC] Using LZ4 compression by default To: Nathan Hartman Cc: Subversion Development Content-Type: multipart/alternative; boundary="001a113cea722b5c190555ee6b40" archived-at: Fri, 04 Aug 2017 14:36:24 -0000 --001a113cea722b5c190555ee6b40 Content-Type: text/plain; charset="UTF-8" Based on what we are seeing - disk storage is at least 10x less concerning across a huge swatch of enterprise SVN users than UX (performance and speed of developer workflows). It's becoming less and less so each quarter. Unless the disk storage increase impacts the server side performance in aggregate I don't understand why the priority (biased defaults) would not be to increase the speed of every operation. ? On Fri, Aug 4, 2017 at 9:30 AM, Nathan Hartman wrote: > On Aug 2, 2017, at 2:59 PM, Evgeny Kotkov > wrote: > > With the recently added support for LZ4 compression (r1801940 et al), > > we now have an option of using it by default for the on-disk data and > > over the wire. > > [...] > > The amount of the required additional space is proportional > > to the difference between the compression ratio of LZ4 and zlib-5, > > which can be roughly estimated as around 30-35% for compressible > > binary and text files, although that may vary depending on the > > actual data. > > > > To illustrate how these changes will affect the speed of some of the > > operations, the 'svn import' of a 2 GB file over HTTP on LAN in my > > environment takes 18 seconds instead of 63 seconds. > > Regarding on disk storage, for small repos a 30% size increase is probably > not material, but it may be significant for large repos. Is it feasible to > get the best of both worlds by using LZ4 for fast commits and then > recompress using zlib in svnadmin pack? -- Jacek Materna Chief Technology Officer Assembla +1 210 410 7661 --001a113cea722b5c190555ee6b40 Content-Type: text/html; charset="UTF-8" Content-Transfer-Encoding: quoted-printable
Based on what we are seeing - disk storage is at least 10x= less concerning across a huge swatch of enterprise SVN users than UX (perf= ormance and speed of developer workflows). It's becoming less and less = so each quarter. Unless the disk storage increase impacts the server side p= erformance in aggregate I don't understand why the priority (biased def= aults) would not be to increase the speed of every operation.

?

On Fri, Aug 4, 2017 at 9:30 AM, Nathan Hartman <hartman.nathan@gm= ail.com> wrote:
On Aug 2, 2017, at 2:59 PM, Evgeny Kotkov <evgeny.kotkov@visualsvn.com> wrote:
> With the recently added support for LZ4 compression (r1801940 et al),<= br> > we now have an option of using it by default for the on-disk data and<= br> > over the wire.
> [...]
> The amount of the required additional space is propor= tional
>=C2=A0 =C2=A0 =C2=A0to the difference between the compression ratio of = LZ4 and zlib-5,
>=C2=A0 =C2=A0 =C2=A0which can be roughly estimated as around 30-35% for= compressible
>=C2=A0 =C2=A0 =C2=A0binary and text files, although that may vary depen= ding on the
>=C2=A0 =C2=A0 =C2=A0actual data.
>
> To illustrate how these changes will affect the speed of some of the > operations, the 'svn import' of a 2 GB file over HTTP on LAN i= n my
> environment takes 18 seconds instead of 63 seconds.

Regarding on disk storage, for small repos a 30% size increase is pr= obably not material, but it may be significant for large repos. Is it feasi= ble to get the best of both worlds by using LZ4 for fast commits and then r= ecompress using zlib in svnadmin pack?



--

Jacek Materna
Chief Technology Officer

Assem= bla
+1 210 410 7661
--001a113cea722b5c190555ee6b40--