From: Johan Corveleyn
Date: Thu, 13 Jan 2011 15:25:35 +0100
Subject: Re: Trival merge of big text file: Dismal performance, 540x faster if binary.
To: "krueger, Andreas (Andreas Krüger, DV-RATIO)", "users@subversion.apache.org"
In-Reply-To: <20110113130734.GC263@jack.stsp.name>

On Thu, Jan 13, 2011 at 2:07 PM, Stefan Sperling wrote:
> On Thu, Jan 13, 2011 at 01:55:58PM +0100, Johan Corveleyn wrote:
>> Textual merging in svn makes use of a variant of the standard diff
>> algorithm, namely diff3. Just a couple of days ago, I finally
>> succeeded in making diff3 take advantage of those performance
>> improvements (haven't committed this to the branch yet, but maybe I'll
>> get to it tonight).
>>
>> Would you be able to build an svn client from source? If so, could you
>> perhaps build a client from
>> http://svn.apache.org/repos/asf/subversion/branches/diff-optimizations-bytes
>> ?
>
> Hey Johan,
>
> I would be interested in doing testing and reviewing the changes
> on your branch. There might still be enough time to get them into 1.7.

Thanks, that would be great (btw, danielsh also expressed an interest
in reviewing the branch). I will try to give a status update on the
dev-list after I've committed the changes for diff3.

> I don't have any suitably large XML files though.
> If you and/or Andreas could provide some that would be great.

I was thinking of writing a Python script (as philip already
suggested) that can generate several variants of large files with
semi-random data. I have some prototype code for this lying around, so
if I find the time, I'll try to wrap this up and send it to the dev
list.

OTOH, real-world examples are probably even better.

Cheers,
--
Johan
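
For illustration only, here is a minimal sketch of what such a generator might look like. It is not the prototype code mentioned above. The function names, the XML-like line format, and the idea of changing one line near the start in one variant and one line near the end in the other are all assumptions chosen to produce a large file pair whose merge should be trivial.

```python
import random
import string


def make_lines(count, seed=0, width=60):
    """Return `count` pseudo-random XML-like lines.

    The same seed always yields the same lines, so test files can be
    regenerated instead of being shipped around.
    """
    rng = random.Random(seed)
    alphabet = string.ascii_letters + string.digits
    lines = []
    for i in range(count):
        payload = "".join(rng.choices(alphabet, k=width))
        lines.append('<item id="%d">%s</item>\n' % (i, payload))
    return lines


def make_variant(lines, positions, tag):
    """Return a copy of `lines` with the lines at `positions` replaced."""
    variant = list(lines)
    for pos in positions:
        variant[pos] = '<item id="%d">changed in %s</item>\n' % (pos, tag)
    return variant


def write_file(path, lines):
    """Write the lines with LF line endings, regardless of platform."""
    with open(path, "w", encoding="ascii", newline="\n") as f:
        f.writelines(lines)


def write_variants(prefix, count, seed=0):
    """Write a base file plus two variants that each change one line."""
    base = make_lines(count, seed=seed)
    write_file(prefix + "-base.xml", base)
    # One change near the start, one near the end: the two edits do not
    # overlap, so a three-way merge of them should not conflict.
    write_file(prefix + "-theirs.xml", make_variant(base, [1], "theirs"))
    write_file(prefix + "-mine.xml", make_variant(base, [count - 2], "mine"))
```

Calling something like `write_variants("big", 1000000)` would write three files of a million lines each. The line count, line width, and positions of the changes are knobs to vary when looking for the cases where diff3 gets slow.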