subversion-users mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Tony Sweeney" <tswee...@omnifone.com>
Subject RE: Trival merge of big text file: Dismal performance, 540x faster if binary.
Date Thu, 13 Jan 2011 15:58:39 GMT
Why bother with a script?  Just wget a few high traffic websites (slashdot, yahoo, dailykos,
google news) or similar into a file every now and again.

Tony. 

> -----Original Message-----
> From: Johan Corveleyn [mailto:jcorvel@gmail.com] 
> Sent: 13 January 2011 14:26
> To: krueger, Andreas (Andreas Kr├╝ger, DV-RATIO); 
> users@subversion.apache.org
> Subject: Re: Trival merge of big text file: Dismal 
> performance, 540x faster if binary.
> 
> On Thu, Jan 13, 2011 at 2:07 PM, Stefan Sperling 
> <stsp@elego.de> wrote:
> > On Thu, Jan 13, 2011 at 01:55:58PM +0100, Johan Corveleyn wrote:
> >> Textual merging in svn makes use of a variant of the standard diff 
> >> algorithm, namely diff3. Just a couple of days ago, I finally 
> >> succeeded in making diff3 take advantage of those performance 
> >> improvements (haven't committed this to the branch yet, but maybe 
> >> I'll get to it tonight).
> >>
> >> Would you be able to build an svn client from source? If so, could 
> >> you perhaps build a client from 
> >> 
> http://svn.apache.org/repos/asf/subversion/branches/diff-optimization
> >> s-bytes
> >> ?
> >
> > Hey Johan,
> >
> > I would be interested in doing testing and reviewing the changes on 
> > your branch. There might still be enough time to get them into 1.7.
> 
> Thanks, that would be great (btw, danielsh also expressed an 
> interest in reviewing the branch). I will try to give an 
> status update on the dev-list after I've committed the 
> changes for diff3.
> 
> > I don't have any suitably large XML files though.
> > If you and/or Andreas could provide some that would be great.
> 
> I was thinking of writing a python script (as philip already
> suggested) that can generate several variants of large files 
> with semi-random data. I have some prototype code for this 
> lying around, so if I find the time, I'll try to wrap this up 
> and send it to the dev list. OTOH, real-world examples are 
> probably even better.
> 
> Cheers,
> --
> Johan
> 
> ______________________________________________________________________
> This email has been scanned by the MessageLabs Email Security System.
> For more information please visit 
> http://www.messagelabs.com/email 
> ______________________________________________________________________
> 

Mime
View raw message