Mailing-List: contact dev-help@subversion.apache.org; run by ezmlm
Precedence: bulk
Received-SPF: pass (athena.apache.org: domain of stefan.fuhrmann@wandisco.com
 designates 209.85.213.181 as permitted sender)
MIME-Version: 1.0
In-Reply-To: <20141007112748.GC17513@ted.stsp.name>
References: <20141007104114.C0A3D2388993@eris.apache.org>
	<20141007112748.GC17513@ted.stsp.name>
Date: Tue, 7 Oct 2014 15:02:41 +0200
Message-ID: 
 <CA+t0gk0mDxBgXdfSJgbBA+M7gVL4cbMdgf4G5xp4Y_8xJDjWRg@mail.gmail.com>
Subject: Re: svn commit: r1629854 - in
 /subversion/trunk/subversion/libsvn_fs_fs:
 fs.c fs_fs.c
From: Stefan Fuhrmann <stefan.fuhrmann@wandisco.com>
To: Subversion Development <dev@subversion.apache.org>
Content-Type: multipart/alternative; boundary=089e013a22708134160504d4d079

--089e013a22708134160504d4d079
Content-Type: text/plain; charset=UTF-8

On Tue, Oct 7, 2014 at 1:27 PM, Stefan Sperling <stsp@elego.de> wrote:

> On Tue, Oct 07, 2014 at 10:41:14AM -0000, stefan2@apache.org wrote:
> > Author: stefan2
> > Date: Tue Oct  7 10:41:14 2014
> > New Revision: 1629854
> >
> > URL: http://svn.apache.org/r1629854
> > Log:
> > In FSFS, always use the same function to read the 'current' file.
> >
> > Apart from the consistency aspect, this no longer lets atoi() mask
> > 'current' file corruptions.  Recovery must be adopted to this.
>
> Hi Stefan,
>
> Two questions below:
>
> > --- subversion/trunk/subversion/libsvn_fs_fs/fs.c (original)
> > +++ subversion/trunk/subversion/libsvn_fs_fs/fs.c Tue Oct  7 10:41:14
> 2014
> > @@ -348,20 +349,47 @@ fs_open_for_recovery(svn_fs_t *fs,
> >                       apr_pool_t *pool,
> >                       apr_pool_t *common_pool)
> >  {
> > +  svn_error_t * err;
> > +  svn_revnum_t youngest_rev;
> > +  apr_pool_t * subpool = svn_pool_create(pool);
> > +
> >    /* Recovery for FSFS is currently limited to recreating the 'current'
> >       file from the latest revision. */
> >
> >    /* The only thing we have to watch out for is that the 'current' file
> > -     might not exist.  So we'll try to create it here unconditionally,
> > -     and just ignore any errors that might indicate that it's already
> > -     present. (We'll need it to exist later anyway as a source for the
> > -     new file's permissions). */
> > +     might not exist or contain garbage.  So we'll try to read it here
> > +     and provide or replace the existing file if we couldn't read it.
> > +     (We'll also need it to exist later anyway as a source for the new
> > +     file's permissions). */
> >
> > -  /* Use a partly-filled fs pointer first to create 'current'.  This
> will fail
> > -     if 'current' already exists, but we don't care about that. */
> > +  /* Use a partly-filled fs pointer first to create 'current'. */
> >    fs->path = apr_pstrdup(fs->pool, path);
> > -  svn_error_clear(svn_io_file_create(svn_fs_fs__path_current(fs, pool),
> > -                                     "0 1 1\n", pool));
> > +
> > +  SVN_ERR(initialize_fs_struct(fs));
>
> The 'fs' struct is provided by the caller and is now initialised
> and uninitialised within this function. Can't this function
> use a local 'fs' variable? If not, why does it need to be uninitialised
> again? This is a bit confusing -- though perhaps it's an idiom used in
> the FS code that I'm not aware of?
>

The code would actually be nicer if it used a temporary FS.
After all, we are only trying to fix / prepare the on-disk data
before properly open the repo and trying to recover it.

However, svn_fs_new() is deprecated and there is no nice
alternative. We could use apr_pmemdup, write our own init
code or so but all these approaches are slightly fragile and
blur the lines between libsvn_fs and libsvn_fs_fs.


> > +  /* Figure out the repo format and check that we can even handle it. */
> > +  SVN_ERR(svn_fs_fs__read_format_file(fs, subpool));
> > +
> > +  /* Now, read 'current' and try to patch it if necessary. */
> > +  err = svn_fs_fs__youngest_rev(&youngest_rev, fs, subpool);
> > +  if (err)
>
> Can't we check for a specific error code here, and return the
> error otherwise? This would make the intention of the error handling
> code explicit and avoid masking of arbitrary error conditions.
>

If we wanted to enumerate all "unsurprising" error conditions,
it might become quite a long list. After all, things are most
likely broken when you run recover. To me, it seems best
to try to get into a working state *despite* any previous errors.
r1629879 tries to explain that.

-- Stefan^2.

--089e013a22708134160504d4d079
Content-Type: text/html; charset=UTF-8
Content-Transfer-Encoding: quoted-printable

<div dir=3D"ltr"><div class=3D"gmail_extra"><div class=3D"gmail_quote">On T=
ue, Oct 7, 2014 at 1:27 PM, Stefan Sperling <span dir=3D"ltr">&lt;<a href=
=3D"mailto:stsp@elego.de" target=3D"_blank">stsp@elego.de</a>&gt;</span> wr=
ote:<br><blockquote class=3D"gmail_quote" style=3D"margin:0px 0px 0px 0.8ex=
;border-left:1px solid rgb(204,204,204);padding-left:1ex">On Tue, Oct 07, 2=
014 at 10:41:14AM -0000, <a href=3D"mailto:stefan2@apache.org">stefan2@apac=
he.org</a> wrote:<br>
&gt; Author: stefan2<br>
&gt; Date: Tue Oct=C2=A0 7 10:41:14 2014<br>
&gt; New Revision: 1629854<br>
&gt;<br>
&gt; URL: <a href=3D"http://svn.apache.org/r1629854" target=3D"_blank">http=
://svn.apache.org/r1629854</a><br>
&gt; Log:<br>
&gt; In FSFS, always use the same function to read the &#39;current&#39; fi=
le.<br>
&gt;<br>
&gt; Apart from the consistency aspect, this no longer lets atoi() mask<br>
&gt; &#39;current&#39; file corruptions.=C2=A0 Recovery must be adopted to =
this.<br>
<br>
Hi Stefan,<br>
<br>
Two questions below:<br>
<br>
&gt; --- subversion/trunk/subversion/libsvn_fs_fs/fs.c (original)<br>
&gt; +++ subversion/trunk/subversion/libsvn_fs_fs/fs.c Tue Oct=C2=A0 7 10:4=
1:14 2014<br>
&gt; @@ -348,20 +349,47 @@ fs_open_for_recovery(svn_fs_t *fs,<br>
&gt;=C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =
=C2=A0 =C2=A0apr_pool_t *pool,<br>
&gt;=C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =
=C2=A0 =C2=A0apr_pool_t *common_pool)<br>
&gt;=C2=A0 {<br>
&gt; +=C2=A0 svn_error_t * err;<br>
&gt; +=C2=A0 svn_revnum_t youngest_rev;<br>
&gt; +=C2=A0 apr_pool_t * subpool =3D svn_pool_create(pool);<br>
&gt; +<br>
&gt;=C2=A0 =C2=A0 /* Recovery for FSFS is currently limited to recreating t=
he &#39;current&#39;<br>
&gt;=C2=A0 =C2=A0 =C2=A0 =C2=A0file from the latest revision. */<br>
&gt;<br>
&gt;=C2=A0 =C2=A0 /* The only thing we have to watch out for is that the &#=
39;current&#39; file<br>
&gt; -=C2=A0 =C2=A0 =C2=A0might not exist.=C2=A0 So we&#39;ll try to create=
 it here unconditionally,<br>
&gt; -=C2=A0 =C2=A0 =C2=A0and just ignore any errors that might indicate th=
at it&#39;s already<br>
&gt; -=C2=A0 =C2=A0 =C2=A0present. (We&#39;ll need it to exist later anyway=
 as a source for the<br>
&gt; -=C2=A0 =C2=A0 =C2=A0new file&#39;s permissions). */<br>
&gt; +=C2=A0 =C2=A0 =C2=A0might not exist or contain garbage.=C2=A0 So we&#=
39;ll try to read it here<br>
&gt; +=C2=A0 =C2=A0 =C2=A0and provide or replace the existing file if we co=
uldn&#39;t read it.<br>
&gt; +=C2=A0 =C2=A0 =C2=A0(We&#39;ll also need it to exist later anyway as =
a source for the new<br>
&gt; +=C2=A0 =C2=A0 =C2=A0file&#39;s permissions). */<br>
&gt;<br>
&gt; -=C2=A0 /* Use a partly-filled fs pointer first to create &#39;current=
&#39;.=C2=A0 This will fail<br>
&gt; -=C2=A0 =C2=A0 =C2=A0if &#39;current&#39; already exists, but we don&#=
39;t care about that. */<br>
&gt; +=C2=A0 /* Use a partly-filled fs pointer first to create &#39;current=
&#39;. */<br>
&gt;=C2=A0 =C2=A0 fs-&gt;path =3D apr_pstrdup(fs-&gt;pool, path);<br>
&gt; -=C2=A0 svn_error_clear(svn_io_file_create(svn_fs_fs__path_current(fs,=
 pool),<br>
&gt; -=C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0=
 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0&quot;0 1 1\=
n&quot;, pool));<br>
&gt; +<br>
&gt; +=C2=A0 SVN_ERR(initialize_fs_struct(fs));<br>
<br>
The &#39;fs&#39; struct is provided by the caller and is now initialised<br=
>
and uninitialised within this function. Can&#39;t this function<br>
use a local &#39;fs&#39; variable? If not, why does it need to be uninitial=
ised<br>
again? This is a bit confusing -- though perhaps it&#39;s an idiom used in<=
br>
the FS code that I&#39;m not aware of?<br></blockquote><div><br></div><div>=
The code would actually be nicer if it used a temporary FS.<br>After all, w=
e are only trying to fix / prepare the on-disk data<br>before properly open=
 the repo and trying to recover it.<br><br></div><div>However, svn_fs_new()=
 is deprecated and there is no nice<br></div><div>alternative. We could use=
 apr_pmemdup, write our own init<br>code or so but all these approaches are=
 slightly fragile and<br></div><div>blur the lines between libsvn_fs and li=
bsvn_fs_fs.<br></div><div>=C2=A0</div><blockquote class=3D"gmail_quote" sty=
le=3D"margin:0px 0px 0px 0.8ex;border-left:1px solid rgb(204,204,204);paddi=
ng-left:1ex">&gt; +=C2=A0 /* Figure out the repo format and check that we c=
an even handle it. */<br>
&gt; +=C2=A0 SVN_ERR(svn_fs_fs__read_format_file(fs, subpool));<br>
&gt; +<br>
&gt; +=C2=A0 /* Now, read &#39;current&#39; and try to patch it if necessar=
y. */<br>
&gt; +=C2=A0 err =3D svn_fs_fs__youngest_rev(&amp;youngest_rev, fs, subpool=
);<br>
&gt; +=C2=A0 if (err)<br>
<br>
Can&#39;t we check for a specific error code here, and return the<br>
error otherwise? This would make the intention of the error handling<br>
code explicit and avoid masking of arbitrary error conditions.<br></blockqu=
ote><div><br></div><div>If we wanted to enumerate all &quot;unsurprising&qu=
ot; error conditions,<br>it might become quite a long list. After all, thin=
gs are most<br></div><div>likely broken when you run recover. To me, it see=
ms best<br>to try to get into a working state *despite* any previous errors=
.<br>r1629879 tries to explain that.<span style=3D"color:rgb(0,0,0);font-fa=
mily:Sans;font-size:medium;font-style:normal;font-variant:normal;font-weigh=
t:normal;letter-spacing:normal;line-height:normal;text-align:start;text-ind=
ent:0px;text-transform:none;white-space:pre-wrap;word-spacing:0px;backgroun=
d-color:rgb(173,189,200);display:inline!important;float:none"></span></div>=
<br></div>-- Stefan^2.<br></div></div>

--089e013a22708134160504d4d079--