httpd-users mailing list archives

From Jeff Trawick <traw...@gmail.com>
Subject Re: [users@httpd] Windows Apache 2.4.9 restarts itself
Date Tue, 29 Jul 2014 11:40:50 GMT
On Tue, Jul 29, 2014 at 5:33 AM, Mar Imp <marimpimp@gmail.com> wrote:

> Hi Jeff,
>
> After the first restart, apache restarted itself 2 more times
> yesterday. Good news: the additional information IS part of the logs:
> [Mon Jul 28 16:48:10.070355 2014] [mpm_winnt:crit] [pid 38752:tid 388]
> (OS 6)The handle is invalid.  : AH02644: Child: Event handle #0 (8) is
> invalid
> does it help you to identify the issue?
>

Unfortunately it does not immediately pinpoint the problem, but it can't
hurt :)  8 is the value of the handle that suddenly went bad (event handle
#0 in the new message).
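
For anyone collecting these entries from several logs, here is a minimal sketch in C of pulling the handle index and value out of an AH02644 entry. The struct and function names are made up for illustration and are not part of httpd. The value in parentheses is kept as text because nothing in this thread says which base it is printed in, and the entry is assumed to have been joined onto a single line first.

```c
#include <stdio.h>
#include <string.h>

/* Fields taken from one AH02644 entry (illustrative names, not httpd's). */
struct bad_handle {
    int  index;       /* number after "Event handle #" */
    char value[32];   /* text inside the parentheses, kept as a string */
};

/* Returns 1 and fills *out if `line` contains an AH02644 entry, else 0.
 * The log entry must already be joined onto one line. */
int parse_ah02644(const char *line, struct bad_handle *out)
{
    const char *p = strstr(line, "AH02644: Child: Event handle #");

    if (p == NULL)
        return 0;
    if (sscanf(p, "AH02644: Child: Event handle #%d (%31[^)])",
               &out->index, out->value) != 2)
        return 0;
    return 1;
}
```

Called on the entry quoted above, this fills in index 0 and the value text "8"; entries with other AH numbers are rejected.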

At the moment I think that the occurrence of multiple types of
handle-related errors is the biggest clue.  Consider your prior report
where accept() failed with WSAEINVAL (OS error 10022).

[Tue Jul 22 12:18:28.260820 2014] [mpm_winnt:notice] [pid 33168:tid
456] AH00354: Child: Starting 300 worker threads.^M
The Apache2.4 service is running.
[Mon Jul 28 12:12:32.738706 2014] [mpm_winnt:warn] [pid 33168:tid
5356] (OS 10022)An invalid argument was supplied.  : AH00344: accept()
failed.
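
For reference, the OS codes that show up in the entries in this thread correspond to these Windows error names: 6 is ERROR_INVALID_HANDLE, 10022 is WSAEINVAL, and 10038 is WSAENOTSOCK. A small sketch in C for labeling entries while scanning an error log follows; the function names are illustrative and not taken from httpd, and each entry is assumed to be on a single line.

```c
#include <stdio.h>
#include <string.h>

/* Returns the number inside the first "(OS n)" in `line`, or -1 if the
 * entry carries no OS error code. */
int os_error_from_line(const char *line)
{
    const char *p = strstr(line, "(OS ");
    int code;

    if (p == NULL || sscanf(p, "(OS %d)", &code) != 1)
        return -1;
    return code;
}

/* Symbolic Windows names for the three codes reported in this thread. */
const char *os_error_name(int code)
{
    switch (code) {
    case 6:     return "ERROR_INVALID_HANDLE";
    case 10022: return "WSAEINVAL";
    case 10038: return "WSAENOTSOCK";
    default:    return "(not seen in this thread)";
    }
}
```

Tallying the names per child pid would show at a glance whether a given restart involved more than one kind of handle error.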

On the surface, the accept() failure should be completely unrelated to the
WAIT_FAILED issue where I added the extra logging.  The documented reason on
Windows for accept() failing with WSAEINVAL is that listen() wasn't invoked
on the socket beforehand.  But listen() is handled at child process
initialization, the child process seems to have been active for 6 days (?)
prior to the error, and the thread would have been calling accept() the
entire time.  Perhaps the socket handle is overwritten with something else
that fails the is-socket-listening test but doesn't fail any other checks
that Windows may perform.
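
The listen()-before-accept() rule can be seen in isolation with a few lines of C. This is only an analogy: the sketch uses POSIX sockets rather than Winsock, so it says nothing about what mpm_winnt does, and the function name is made up. On Linux, accept() on a TCP socket that was never put into the listening state fails with EINVAL, the counterpart of the 10022 above.

```c
#include <errno.h>
#include <sys/socket.h>
#include <unistd.h>

/* Calls accept() on a fresh TCP socket that never had listen() called on
 * it.  Returns the errno reported by accept(), 0 if accept() unexpectedly
 * succeeded, or -1 if the socket could not be created at all. */
int accept_without_listen(void)
{
    int fd = socket(AF_INET, SOCK_STREAM, 0);
    int conn, saved;

    if (fd < 0)
        return -1;

    conn = accept(fd, NULL, NULL);   /* no bind(), no listen() */
    saved = errno;
    if (conn >= 0) {                 /* not expected */
        close(conn);
        saved = 0;
    }
    close(fd);
    return saved;
}
```

The point of the sketch is that the error depends only on the state of the object behind the descriptor, which is why a handle that silently starts referring to something else could produce it long after startup.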

--/--

Do you have vhost-specific error logs?  Can you check if something out of
the ordinary was logged in those error logs prior to this issue getting
reported in the main server?

What third-party modules are being used?

Maybe I should search again, but AFAIK only a few Apache httpd admins have
encountered this.  Whatever those installations have in common may hold a
clue, beyond the rarity itself.


>
> (it seems the "shutting down server" message gets stripped in case it
> is the first message in a newly created logfile -> this could be
> another bug)
>

Some of the lost messaging may be specific to using rotatelogs (unverified
theory ;) )  But I am aware that several messages, like "The Apache2.4
service is running." shown in the log snippet above, clobber the error log
too, whether or not rotatelogs is used.  They are written through a
different I/O mechanism, and the two mechanisms don't coordinate on
buffering.
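
As a general illustration of how two uncoordinated writers can lose a message, here is a minimal POSIX sketch in C. It is not a claim about how httpd or the Windows service code actually writes the error log; the file name, message texts, and function name are invented for the example. Writer A uses a buffered stdio stream, writer B writes directly through its own descriptor, and A's buffer is flushed later at A's own file offset.

```c
#define _POSIX_C_SOURCE 200809L
#include <fcntl.h>
#include <stdio.h>
#include <stdlib.h>
#include <string.h>
#include <unistd.h>

/* Writes two messages to one file through two uncoordinated paths, then
 * copies the resulting file content into `out`.  Returns 0 on success. */
int clobber_demo(char *out, size_t outlen)
{
    char path[] = "/tmp/clobber-demo-XXXXXX";
    const char *msg_b = "B: service running\n";
    FILE *a, *check;
    size_t n;
    int fd, b, ok;

    if (outlen == 0)
        return -1;
    fd = mkstemp(path);
    if (fd < 0)
        return -1;
    close(fd);

    /* Writer A: buffered stream; the text stays in A's buffer for now. */
    a = fopen(path, "w");
    if (a == NULL) { unlink(path); return -1; }
    fputs("A: child starting\n", a);

    /* Writer B: unbuffered write through a separate descriptor. */
    b = open(path, O_WRONLY);
    if (b < 0) { fclose(a); unlink(path); return -1; }
    ok = write(b, msg_b, strlen(msg_b)) == (ssize_t)strlen(msg_b);
    close(b);

    /* A flushes at its own offset 0, on top of what B just wrote. */
    fclose(a);

    check = fopen(path, "r");
    if (check == NULL) { unlink(path); return -1; }
    n = fread(out, 1, outlen - 1, check);
    out[n] = '\0';
    fclose(check);
    unlink(path);
    return ok ? 0 : -1;
}
```

After the call, A's line is in the file and B's text is gone, even though both writes reported success.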



> Below is the whole logfile from [Mon Jul 28 12:13:36.854715 2014] onward:
> [Mon Jul 28 12:13:36.854715 2014] [mpm_winnt:notice] [pid 38752:tid
> 388] AH00354: Child: Starting 300 worker threads.
> [Mon Jul 28 16:48:10.070355 2014] [mpm_winnt:crit] [pid 38752:tid 388]
> (OS 6)The handle is invalid.  : AH00356: Child: WAIT_FAILED --
> shutting down server
> [Mon Jul 28 16:48:10.070355 2014] [mpm_winnt:crit] [pid 38752:tid 388]
> (OS 6)The handle is invalid.  : AH02644: Child: Event handle #0 (8) is
> invalid
> [Mon Jul 28 16:48:10.129413 2014] [mpm_winnt:warn] [pid 38752:tid
> 4580] (OS 10038)An operation was attempted on something that is not a
> socket.  : AH00344: accept() failed.
> [Mon Jul 28 16:48:33.621926 2014] [mpm_winnt:warn] [pid 38752:tid
> 3456] (OS 10038)An operation was attempted on something that is not a
> socket.  : AH00344: accept() failed.
> [Mon Jul 28 16:48:42.016238 2014] [mpm_winnt:notice] [pid 38752:tid
> 388] AH00362: Child: Waiting 30 more seconds for 7 worker threads to
> finish.
> [Mon Jul 28 16:49:04.344906 2014] [mpm_winnt:notice] [pid 38752:tid
> 388] AH00364: Child: All worker threads have exited.
> [Mon Jul 28 16:49:04.464020 2014] [mpm_winnt:notice] [pid 13424:tid
> 536] AH00428: Parent: child process 38752 exited with status 0 --
> Restarting.
> [Mon Jul 28 16:49:05.165695 2014] [mpm_winnt:notice] [pid 13424:tid
> 536] AH00455: Apache/2.4.10 (Win64) OpenSSL/1.0.1h configured --
> resuming normal operations
> [Mon Jul 28 16:49:05.165695 2014] [mpm_winnt:notice] [pid 13424:tid
> 536] AH00456: Apache Lounge VC11 Server built: Jul 17 2014 12:58:29
> [Mon Jul 28 16:49:05.165695 2014] [core:notice] [pid 13424:tid 536]
> AH00094: Command line: 'C:\\Apache24\\bin\\httpd.exe -d C:/Apache24'
> [Mon Jul 28 16:49:05.166693 2014] [mpm_winnt:notice] [pid 13424:tid
> 536] AH00418: Parent: Created child process 33896
> [Mon Jul 28 16:49:06.463938 2014] [mpm_winnt:notice] [pid 33896:tid
> 384] AH00354: Child: Starting 300 worker threads.
> [Mon Jul 28 16:49:19.480318 2014] [mpm_winnt:crit] [pid 33896:tid 384]
> (OS 6)The handle is invalid.  : AH00356: Child: WAIT_FAILED --
> shutting down server
> [Mon Jul 28 16:49:19.480318 2014] [mpm_winnt:crit] [pid 33896:tid 384]
> (OS 6)The handle is invalid.  : AH02644: Child: Event handle #0 (8) is
> invalid
> [Mon Jul 28 16:49:20.201009 2014] [mpm_winnt:warn] [pid 33896:tid
> 4380] (OS 10038)An operation was attempted on something that is not a
> socket.  : AH00344: accept() failed.
> [Mon Jul 28 16:49:33.565195 2014] [mpm_winnt:warn] [pid 33896:tid
> 3404] (OS 10038)An operation was attempted on something that is not a
> socket.  : AH00344: accept() failed.
> [Mon Jul 28 16:49:51.421208 2014] [mpm_winnt:notice] [pid 33896:tid
> 384] AH00362: Child: Waiting 30 more seconds for 4 worker threads to
> finish.
> [Mon Jul 28 16:50:21.455441 2014] [mpm_winnt:notice] [pid 33896:tid
> 384] AH00362: Child: Waiting 0 more seconds for 1 worker threads to
> finish.
> [Mon Jul 28 16:50:21.555537 2014] [mpm_winnt:notice] [pid 33896:tid
> 384] AH00363: Child: Terminating 1 threads that failed to exit.
> [Mon Jul 28 16:50:21.555537 2014] [mpm_winnt:notice] [pid 33896:tid
> 384] AH00364: Child: All worker threads have exited.
> [Mon Jul 28 16:50:21.667649 2014] [mpm_winnt:notice] [pid 13424:tid
> 536] AH00428: Parent: child process 33896 exited with status 0 --
> Restarting.
> [Mon Jul 28 16:50:22.399347 2014] [mpm_winnt:notice] [pid 13424:tid
> 536] AH00455: Apache/2.4.10 (Win64) OpenSSL/1.0.1h configured --
> resuming normal operations
> [Mon Jul 28 16:50:22.399347 2014] [mpm_winnt:notice] [pid 13424:tid
> 536] AH00456: Apache Lounge VC11 Server built: Jul 17 2014 12:58:29
> [Mon Jul 28 16:50:22.399347 2014] [core:notice] [pid 13424:tid 536]
> AH00094: Command line: 'C:\\Apache24\\bin\\httpd.exe -d C:/Apache24'
> [Mon Jul 28 16:50:22.399347 2014] [mpm_winnt:notice] [pid 13424:tid
> 536] AH00418: Parent: Created child process 34808
> [Mon Jul 28 16:50:23.736629 2014] [mpm_winnt:notice] [pid 34808:tid
> 464] AH00354: Child: Starting 300 worker threads.
>
> 2014-07-28 12:44 GMT+02:00 Mar Imp <marimpimp@gmail.com>:
> > Hi Jeff,
> >
> > I installed 2.4.10 on 22.07.2014 and it was running fine for 6 days.
> > Today again a restart happened -> again without the important row in
> > logfile:
> >
> > these are the logs:
> >
> > Starting the Apache2.4 service
> > [Tue Jul 22 12:18:26.942556 2014] [mpm_winnt:notice] [pid 13424:tid
> > 536] AH00455: Apache/2.4.10 (Win64) OpenSSL/1.0.1h configured --
> > resuming normal operations^M
> > [Tue Jul 22 12:18:26.942556 2014] [mpm_winnt:notice] [pid 13424:tid
> > 536] AH00456: Apache Lounge VC11 Server built: Jul 17 2014 12:58:29^M
> > [Tue Jul 22 12:18:26.942556 2014] [core:notice] [pid 13424:tid 536]
> > AH00094: Command line: 'C:\\Apache24\\bin\\httpd.exe -d C:/Apache24'^M
> > [Tue Jul 22 12:18:26.942556 2014] [mpm_winnt:notice] [pid 13424:tid
> > 536] AH00418: Parent: Created child process 33168^M
> > [Tue Jul 22 12:18:28.260820 2014] [mpm_winnt:notice] [pid 33168:tid
> > 456] AH00354: Child: Starting 300 worker threads.^M
> > The Apache2.4 service is running.
> > [Mon Jul 28 12:12:32.738706 2014] [mpm_winnt:warn] [pid 33168:tid
> > 5356] (OS 10022)An invalid argument was supplied.  : AH00344: accept()
> > failed.
> > [Mon Jul 28 12:13:04.673283 2014] [mpm_winnt:notice] [pid 33168:tid
> > 456] AH00362: Child: Waiting 30 more seconds for 8 worker threads to
> > finish.
> > [Mon Jul 28 12:13:34.708657 2014] [mpm_winnt:notice] [pid 33168:tid
> > 456] AH00362: Child: Waiting 0 more seconds for 3 worker threads to
> > finish.
> > [Mon Jul 28 12:13:34.808754 2014] [mpm_winnt:notice] [pid 33168:tid
> > 456] AH00363: Child: Terminating 3 threads that failed to exit.
> > [Mon Jul 28 12:13:34.808754 2014] [mpm_winnt:notice] [pid 33168:tid
> > 456] AH00364: Child: All worker threads have exited.
> > [Mon Jul 28 12:13:34.928869 2014] [mpm_winnt:notice] [pid 13424:tid
> > 536] AH00428: Parent: child process 33168 exited with status 0 --
> > Restarting.
> > [Mon Jul 28 12:13:35.592504 2014] [mpm_winnt:notice] [pid 13424:tid
> > 536] AH00455: Apache/2.4.10 (Win64) OpenSSL/1.0.1h configured --
> > resuming normal operations
> > [Mon Jul 28 12:13:35.592504 2014] [mpm_winnt:notice] [pid 13424:tid
> > 536] AH00456: Apache Lounge VC11 Server built: Jul 17 2014 12:58:29
> > [Mon Jul 28 12:13:35.592504 2014] [core:notice] [pid 13424:tid 536]
> > AH00094: Command line: 'C:\\Apache24\\bin\\httpd.exe -d C:/Apache24'
> > [Mon Jul 28 12:13:35.593505 2014] [mpm_winnt:notice] [pid 13424:tid
> > 536] AH00418: Parent: Created child process 38752
> > [Mon Jul 28 12:13:36.854715 2014] [mpm_winnt:notice] [pid 38752:tid
> > 388] AH00354: Child: Starting 300 worker threads.
> >
> > -> I'm afraid the patch indeed introduced a bug. the "shutting down
> > server" message is missing in the logfile.
> > Any ideas?
> >
> > 2014-07-17 21:07 GMT+02:00 Jeff Trawick <trawick@gmail.com>:
> >> On Thu, Jul 17, 2014 at 10:07 AM, Mar Imp <marimpimp@gmail.com> wrote:
> >>>
> >>> Hi Jeff,
> >>>
> >>> no, there is nothing before. "error_20140716_000000.log" (yesterday) ends
> >>> with:
> >>>
> >>> [Wed Jul 16 10:17:02.436304 2014] [mpm_winnt:notice] [pid 35212:tid
> >>> 384] AH00354: Child: Starting 300 worker threads.
> >>> The Apache2.4 service is running.
> >>>
> >>> and "error_20140717_000000.log" (today) starts with:
> >>> [Thu Jul 17 15:12:13.091225 2014] [mpm_winnt:notice] [pid 35212:tid
> >>> 384] AH00362: Child: Waiting 30 more seconds for 16 worker threads to
> >>> finish.
> >>>
> >>>
> >>> :(
> >>
> >>
> >> I think you got cheated out of some log messages.
> >>
> >> The patch shouldn't hurt anyone barring a compiler bug (famous last words).
> >> It logs some extra information in the error path you reported originally:
> >>
> >> [Fri May 23 07:53:58.344457 2014] [mpm_winnt:crit] [pid 42540:tid 380]
> >> (OS 6)The handle is invalid.  : AH00356: Child: WAIT_FAILED --
> >> shutting down server
> >>
> >> 2.4.10 will have the same logic for everyone on Windows.  Hopefully we can
> >> get more information from someone.
> >>
> >>
> >>
> >>>
> >>>
> >>>
> >>> 2014-07-17 15:59 GMT+02:00 Jeff Trawick <trawick@gmail.com>:
> >>> > On Thu, Jul 17, 2014 at 9:33 AM, Mar Imp <marimpimp@gmail.com> wrote:
> >>> >>
> >>> >> Hi,
> >>> >>
> >>> >> I got a new build from apachelounge. Yesterday I started the server, and
> >>> >> today apache restarted itself. Unfortunately the logfile says even
> >>> >> less than before:
> >>> >>
> >>> >> [Thu Jul 17 15:12:13.091225 2014] [mpm_winnt:notice] [pid 35212:tid
> >>> >> 384] AH00362: Child: Waiting 30 more seconds for 16 worker threads to
> >>> >> finish.
> >>> >
> >>> >
> >>> > Is there nothing before this message?
> >>> >
> >>> >>
> >>> >> [Thu Jul 17 15:12:26.333012 2014] [mpm_winnt:warn] [pid 35212:tid
> >>> >> 4288] (OS 10038)An operation was attempted on something that is not a
> >>> >> socket.  : AH00344: accept() failed.
> >>> >> [Thu Jul 17 15:12:43.127156 2014] [mpm_winnt:notice] [pid 35212:tid
> >>> >> 384] AH00362: Child: Waiting 0 more seconds for 4 worker threads to
> >>> >> finish.
> >>> >> [Thu Jul 17 15:12:43.227264 2014] [mpm_winnt:notice] [pid 35212:tid
> >>> >> 384] AH00363: Child: Terminating 4 threads that failed to exit.
> >>> >> [Thu Jul 17 15:12:43.227264 2014] [mpm_winnt:notice] [pid 35212:tid
> >>> >> 384] AH00364: Child: All worker threads have exited.
> >>> >> [Thu Jul 17 15:12:43.344363 2014] [mpm_winnt:notice] [pid 36412:tid
> >>> >> 536] AH00428: Parent: child process 35212 exited with status 0 --
> >>> >> Restarting.
> >>> >> [Thu Jul 17 15:12:44.020013 2014] [mpm_winnt:notice] [pid 36412:tid
> >>> >> 536] AH00455: Apache/2.4.9 (Win64) OpenSSL/1.0.1h configured --
> >>> >> resuming normal operations
> >>> >> [Thu Jul 17 15:12:44.020013 2014] [mpm_winnt:notice] [pid 36412:tid
> >>> >> 536] AH00456: Apache Lounge VC11 Server built: Jul  9 2014 14:14:36
> >>> >> [Thu Jul 17 15:12:44.020013 2014] [core:notice] [pid 36412:tid 536]
> >>> >> AH00094: Command line: 'C:\\Apache24\\bin\\httpd.exe -d C:/Apache24'
> >>> >> [Thu Jul 17 15:12:44.021014 2014] [mpm_winnt:notice] [pid 36412:tid
> >>> >> 536] AH00418: Parent: Created child process 25540
> >>> >> [Thu Jul 17 15:12:45.335272 2014] [mpm_winnt:notice] [pid 25540:tid
> >>> >> 384] AH00354: Child: Starting 300 worker threads.
> >>> >>
> >>> >>
> >>> >> -> the first line with the reason for the server restart ("The handle
> >>> >> is invalid.  : AH00356: Child: WAIT_FAILED -- shutting down server")
> >>> >> is not in the logfile at all!
> >>> >> Has a new bug been introduced?
> >>> >>
> >>> >> thx for your help
> >>> >>
> >>> >> 2014-07-09 13:48 GMT+02:00 Jeff Trawick <trawick@gmail.com>:
> >>> >> > On Wed, Jul 9, 2014 at 7:36 AM, Mar Imp <marimpimp@gmail.com> wrote:
> >>> >> >>
> >>> >> >> hi,
> >>> >> >>
> >>> >> >> this is the answer I got from an apachelounge guy:
> >>> >> >>
> >>> >> >>
> >>> >> >> Does not build with the patch at
> >>> >> >> http://svn.apache.org/viewvc?view=revision&revision=1606368
> >>> >> >>
> >>> >> >> Error    85    error C2065: 'my_generation' : undeclared identifier
> >>> >> >>     E:\VC11\Win64\httpd-2.4.9\server\mpm\winnt\child.c    1051
> >>> >> >> Error    86    error C2065: 'my_generation' : undeclared identifier
> >>> >> >>     E:\VC11\Win64\httpd-2.4.9\server\mpm\winnt\child.c    1076
> >>> >> >>
> >>> >> >>
> >>> >> >
> >>> >> >
> >>> >> > Interesting...   It looks like they picked up other code from httpd
> >>> >> > trunk instead of just picking up that one patch.  (my_generation is in
> >>> >> > trunk but not part of that patch.)
> >>> >> >
> >>> >> > As it turns out, the patch is now integrated into the 2.4.x branch for
> >>> >> > 2.4.10 (coming up in the next week or two).  They can use this file
> >>> >> > unchanged/unpatched with 2.4.9:
> >>> >> >
> >>> >> > http://svn.apache.org/viewvc/httpd/httpd/branches/2.4.x/server/mpm/winnt/child.c?view=co
> >>> >> >
> >>> >> > or just wait for 2.4.10.
> >>> >> >
> >>> >> > Thanks for trying!
> >>> >> >
> >>> >> >
> >>> >> >>
> >>> >> >> 138    IntelliSense: a value of type "void *" cannot be assigned to an entity of type "core_server_config *"
> >>> >> >>     e:\VC11\Win64\httpd-2.4.9\server\mpm\winnt\child.c    315
> >>> >> >> 139    IntelliSense: a value of type "void *" cannot be assigned to an entity of type "char *"
> >>> >> >>     e:\VC11\Win64\httpd-2.4.9\server\mpm\winnt\child.c    458
> >>> >> >> 140    IntelliSense: a value of type "void *" cannot be assigned to an entity of type "sockaddr *"
> >>> >> >>     e:\VC11\Win64\httpd-2.4.9\server\mpm\winnt\child.c    632
> >>> >> >> 141    IntelliSense: a value of type "void *" cannot be assigned to an entity of type "sockaddr *"
> >>> >> >>     e:\VC11\Win64\httpd-2.4.9\server\mpm\winnt\child.c    635
> >>> >> >> 142    IntelliSense: a value of type "void *" cannot be used to initialize an entity of type "winnt_conn_ctx_t *"
> >>> >> >>     e:\VC11\Win64\httpd-2.4.9\server\mpm\winnt\child.c    789
> >>> >> >> 143    IntelliSense: a value of type "PVOID" cannot be assigned to an entity of type "apr_bucket *"
> >>> >> >>     e:\VC11\Win64\httpd-2.4.9\server\mpm\winnt\child.c    791
> >>> >> >> 144    IntelliSense: a value of type "PVOID" cannot be assigned to an entity of type "apr_bucket *"
> >>> >> >>     e:\VC11\Win64\httpd-2.4.9\server\mpm\winnt\child.c    842
> >>> >> >> 145    IntelliSense: a value of type "HANDLE" cannot be assigned to an entity of type "HANDLE *"
> >>> >> >>     e:\VC11\Win64\httpd-2.4.9\server\mpm\winnt\child.c    1040
> >>> >> >> 146    IntelliSense: identifier "my_generation" is undefined
> >>> >> >>     e:\VC11\Win64\httpd-2.4.9\server\mpm\winnt\child.c    1051
> >>> >> >> 147    IntelliSense: identifier "my_generation" is undefined
> >>> >> >>     e:\VC11\Win64\httpd-2.4.9\server\mpm\winnt\child.c    1076
> >>> >> >> 148    IntelliSense: a value of type "void *" cannot be assigned to an entity of type "int *"
> >>> >> >>     e:\VC11\Win64\httpd-2.4.9\server\mpm\winnt\child.c    1082
> >>> >> >> 149    IntelliSense: a value of type "void *" cannot be assigned to an entity of type "int *"
> >>> >> >>     e:\VC11\Win64\httpd-2.4.9\server\mpm\winnt\child.c    1330
> >>> >> >>
> >>> >> >>
> >>> >> >>
> >>> >> >>
> >>> >> >> 2014-06-28 17:06 GMT+02:00 Jeff Trawick <trawick@gmail.com>:
> >>> >> >>
> >>> >> >>> On Fri, Jun 27, 2014 at 7:53 AM, Jeff Trawick <trawick@gmail.com> wrote:
> >>> >> >>>>
> >>> >> >>>> On Thu, Jun 26, 2014 at 4:18 AM, Mar Imp <marimpimp@gmail.com> wrote:
> >>> >> >>>>>
> >>> >> >>>>> Hi,
> >>> >> >>>>>
> >>> >> >>>>> Thank you for the response!
> >>> >> >>>>>
> >>> >> >>>>> I looked into child.c, and see that the first line in Error log:
> >>> >> >>>>> (OS 6)The handle is invalid.  : AH00356: Child: WAIT_FAILED --
> >>> >> >>>>> shutting down server
> >>> >> >>>>> comes from:
> >>> >> >>>>> child.c Line 1125+: rv = WaitForMultipleObjects(num_events, (HANDLE *)child_events, FALSE, ……
> >>> >> >>>>> This is the place that causes the restart.
> >>> >> >>>>
> >>> >> >>>>
> >>> >> >>>> Right...  The other code I pointed out is what makes this call
> >>> >> >>>> different than before.
> >>> >> >>>>
> >>> >> >>>>
> >>> >> >>>>>
> >>> >> >>>>>
> >>> >> >>>>> The second line in the error log:
> >>> >> >>>>> (OS 10038)An operation was attempted on something that is not a
> >>> >> >>>>> socket.  : AH00344: accept() failed.
> >>> >> >>>>> comes from:
> >>> >> >>>>> child.c Line 641+: if (context->accept_socket == INVALID_SOCKET)
> >>> >> >>>>> I guess the second line in the error log is just a consequence of
> >>> >> >>>>> the server shutting down?
> >>> >> >>>>
> >>> >> >>>>
> >>> >> >>>> I expect so.
> >>> >> >>>>
> >>> >> >>>>>
> >>> >> >>>>>
> >>> >> >>>>>
> >>> >> >>>>> no, I can’t build httpd. It would be great if you could provide a
> >>> >> >>>>> patched version that I could install in a _production_ environment.
> >>> >> >>>>>
> >>> >> >>>>> I can’t reproduce the error on a test server; I can only install
> >>> >> >>>>> the version on a production server and wait for the error to occur.
> >>> >> >>>>> The error occurs sporadically.
> >>> >> >>>>>
> >>> >> >>>>> It is essential that the patch included in the build doesn’t
> >>> >> >>>>> break things (particularly the ability of apache to restart when an
> >>> >> >>>>> error happens).
> >>> >> >>>>
> >>> >> >>>>
> >>> >> >>>> The Apache Lounge folks would need to run the build for you to
> >>> >> >>>> ensure that it has any patches you're currently using and that the
> >>> >> >>>> toolchain is in sync.
> >>> >> >>>>
> >>> >> >>>> Right now I hope to make some permanent changes in the next couple
> >>> >> >>>> of days to improve diagnostics in this area and commit the changes to
> >>> >> >>>> httpd trunk.  At that point you might be able to get the Apache Lounge
> >>> >> >>>> folks to roll them into a special build.  ???
> >>> >> >>>>
> >>> >> >>>
> >>> >> >>> Here are the diagnostic changes I added:
> >>> >> >>>
> >>> >> >>> http://svn.apache.org/viewvc?view=revision&revision=1606368
> >>> >> >>>
> >>> >> >>> Perhaps the Apache Lounge folks can get you a build of 2.4.9 with
> >>> >> >>> that in it.
> >>> >> >>>
> >>> >> >>>>
> >>> >> >>>>
> >>> >> >>>>>
> >>> >> >>>>>
> >>> >> >>>>> As you found out, I use the apachelounge-build atm.
> >>> >> >>>>>
> >>> >> >>>>> Thank you for your help!
> >>> >> >>>>>
> >>> >> >>>>> Kind regards
> >>> >> >>>>> Maria Imp
> >>> >> >>>>>
> >>> >> >>>>> 2014-06-24 15:59 GMT+02:00 Jeff Trawick <trawick@gmail.com>:
> >>> >> >>>>> > On Tue, Jun 24, 2014 at 9:36 AM, Mar Imp <marimpimp@gmail.com> wrote:
> >>> >> >>>>> >>
> >>> >> >>>>> >> I face the problem that Apache restarts itself unexpectedly
> >>> >> >>>>> >> without obvious reason.
> >>> >> >>>>> >>
> >>> >> >>>>> >> -> (OS 6)The handle is invalid.  : AH00356: Child: WAIT_FAILED --
> >>> >> >>>>> >> shutting down server
> >>> >> >>>>> >> -> (OS 10038)An operation was attempted on something that is not
> >>> >> >>>>> >> a socket.  : AH00344: accept() failed.
> >>> >> >>>>> >>
> >>> >> >>>>> >> Apache Version:
> >>> >> >>>>> >>   * Problem occurs on 2.4.9, both VC10 and VC11 compiled ones
> >>> >> >>>>> >>   * Problem does NOT occur on 2.4.4
> >>> >> >>>>> >> Operating System:
> >>> >> >>>>> >>   * Microsoft Windows Server 2012 Standard
> >>> >> >>>>> >>
> >>> >> >>>>> >> The problem has occurred since the Apache upgrade from 2.4.4 to
> >>> >> >>>>> >> 2.4.9.  For now I have reverted to 2.4.4 because the random
> >>> >> >>>>> >> restarts cause downtime.
> >>> >> >>>>> >>
> >>> >> >>>>> >>
> >>> >> >>>>> >> Relevant configuration settings:
> >>> >> >>>>> >> ThreadsPerChild 300
> >>> >> >>>>> >> AcceptFilter http none
> >>> >> >>>>> >> AcceptFilter https none
> >>> >> >>>>> >> EnableSendfile off
> >>> >> >>>>> >> EnableMMAP off
> >>> >> >>>>> >>
> >>> >> >>>>> >>
> >>> >> >>>>> >> Apache works as a caching reverse proxy (mod_cache_disk)
> >>> >> >>>>> >>
> >>> >> >>>>> >>
> >>> >> >>>>> >> Typical logfile fragment:
> >>> >> >>>>> >> [log]
> >>> >> >>>>> >> [Fri May 23 07:53:58.344457 2014] [mpm_winnt:crit] [pid 42540:tid 380] (OS 6)The handle is invalid.  : AH00356: Child: WAIT_FAILED -- shutting down server
> >>> >> >>>>> >> [Fri May 23 07:54:03.757607 2014] [mpm_winnt:warn] [pid 42540:tid 4212] (OS 10038)An operation was attempted on something that is not a socket.  : AH00344: accept() failed.
> >>> >> >>>>> >> [Fri May 23 07:54:30.288807 2014] [mpm_winnt:notice] [pid 42540:tid 380] AH00362: Child: Waiting 270 more seconds for 5 worker threads to finish.
> >>> >> >>>>> >> [Fri May 23 07:55:00.329552 2014] [mpm_winnt:notice] [pid 42540:tid 380] AH00362: Child: Waiting 240 more seconds for 1 worker threads to finish.
> >>> >> >>>>> >> [Fri May 23 07:55:30.370221 2014] [mpm_winnt:notice] [pid 42540:tid 380] AH00362: Child: Waiting 210 more seconds for 1 worker threads to finish.
> >>> >> >>>>> >> [Fri May 23 07:55:47.893020 2014] [mpm_winnt:notice] [pid 42540:tid 380] AH00364: Child: All worker threads have exited.
> >>> >> >>>>> >> [Fri May 23 07:55:48.010131 2014] [mpm_winnt:notice] [pid 36544:tid 516] AH00428: Parent: child process 42540 exited with status 0 -- Restarting.
> >>> >> >>>>> >> [Fri May 23 07:55:48.801890 2014] [mpm_winnt:notice] [pid 36544:tid 516] AH00455: Apache/2.4.9 (Win64) OpenSSL/1.0.1g configured -- resuming normal operations
> >>> >> >>>>> >> [Fri May 23 07:55:48.801890 2014] [mpm_winnt:notice] [pid 36544:tid 516] AH00456: Apache Lounge VC10 Server built: Mar 17 2014 12:11:31
> >>> >> >>>>> >> [Fri May 23 07:55:48.801890 2014] [core:notice] [pid 36544:tid 516] AH00094: Command line: 'C:\\Apache24\\bin\\httpd.exe -d C:/Apache24'
> >>> >> >>>>> >> [Fri May 23 07:55:48.802891 2014] [mpm_winnt:notice] [pid 36544:tid 516] AH00418: Parent: Created child process 33888
> >>> >> >>>>> >> [Fri May 23 07:55:50.263291 2014] [mpm_winnt:notice] [pid 33888:tid 380] AH00354: Child: Starting 300 worker threads.
> >>> >> >>>>> >> [Fri May 23 11:58:09.232804 2014] [mpm_winnt:crit] [pid 33888:tid 380] (OS 6)The handle is invalid.  : AH00356: Child: WAIT_FAILED -- shutting down server
> >>> >> >>>>> >> [Fri May 23 11:58:10.710220 2014] [mpm_winnt:warn] [pid 33888:tid 5008] (OS 10038)An operation was attempted on something that is not a socket.  : AH00344: accept() failed.
> >>> >> >>>>> >> [Fri May 23 11:58:28.546422 2014] [mpm_winnt:warn] [pid 33888:tid 3320] (OS 10038)An operation was attempted on something that is not a socket.  : AH00344: accept() failed.
> >>> >> >>>>> >> [Fri May 23 11:58:41.169814 2014] [mpm_winnt:notice] [pid 33888:tid 380] AH00362: Child: Waiting 270 more seconds for 8 worker threads to finish.
> >>> >> >>>>> >> [Fri May 23 11:59:11.208219 2014] [mpm_winnt:notice] [pid 33888:tid 380] AH00362: Child: Waiting 240 more seconds for 3 worker threads to finish.
> >>> >> >>>>> >> [Fri May 23 11:59:41.245781 2014] [mpm_winnt:notice] [pid 33888:tid 380] AH00362: Child: Waiting 210 more seconds for 1 worker threads to finish.
> >>> >> >>>>> >> [Fri May 23 12:00:11.277011 2014] [mpm_winnt:notice] [pid 33888:tid 380] AH00362: Child: Waiting 180 more seconds for 1 worker threads to finish.
> >>> >> >>>>> >> [Fri May 23 12:00:41.305795 2014] [mpm_winnt:notice] [pid 33888:tid 380] AH00362: Child: Waiting 150 more seconds for 1 worker threads to finish.
> >>> >> >>>>> >> [Fri May 23 12:01:11.335526 2014] [mpm_winnt:notice] [pid 33888:tid 380] AH00362: Child: Waiting 120 more seconds for 1 worker threads to finish.
> >>> >> >>>>> >> [Fri May 23 12:01:15.840009 2014] [mpm_winnt:notice] [pid 33888:tid 380] AH00364: Child: All worker threads have exited.
> >>> >> >>>>> >> [Fri May 23 12:01:15.953118 2014] [mpm_winnt:notice] [pid 36544:tid 516] AH00428: Parent: child process 33888 exited with status 0 -- Restarting.
> >>> >> >>>>> >> [Fri May 23 12:01:16.820952 2014] [mpm_winnt:notice] [pid 36544:tid 516] AH00455: Apache/2.4.9 (Win64) OpenSSL/1.0.1g configured -- resuming normal operations
> >>> >> >>>>> >> [Fri May 23 12:01:16.820952 2014] [mpm_winnt:notice] [pid 36544:tid 516] AH00456: Apache Lounge VC10 Server built: Mar 17 2014 12:11:31
> >>> >> >>>>> >> [Fri May 23 12:01:16.820952 2014] [core:notice] [pid 36544:tid 516] AH00094: Command line: 'C:\\Apache24\\bin\\httpd.exe -d C:/Apache24'
> >>> >> >>>>> >> [Fri May 23 12:01:16.820952 2014] [mpm_winnt:notice] [pid 36544:tid 516] AH00418: Parent: Created child process 24388
> >>> >> >>>>> >> [Fri May 23 12:01:18.308375 2014] [mpm_winnt:notice] [pid 24388:tid 376] AH00354: Child: Starting 300 worker threads.
> >>> >> >>>>> >> [/log]
> >>> >> >>>>> >>
> >>> >> >>>>> >> Have you got any idea, why this can happen?
> >>> >> >>>>> >>
> >>> >> >>>>> >> In the windows eventlog I can't find any relevant entries at
> >>> >> >>>>> >> the time that apache restarts.
> >>> >> >>>>> >>
> >>> >> >>>>> >> Please help if you can
> >>> >> >>>>> >>
> >>> >> >>>>> >> Kind Regards
> >>> >> >>>>> >> Maria Imp
> >>> >> >>>>> >>
> >>> >> >>>>> >>
> >>> >> >>>>> >>
> >>> >> >>>>> >>
> >>> >> >>>>> >>
> >>> >> >>>>> >> ---------------------------------------------------------------------
> >>> >> >>>>> >> To unsubscribe, e-mail: users-unsubscribe@httpd.apache.org
> >>> >> >>>>> >> For additional commands, e-mail: users-help@httpd.apache.org
> >>> >> >>>>> >>
> >>> >> >>>>> >
> >>> >> >>>>> > I just found this thread too, which also contains this
> >>> >> >>>>> > information from you a few weeks ago:
> >>> >> >>>>> >
> >>> >> >>>>> > https://www.apachelounge.com/viewtopic.php?p=27848
> >>> >> >>>>> >
> >>> >> >>>>> > The apparent code change that triggered this is that the httpd
> >>> >> >>>>> > child now detects when the httpd parent crashes or is terminated
> >>> >> >>>>> > forcefully.  It would appear that this process handle is no longer
> >>> >> >>>>> > valid after some period of time:
> >>> >> >>>>> >
> >>> >> >>>>> >  child_events[2] = OpenProcess(SYNCHRONIZE, FALSE, parent_pid);
> >>> >> >>>>> >  num_events = 3;
> >>> >> >>>>> >
> >>> >> >>>>> > Some of the other logs in the apachelounge.com post show that
> >>> >> >>>>> > the parent was able to successfully restart the child after this
> >>> >> >>>>> > error, so presumably the handle doesn't go bad simply because the
> >>> >> >>>>> > parent exits.
> >>> >> >>>>> >
> >>> >> >>>>> > Are you able to build httpd yourself?  I could prepare a test
> >>> >> >>>>> > patch to narrow down what is going wrong.
> >>> >> >>>>> >
> >>> >> >>>>> > --
> >>> >> >
> >>> >> >
> >>> >>
> >>> >>
> >>> >>
> >>> >
> >>> >
> >>> >
> >>> > --
> >>> > Born in Roswell... married an alien...
> >>> > http://emptyhammock.com/
> >>> > http://edjective.org/
> >>> >
> >>>
> >>>
> >>
> >>
> >>
> >> --
> >> Born in Roswell... married an alien...
> >> http://emptyhammock.com/
> >> http://edjective.org/
> >>
>
>
>


-- 
Born in Roswell... married an alien...
http://emptyhammock.com/
http://edjective.org/
