Sean Boudreau(deleted)
|
Re: OS lockup do to Apache 22
|
Sean Boudreau(deleted)
07/31/2009 11:18 AM
post35054
|
Re: OS lockup do to Apache 22
On Fri, Jul 31, 2009 at 10:39:51AM -0400, Greg Morehead wrote:
>
> We have been attempting to identify a system lockup for a few months.
>
> Long story short, we found with slinger running everything was stable.
>
> We have suspected apache but had no real proof this was the root cause. Finally I captured what I believe to be
definitive evidence.
>
> After the system locked up, I power cycled and reviewed all the logs. The last log on the system was apache
attempting to serve up the includes from a page request.
>
> In ssl_request_log, I found that only 11 of 21 include files had been served up. This corresponds with the fact that
I was waiting forever for my page to load.
>
>
> This basically kills apache as on option on our system. Does anyone have a suggestion for a ssl compliant replacement
???
>
I'd like to try to debug the lockup. Anything on
the (text mode) console? Can you try running a
high prio shell and look around while in this state?
What version / platform? SMP?
Regards,
-seanb
|
|
|
Mitchell Schoenbrun
|
Re: OS lockup do to Apache 22
|
Mitchell Schoenbrun
07/31/2009 1:01 PM
post35079
|
Re: OS lockup do to Apache 22
I have a minor interest in this as I run Apache 22. In the past I
setup SSL and have tested it, but don't run anything against it
normally. I don't own a registered certificate.
Reading the info I've been trying to think a few things through. In
general, open source like Apache and SSL have to be very high quality.
They get run all over the place all the time. On top of that, at
least where I am, they get compiled with GCC another open source. So
the code being produced should be quite predicable in some sense and
should work quite predictably.
So where may the problem dwell?
1) It could be a bug in either Apache or SSL. Very unlikely but possible.
2) It could be some subtle difference in the way some QNX system call works.
3) It could be, as Sean alludes to, related to thread interaction on
QNX running multi-processor.
4) It could be a site specific anomaly.
An example of 4) would be a file to be served that can't be read
properly on disk.
I've put these in the order that I think is increasing likely hood.
If it will be of any help, I would be willing to put the SSL pages on
my system where we could see if the problem reproduces here. If the
problem shows up, then we've at least eliminated 4) and I could look a
little closer at 1-3. There's nothing crucial running on my system
so I can tolerate it crashing while we look.
Mitchell
|
|
|