harmony-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Geir Magnusson Jr." <g...@pobox.com>
Subject Re: [drlvm] Fwd: [jira] Updated: (HARMONY-2803) stress.Mix hangs on RHEL4 update 4
Date Thu, 15 Mar 2007 12:11:29 GMT

On Mar 13, 2007, at 1:24 AM, Weldon Washburn wrote:

> On 3/2/07, Geir Magnusson Jr. <geir@pobox.com> wrote:
>>
>>
>> On Feb 28, 2007, at 3:25 PM, Naveen Neelakantam wrote:
>>
>> >
>> > On Feb 28, 2007, at 3:40 PM, Weldon Washburn wrote:
>> >
>> >> Naveen,
>> >>
>> >> 1)
>> >> I tried Mix.java.load.patch on my 2-way Linux box.  It hangs
>> >> consistently.
>> >> 2)
>> >> I tried the svn HEAD version of Mix.java.  I can't get it to
>> >> fail.  Can you
>> >> do an "svn update" and see if the baseline Mix.java causes your
>> >> regression
>> >> tests to hang?
>> >
>> > I used "repeat 100 java -cp . stress.Mix" and the test still hangs
>> > for me.
>> >
>> >> My guess is that Mix.java.load.patch is somehow triggering
>> >> classloader/vm
>> >> synchronization bugs.  I worry that we will spend a bunch of time
>> >> fixing
>> >> legitimate bugs with no specific focus on getting chosen  
>> enterprise
>> >> workloads running.  Unless an enterprise app exhibits a very high
>> >> rate of
>> >> class loading, I'd like to set this bug aside for now.  What do
>> >> you think?
>> >
>> > I think you are correct.  We should make sure some app demands  
>> high-
>> > rate class loading before chasing this bug down.
>>
>> I don't agree.  I think we should at least understand it (that
>> doesn't mean fix it if it's hard), but understand why it's
>> happening.  Even a good faith "I'll spend 2 days trying to figure
>> this out, but no more" might well be worth it.
>
>
> Geir as the say in Apache-land, "thank you for volunteering".  I  
> have looked
> and satisfied my own curiosity.  It would be great if you do the  
> same and
> report back.

Why don't you tell us what you found?  Do you understand what the  
problem is beyond some unknown bugs?

I can also tell you that they are being cause by unknown bugs.

>
> Nothing scares me more than sweeping unaccountable failures under the
>> rug.
>
>
> Geir, with all due respect these are the kinds of problems one  
> observes with
> early stage systems software. DRLVM is a construction zone.  Its
> unreasonable to expect to fix roofing problems when we are still  
> working on
> the foundation.

Do you really think that core synchronization issues are the  
construction-equivalent to hanging curtains or choosing floor tiles?

I don't.

I think that this kind of thing is core to the foundation.  Better  
done earlier than later.

geir

>  Believe me, even if it the problem where "swept under the
> rug", it will come bouncing back when we try to run advanced  
> workloads.
> Nothing is being hidden here.  If you want to follow where the  
> threading
> focus is right now, take a look at H3288 and H3289.

>
>
>
>
> geir
>>
>>
>
>
> -- 
> Weldon Washburn
> Intel Enterprise Solutions Software Division


Mime
View raw message