incubator-cassandra-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Jonathan Ellis <jbel...@gmail.com>
Subject Re: SIGSEGV during compaction?
Date Sun, 11 Sep 2011 22:35:39 GMT
The problem would be because we're trying to access a mappedbytebuffer
that has been unmapped, not because of OS deletion semantics.

On Sun, Sep 11, 2011 at 2:12 PM, Yang <teddyyyy123@gmail.com> wrote:
> unfortunately it appeared again, I confirmed it appeared with the
> -XX:-UseCompressedOops  .
>
> so, if it's due to accessing unmapped file, the problem should
> disappear if I use direct random access file, I'll try that.
>
> also if a process A (not only java) has a file mmapped, and another
> process deletes the file, I thought Unix should keep the file open for
> A, no ??  if this is true, then we shouldn't get a SEGV due to forcing
> unmmap ????
>
>
> Thanks
> Yang
> On Thu, Sep 8, 2011 at 12:07 AM, Sylvain Lebresne <sylvain@datastax.com> wrote:
>> Are you using current trunk ? Or 0.8 ?
>>
>> Because if on trunk, a SIGSEGV could also be due to CASSANDRA-2521,
>> if we happen to force the unmapping of a file but tries to access it afterwards
>> (which shouldn't happen but ...).
>>
>> --
>> Sylvain
>>
>> On Thu, Sep 8, 2011 at 7:36 AM, Yang <teddyyyy123@gmail.com> wrote:
>>> hmmmm, all other things remaining the same, I put jna.jar into classpath,
>>> now it successfully completed a compaction without problems
>>>
>>> On Wed, Sep 7, 2011 at 10:06 PM, Yang <teddyyyy123@gmail.com> wrote:
>>>> thanks Jonathan.
>>>>
>>>> I tried openJdk too, same , filed bug to both Oracle and openJdk
>>>>
>>>>
>>>> tried -XX:-UseCompressedOops , same SEGV
>>>>
>>>> Oracle bug site asks "does it appear with -server and -Xint", I tried
>>>> these options, so far no SEGV yet, maybe slower, but haven't measured
>>>> exactly
>>>>
>>>>
>>>>
>>>> On Wed, Sep 7, 2011 at 8:56 PM, Jonathan Ellis <jbellis@gmail.com>
wrote:
>>>>> You should report a bug to Oracle.
>>>>>
>>>>> In the meantime you could try turning off compressed oops -- that's
>>>>> been a source of a lot of GC bugs in the past.
>>>>>
>>>>> On Wed, Sep 7, 2011 at 8:22 PM, Yang <teddyyyy123@gmail.com> wrote:
>>>>>> some info in the debug file that JVM exported:
>>>>>>
>>>>>> #
>>>>>> # A fatal error has been detected by the Java Runtime Environment:
>>>>>> #
>>>>>> #  SIGSEGV (0xb) at pc=0x00002aaaab37cbfa, pid=7236, tid=1179806016
>>>>>> #
>>>>>> # JRE version: 6.0_27-b07
>>>>>> # Java VM: Java HotSpot(TM) 64-Bit Server VM (20.2-b06 mixed mode
>>>>>> linux-amd64 compressed oops)
>>>>>> # Problematic frame:
>>>>>> # J  com.cgm.whisky.filter.WithinLimitIpFrequencyCap.isValid(Lorg/apache/avro/specific/SpecificRecord;)Lcom/cgm/whisky/EventsFilter$ValidityCode;
>>>>>> #
>>>>>> # If you would like to submit a bug report, please visit:
>>>>>> #   http://java.sun.com/webapps/bugreport/crash.jsp
>>>>>> #
>>>>>>
>>>>>> ---------------  T H R E A D  ---------------
>>>>>>
>>>>>> Current thread (0x00002aaab80e2800):  JavaThread "pool-3-thread-8"
>>>>>> [_thread_in_Java, id=7669,
>>>>>> stack(0x0000000046426000,0x0000000046527000)]
>>>>>>
>>>>>> siginfo:si_signo=SIGSEGV: si_errno=0, si_code=1 (SEGV_MAPERR),
>>>>>> si_addr=0x00002aaabc000000
>>>>>>
>>>>>> Registers:
>>>>>> RAX=0x00000007914355e8, RBX=0x000000000000058a,
>>>>>> RCX=0x0000000791461b38, RDX=0x0000000000000000
>>>>>> RSP=0x00000000465259f0, RBP=0x00000000f222b894,
>>>>>> RSI=0x0000000791433f20, RDI=0x00002aaaab37ca60
>>>>>> R8 =0x00000000d0931f61, R9 =0x00000000f2286ab2,
>>>>>> R10=0x0000000000000000, R11=0x00002aaabc000000
>>>>>> R12=0x0000000000000000, R13=0x00000000465259f0,
>>>>>> R14=0x0000000000000002, R15=0x00002aaab80e2800
>>>>>> RIP=0x00002aaaab37cbfa, EFLAGS=0x0000000000010202,
>>>>>> CSGSFS=0x0100000000000033, ERR=0x0000000000000004
>>>>>>  TRAPNO=0x000000000000000e
>>>>>>
>>>>>> Top of Stack: (sp=0x00000000465259f0)
>>>>>> 0x00000000465259f0:   000000068a828dc8 0000000791433f20
>>>>>> 0x0000000046525a00:   000000079145ee60 000005890000058a
>>>>>>
>>>>>>
>>>>>> On Wed, Sep 7, 2011 at 6:21 PM, Yang <teddyyyy123@gmail.com>
wrote:
>>>>>>> I started compaction using nodetool,
>>>>>>> then always reproducibly, I get a SEGV in a code that I added
to the
>>>>>>> Cassandra code, which simply calls get_slice().
>>>>>>>
>>>>>>> have you seen SEGV associated with compaction? anyone could suggest
a
>>>>>>> route on how to debug this?
>>>>>>>
>>>>>>> I filed a bug on sun website, right now the only possible approach
I
>>>>>>> can try is to use another JDK
>>>>>>>
>>>>>>>
>>>>>>> Thanks
>>>>>>> Yang
>>>>>>>
>>>>>>
>>>>>
>>>>>
>>>>>
>>>>> --
>>>>> Jonathan Ellis
>>>>> Project Chair, Apache Cassandra
>>>>> co-founder of DataStax, the source for professional Cassandra support
>>>>> http://www.datastax.com
>>>>>
>>>>
>>>
>>
>



-- 
Jonathan Ellis
Project Chair, Apache Cassandra
co-founder of DataStax, the source for professional Cassandra support
http://www.datastax.com

Mime
View raw message