cassandra-commits mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Sylvain Lebresne (JIRA)" <>
Subject [jira] [Commented] (CASSANDRA-10938) test_bulk_round_trip_blogposts is failing occasionally
Date Wed, 06 Jan 2016 16:13:39 GMT


Sylvain Lebresne commented on CASSANDRA-10938:

I agree with changing to CHM here if that fixes Windows and cassci as performance is not terribly
critical in that case (important but not critical).

It does very much look like we're hitting either a very bad case or a bug in NBHM. From a
very quick look at the source, the methods that takes all the CPU have some CAS loops so if
lots of threads are trying to access the same slot (which sounds to be the case if I understand
the failing test correctly) while a resizing is going on, then I suppose crappy scheduling
could lead us there. That bad behavior is a bit worrisome in the sense that we use NBHM in
other places, but it's also the first time we see this behavior so it's unclear the ROI of
investigating too much is worth it right now.

> test_bulk_round_trip_blogposts is failing occasionally
> ------------------------------------------------------
>                 Key: CASSANDRA-10938
>                 URL:
>             Project: Cassandra
>          Issue Type: Sub-task
>          Components: Tools
>            Reporter: Stefania
>            Assignee: Stefania
>             Fix For: 2.1.x
>         Attachments: 6452.nps, 6452.png, 7300.nps, 7300a.png, 7300b.png, node1_debug.log,
node2_debug.log, node3_debug.log, recording_127.0.0.1.jfr
> We get timeouts occasionally that cause the number of records to be incorrect:

This message was sent by Atlassian JIRA

View raw message