lucene-dev mailing list archives

From "Mark Miller (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (SOLR-5509) ChaosMonkeyNothingIsSafeTest rare fails due to TOLEADER retries.
Date Sun, 01 Dec 2013 04:43:35 GMT

    [ https://issues.apache.org/jira/browse/SOLR-5509?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13835952#comment-13835952 ]

Mark Miller commented on SOLR-5509:
-----------------------------------

I mentioned that because I was seeing it less - no real evidence though.

Anyway, what seems to be happening...

Document ids from two different threads can overlap above 50000. I never really figured this
was a big problem - at worst we would end up testing some updates. But...

Thread one adds a document with id > 50000 - it fails while being forwarded to the leader and
a retry starts. Thread two then adds the same document with id > 50000 and it succeeds.
Thread two then deletes its add, which happens on control and in the cloud. Finally, thread
one's retry works and adds the doc back to the cloud while it's not in control.
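
To make the interleaving concrete, here is a toy, single-threaded replay of those steps. It is only a sketch: control and cloud are modeled as plain Java sets rather than Solr clients, the class and method names are hypothetical, and it assumes that thread one's add reaches control right away while only its cloud add is delayed by the retry.

```java
import java.util.Set;
import java.util.TreeSet;

// Toy replay of the interleaving; "control" and "cloud" are plain sets, not Solr clients.
public class RetryRaceReplay {

    /** Returns the ids that end up in cloud but not in control. */
    public static Set<Integer> onlyInCloud(int threadOneId, int threadTwoId) {
        Set<Integer> control = new TreeSet<>();
        Set<Integer> cloud = new TreeSet<>();

        // Thread one: the control add succeeds (assumption), while the cloud add
        // fails being forwarded to the leader and is left pending for a retry.
        control.add(threadOneId);
        int pendingRetry = threadOneId;

        // Thread two: its add succeeds on both sides.
        control.add(threadTwoId);
        cloud.add(threadTwoId);

        // Thread two: deletes its own add on both sides.
        control.remove(threadTwoId);
        cloud.remove(threadTwoId);

        // Thread one: the retry finally lands on the cloud.
        cloud.add(pendingRetry);

        // Whatever is in cloud but missing from control is the mismatch.
        Set<Integer> diff = new TreeSet<>(cloud);
        diff.removeAll(control);
        return diff;
    }

    public static void main(String[] args) {
        System.out.println("shared id:    only in cloud = " + onlyInCloud(50086, 50086));
        System.out.println("distinct ids: only in cloud = " + onlyInCloud(50086, 60086));
    }
}
```

With a shared id the replay leaves that id in the cloud set only; with distinct ids the two sets agree.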

Not a great test I think - I'll try to ensure unique ids are used per thread for the test.
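
One possible way to get per-thread unique ids is to stripe the id space by thread index. The sketch below is an illustration only and not necessarily what the test ended up doing: the class name, the BASE constant and the striping scheme are all assumptions.

```java
// Hypothetical helper: gives each indexing thread its own disjoint id sequence.
// Each instance is meant to be used by a single thread, so it is not synchronized.
public class PerThreadIds {
    private static final long BASE = 50000L; // assumed starting point, for illustration

    private final int threadIndex;
    private final int threadCount;
    private long issued = 0;

    public PerThreadIds(int threadIndex, int threadCount) {
        if (threadCount <= 0 || threadIndex < 0 || threadIndex >= threadCount) {
            throw new IllegalArgumentException("need 0 <= threadIndex < threadCount");
        }
        this.threadIndex = threadIndex;
        this.threadCount = threadCount;
    }

    /** Next id for this thread: BASE + threadIndex + k * threadCount, k = 0, 1, 2, ... */
    public long nextId() {
        return BASE + threadIndex + (issued++) * threadCount;
    }

    public static void main(String[] args) {
        PerThreadIds first = new PerThreadIds(0, 2);
        PerThreadIds second = new PerThreadIds(1, 2);
        // The two sequences interleave but never collide.
        System.out.println(first.nextId() + " " + second.nextId() + " " + first.nextId());
    }
}
```

Because every id is congruent to its thread index modulo the thread count, a delayed retry from one thread can no longer re-add a document that another thread has added and deleted.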

> ChaosMonkeyNothingIsSafeTest rare fails due to TOLEADER retries.
> ----------------------------------------------------------------
>
>                 Key: SOLR-5509
>                 URL: https://issues.apache.org/jira/browse/SOLR-5509
>             Project: Solr
>          Issue Type: Bug
>          Components: SolrCloud
>            Reporter: Mark Miller
>            Assignee: Mark Miller
>             Fix For: 5.0, 4.7
>
>         Attachments: cmns-test-cloud-off-by1-control-2.log
>
>
> {noformat}
>    [junit4]   2> 41386 T28 C21 P57194 oasup.LogUpdateProcessor.finish [collection1] webapp= path=/update params={wt=javabin&CONTROL=TRUE&version=2} {add=[50086 (1452880907553734656)]} 0 142
>    [junit4]   2> 42009 T133 C27 P60411 oasup.LogUpdateProcessor.finish [collection1] webapp= path=/update params={wt=javabin&version=2} {add=[50086 (1452880908206997504)]} 0 254
>    [junit4]   2> 42323 T27 C21 P57194 oasup.LogUpdateProcessor.finish [collection1] webapp= path=/update params={wt=javabin&CONTROL=TRUE&version=2} {delete=[50086 (-1452880908537298944)]} 0 2
>    [junit4]   2> 42327 T131 C27 P60411 oasup.LogUpdateProcessor.finish [collection1] webapp= path=/update params={wt=javabin&version=2} {delete=[50086 (-1452880908542541824)]} 0 1
>    [junit4]   2> 42622 T132 C27 P60411 oasup.LogUpdateProcessor.finish [collection1] webapp= path=/update params={update.distrib=TOLEADER&wt=javabin&version=2} {add=[50086 (1452880908850823168)]} 0 1
>    [junit4]   2> 42623 T48 C22 P57136 oasup.LogUpdateProcessor.finish [collection1] webapp= path=/update params={wt=javabin&version=2} {add=[50086]} 0 1223
>    [junit4]   2> ###### Only in cloudDocList: [{id=50086}]
>    [junit4]   2> 		cloudClient :{numFound=1,start=0,docs=[SolrDocument{id=50086, _version_=1452880908850823168}]}
> {noformat}



--
This message was sent by Atlassian JIRA
(v6.1#6144)

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: dev-help@lucene.apache.org

