lucene-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Tomás Fernández Löbbe (JIRA) <j...@apache.org>
Subject [jira] [Created] (SOLR-12041) NPE in ChaosMonkeyNothingIsSafeWithPullReplicasTest
Date Tue, 27 Feb 2018 20:57:00 GMT
Tomás Fernández Löbbe created SOLR-12041:
--------------------------------------------

             Summary: NPE in ChaosMonkeyNothingIsSafeWithPullReplicasTest
                 Key: SOLR-12041
                 URL: https://issues.apache.org/jira/browse/SOLR-12041
             Project: Solr
          Issue Type: Bug
      Security Level: Public (Default Security Level. Issues are Public)
          Components: Tests
            Reporter: Tomás Fernández Löbbe
            Assignee: Tomás Fernández Löbbe


I found this failure in Steve’s Jenkins (http://fucit.org/solr-jenkins-reports/job-data/sarowe/Lucene-Solr-tests-7.x/2910/):
{noformat}
   [junit4]   2> NOTE: reproduce with: ant test  -Dtestcase=ChaosMonkeyNothingIsSafeWithPullReplicasTest
-Dtests.method=test -Dtests.seed=FEC5BFCB68EE30B1 -Dtests.slow=true -Dtests.locale=he-IL -Dtests.timezone=Pacific/Midway
-Dtests.asserts=true -Dtests.file.encoding=UTF-8
   [junit4] ERROR   37.8s J8  | ChaosMonkeyNothingIsSafeWithPullReplicasTest.test <<<
   [junit4]    > Throwable #1: java.lang.NullPointerException
   [junit4]    >        at __randomizedtesting.SeedInfo.seed([FEC5BFCB68EE30B1:76918011C6125D49]:0)
   [junit4]    >        at org.apache.solr.cloud.AbstractFullDistribZkTestBase.getIndexVersion(AbstractFullDistribZkTestBase.java:2172)
   [junit4]    >        at org.apache.solr.cloud.AbstractFullDistribZkTestBase.waitForReplicationFromReplicas(AbstractFullDistribZkTestBase.java:2110)
   [junit4]    >        at org.apache.solr.cloud.ChaosMonkeyNothingIsSafeWithPullReplicasTest.test(ChaosMonkeyNothingIsSafeWithPullReplicasTest.java:268)
   [junit4]    >        at org.apache.solr.BaseDistributedSearchTestCase$ShardsRepeatRule$ShardsFixedStatement.callStatement(BaseDistributedSearchTestCase.java:993)
   [junit4]    >        at org.apache.solr.BaseDistributedSearchTestCase$ShardsRepeatRule$ShardsStatement.evaluate(BaseDistributedSearchTestCase.java:968)
   [junit4]    >        at java.lang.Thread.run(Thread.java:748)
{noformat}

It seems to be caused by this code:
{code:java}
    for(Slice s:collection.getSlices()) {
      Replica leader = s.getLeader();
      long leaderIndexVersion = -1;
      while (!timeout.hasTimedOut()) {
        —> leaderIndexVersion = getIndexVersion(leader);
{code}
and I believe the problem is that the shard may not have a leader at the time of this check.




--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: dev-help@lucene.apache.org


Mime
View raw message