lucene-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Hoss Man (JIRA)" <j...@apache.org>
Subject [jira] [Assigned] (SOLR-12040) HdfsBasicDistributedZkTest & HdfsBasicDistributedZk2 fail on virtually every jenkins run
Date Tue, 27 Feb 2018 19:08:00 GMT

     [ https://issues.apache.org/jira/browse/SOLR-12040?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]

Hoss Man reassigned SOLR-12040:
-------------------------------

    Assignee: Mark Miller

FWIW: I've poked around in the jenkins logs (for jobs where there are failures at both the
method level and suite level) looking for any red flags in the solr log messages and couldn't
find anything obvious – there are occasional InteruptedExceptions logged by the HDFS layer,
and TriggerInjection occasionally complains that a node is out of sync with the leader –
but i see these same types of exceptions in the logs when i run the tests locally and it passes.

[~markrmiller@gmail.com]: can you please help diagnose these failures?

> HdfsBasicDistributedZkTest & HdfsBasicDistributedZk2 fail on virtually every jenkins
run
> ----------------------------------------------------------------------------------------
>
>                 Key: SOLR-12040
>                 URL: https://issues.apache.org/jira/browse/SOLR-12040
>             Project: Solr
>          Issue Type: Bug
>      Security Level: Public(Default Security Level. Issues are Public) 
>            Reporter: Hoss Man
>            Assignee: Mark Miller
>            Priority: Major
>
> HdfsBasicDistributedZkTest & HdfsBasicDistributedZk2 are thin subclasses of BasicDistributedZkTest
& BasicDistributedZk2 that just tweak the setup to use HDFS, and only run @Nightly.
> These tests are failing virtually every time they are run by jenkins - either at a method
level, or at a suite level (due to threadleaks, timeouts, etc...) yet their non-HDFS superclasss
virtually never fail.
> Per the jenkins failure rates reports i've setup, here's the failure rates of all tests
matching "BasicDistributed" for the past 7days (note that the non-HDFS tests aren't even listed,
because they haven't failed at all even though they are non-nightly and have cumulatively
run ~750 times in the past 7 days)
> http://fucit.org/solr-jenkins-reports/failure-report.html
> {noformat}
> "Suite?","Class","Method","Rate","Runs","Fails"
> "true","org.apache.solr.cloud.hdfs.HdfsBasicDistributedZk2Test","","53.3333333333333","15","8"
> "false","org.apache.solr.cloud.hdfs.HdfsBasicDistributedZk2Test","test","18.75","16","3"
> "true","org.apache.solr.cloud.hdfs.HdfsBasicDistributedZkTest","","46.1538461538462","13","6"
> "false","org.apache.solr.cloud.hdfs.HdfsBasicDistributedZkTest","test","7.69230769230769","13","1"
> {noformat}



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: dev-help@lucene.apache.org


Mime
View raw message