lucene-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Yonik Seeley (JIRA)" <j...@apache.org>
Subject [jira] [Updated] (SOLR-3180) ChaosMonkey test failures
Date Thu, 03 Jan 2013 18:32:13 GMT

     [ https://issues.apache.org/jira/browse/SOLR-3180?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]

Yonik Seeley updated SOLR-3180:
-------------------------------

    Attachment: fail.130103_105104.txt

Here's a log with some notes on the first timeout.  Chaos monkey decides to cause connection
loss to ZK at 48 sec.  At 60 sec, two requests start that don't seem to finish until 83 sec
into the test.  They seem to be blocked in zkCheck().

{code}
  2> 48972 T260 oasc.ChaosMonkey.monkeyLog monkey: chose a victim! 42854
  2> 48973 T260 oasc.ChaosMonkey.monkeyLog monkey: expire session for 42854 !
  2> 48975 T260 oasc.ChaosMonkey.monkeyLog monkey: cause connection loss!

 
  2> 52127 T122 C3 P42854 /update {distrib.from=http://127.0.0.1:51342/r_/f/collection1/&update.distrib=FROMLEADER&wt=javabin&version=2}
{delete=[60058 (-1423156803769729024)]} 0 1
  2> 60518 T120 C3 P42854 oasup.LogUpdateProcessor.processAdd PRE_UPDATE ADD add{flags=0,_version_=0,id=10063}
{distrib.from=http://127.0.0.1:51342/r_/f/collection1/&update.distrib=FROMLEADER&wt=javabin&version=2}
  2> 60786 T124 C3 P42854 oasup.LogUpdateProcessor.processDelete PRE_UPDATE DELETE delete{flags=0,_version_=-1423156812849348608,id=60059,commitWithin=-1}
{distrib.from=http://127.0.0.1:51342/r_/f/collection1/&update.distrib=FROMLEADER&wt=javabin&version=2}

  2> 75667 T74 C6 P51342 oasc.Diagnostics.logThreadDumps SEVERE REQUESTING THREAD DUMP
DUE TO TIMEOUT: Timeout occured while waiting response from server at: http://127.0.0.1:42854/r_/f/collection1
[...]
  2> 	"qtp1333272771-124" Id=124 TIMED_WAITING
  2> 		at java.lang.Thread.sleep(Native Method)
  2> 		at org.apache.solr.update.processor.DistributedUpdateProcessor.zkCheck(DistributedUpdateProcessor.java:925)
  2> 		at org.apache.solr.update.processor.DistributedUpdateProcessor.processDelete(DistributedUpdateProcessor.java:699)
  2> 		at org.apache.solr.update.processor.LogUpdateProcessor.processDelete(LogUpdateProcessor.java:97)
  2> 		at org.apache.solr.handler.loader.XMLLoader.processDelete(XMLLoader.java:346)
  2> 		at org.apache.solr.handler.loader.XMLLoader.processUpdate(XMLLoader.java:277)
  2> 		at org.apache.solr.handler.loader.XMLLoader.load(XMLLoader.java:173)
  2> 		at org.apache.solr.handler.UpdateRequestHandler$1.load(UpdateRequestHandler.java:92)
[...]
  2> 	"qtp1333272771-120" Id=120 TIMED_WAITING
  2> 		at java.lang.Thread.sleep(Native Method)
  2> 		at org.apache.solr.update.processor.DistributedUpdateProcessor.zkCheck(DistributedUpdateProcessor.java:925)
  2> 		at org.apache.solr.update.processor.DistributedUpdateProcessor.processAdd(DistributedUpdateProcessor.java:330)
  2> 		at org.apache.solr.update.processor.LogUpdateProcessor.processAdd(LogUpdateProcessor.java:76)
  2> 		at org.apache.solr.handler.loader.XMLLoader.processUpdate(XMLLoader.java:246)
  2> 		at org.apache.solr.handler.loader.XMLLoader.load(XMLLoader.java:173)
  2> 		at org.apache.solr.handler.UpdateRequestHandler$1.load(UpdateRequestHandler.java:92)
  2> 		at org.apache.solr.handler.ContentStreamHandlerBase.handleRequestBody(ContentStreamHandlerBase.java:74)

  2> 83174 T124 C3 P42854 PRE_UPDATE FINISH  {distrib.from=http://127.0.0.1:51342/r_/f/collection1/&update.distrib=FROMLEADER&wt=javabin&version=2}
  2> 83174 T120 C3 P42854 PRE_UPDATE FINISH  {distrib.from=http://127.0.0.1:51342/r_/f/collection1/&update.distrib=FROMLEADER&wt=javabin&version=2}

{code}
                
> ChaosMonkey test failures
> -------------------------
>
>                 Key: SOLR-3180
>                 URL: https://issues.apache.org/jira/browse/SOLR-3180
>             Project: Solr
>          Issue Type: Bug
>          Components: SolrCloud
>            Reporter: Yonik Seeley
>         Attachments: CMSL_fail1.log, CMSL_hang_2.txt, CMSL_hang.txt, fail.130101_034142.txt,
fail.130102_020942.txt, fail.130103_105104.txt, fail.inconsistent.txt, test_report_1.txt
>
>
> Handle intermittent failures in the ChaosMonkey tests.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: dev-help@lucene.apache.org


Mime
View raw message