lucene-dev mailing list archives

From "Steve Rowe (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (SOLR-5652) Heisenbug in DistribCursorPagingTest: "walk already seen ..."
Date Sun, 26 Jan 2014 00:19:38 GMT

    [ https://issues.apache.org/jira/browse/SOLR-5652?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13882122#comment-13882122 ]

Steve Rowe commented on SOLR-5652:
----------------------------------

Uwe's Jenkins found a "walk already seen" failure on *Linux*, on branch_4x using the Oracle
1.8.0-ea-b124 JVM: [http://jenkins.thetaphi.de/job/Lucene-Solr-4.x-Linux/9136/]. A doc with no
str_dv_last value (id=2) appears both as the first doc of the first page *and* as the only doc
of the last page:

{noformat}
   [junit4]   2> 414765 T2177 oasc.DistribCursorPagingTest.assertFullWalkNoDups SOLR-5652:
({params(q=*%3A*&fl=id%2Cstr_dv_last&rows=41&sort=str_dv_last+asc%2C+id+asc),defaults(cursorMark=*)})
gave us these docs: {id=2}; {id=3, str_dv_last=}; {id=9, str_dv_last=}; {id=18, str_dv_last=};
{id=35, str_dv_last=}; {id=41}; {id=43, str_dv_last=}; {id=46, str_dv_last=}; {id=48, str_dv_last=};
{id=51, str_dv_last=}; {id=55}; {id=80, str_dv_last=}; {id=125, str_dv_last=}; {id=117, str_dv_last=u}xPe};
{id=11, str_dv_last=CVLjZ-k U}; {id=104, str_dv_last=g}; {id=66, str_dv_last=h}; {id=27, str_dv_last=j};
{id=120, str_dv_last=l}; {id=128, str_dv_last=o}; {id=19, str_dv_last=q}; {id=76, str_dv_last=t};
{id=42, str_dv_last=x}; {id=92, str_dv_last=y}; {id=40, str_dv_last=z}; {id=6, str_dv_last=???????????};
{id=82, str_dv_last=????}; {id=1, str_dv_last=??}; {id=103, str_dv_last=???????}; {id=15,
str_dv_last=???????????????????}; {id=81, str_dv_last=??????????}; {id=29, str_dv_last=?};
{id=98, str_dv_last=????????????????}; {id=78, str_dv_last=????????????}; {id=72, str_dv_last=??????????????};
{id=73, str_dv_last=???????????????????}; {id=33, str_dv_last=???????????}; {id=121, str_dv_last=????????????????};
{id=97, str_dv_last=??}; {id=47, str_dv_last=??????????????}; {id=32, str_dv_last=????????????};

   [junit4]   2> 414781 T2177 oasc.DistribCursorPagingTest.assertFullWalkNoDups SOLR-5652:
({params(q=*%3A*&fl=id%2Cstr_dv_last&rows=41&sort=str_dv_last+asc%2C+id+asc),defaults(cursorMark=AoI%2FBeC%2FpeC8neC9teC%2Bu%2BC9nOC%2FhuC%2BgeC8geC9ruC%2BgeC9s%2BC%2Bl1AC)})
gave us these docs: {id=106, str_dv_last=??????}; {id=54, str_dv_last=??????}; {id=94, str_dv_last=????};
{id=63, str_dv_last=????????????????????}; {id=88, str_dv_last=??}; {id=24, str_dv_last=??};
{id=17, str_dv_last=????????}; {id=99, str_dv_last=??????????}; {id=7, str_dv_last=?????????????????};
{id=50, str_dv_last=?????????????}; {id=85, str_dv_last=????????????}; {id=77, str_dv_last=??};
{id=102, str_dv_last=?}; {id=89, str_dv_last=??????????????}; {id=8, str_dv_last=????????};
{id=74, str_dv_last=?????????????????}; {id=60, str_dv_last=????????????????}; {id=67, str_dv_last=?????};
{id=109, str_dv_last=????????????????}; {id=118, str_dv_last=???????????}; {id=112, str_dv_last=?};
{id=56, str_dv_last=???}; {id=5, str_dv_last=????????}; {id=95, str_dv_last=???????????????????};
{id=22, str_dv_last=??????????}; {id=31, str_dv_last=???????????????}; {id=16, str_dv_last=?};
{id=52, str_dv_last=???????????}; {id=57, str_dv_last=????????????????????}; {id=113, str_dv_last=???????????};
{id=91, str_dv_last=??}; {id=34, str_dv_last=??????}; {id=39, str_dv_last=?????????}; {id=59,
str_dv_last=?????}; {id=61, str_dv_last=???????}; {id=44, str_dv_last=?????????????????};
{id=69, str_dv_last=????????????}; {id=119, str_dv_last=???????}; {id=12, str_dv_last=??????};
{id=90, str_dv_last=????}; {id=123, str_dv_last=???????????????????}; 
   [junit4]   2> 414795 T2177 oasc.DistribCursorPagingTest.assertFullWalkNoDups SOLR-5652:
({params(q=*%3A*&fl=id%2Cstr_dv_last&rows=41&sort=str_dv_last+asc%2C+id+asc),defaults(cursorMark=AoI%2FGuOEm%2BOEpOOEjuOEoeOEmeOEpuOEo%2BOEmeOEh%2BOEneOEjeOEheOErOOEguOEoeOElOOEluOEpeOEgVsH)})
gave us these docs: {id=68, str_dv_last=??????????????}; {id=96, str_dv_last=?????????????};
{id=45, str_dv_last=????????}; {id=38, str_dv_last=??????????}; {id=110, str_dv_last=???????????????};
{id=20, str_dv_last=??????}; {id=129, str_dv_last=??????????????????}; {id=75, str_dv_last=???????};
{id=79, str_dv_last=????}; {id=62, str_dv_last=?????????}; {id=30, str_dv_last=??????????????????};
{id=111, str_dv_last=???????????}; {id=28, str_dv_last=????????}; {id=105, str_dv_last=??????????????};
{id=100, str_dv_last=?}; {id=37, str_dv_last=??????}; {id=115, str_dv_last=???????????????};
{id=107, str_dv_last=?????????????}; {id=49, str_dv_last=??}; {id=93, str_dv_last=???????????????};
{id=65, str_dv_last=??????}; {id=25, str_dv_last=????????}; {id=14, str_dv_last=????????};
{id=83, str_dv_last=???????????????????}; {id=124, str_dv_last=????????????????}; {id=114,
str_dv_last=???????????????????}; {id=71, str_dv_last=??????????????????}; {id=108, str_dv_last=???????????????};
{id=36, str_dv_last=?????????????????}; {id=86, str_dv_last=??????????}; {id=122, str_dv_last=?????????};
{id=84, str_dv_last=?????????}; {id=127, str_dv_last=???????????}; {id=70, str_dv_last=?????????????};
{id=13, str_dv_last=????}; {id=10, str_dv_last=?????}; {id=64, str_dv_last=????}; {id=26,
str_dv_last=???????????}; {id=58, str_dv_last=??????????????????}; {id=4, str_dv_last=?????????????????};
{id=101, str_dv_last=??????????}; 
   [junit4]   2> 414805 T2177 oasc.DistribCursorPagingTest.assertFullWalkNoDups SOLR-5652:
({params(q=*%3A*&fl=id%2Cstr_dv_last&rows=41&sort=str_dv_last+asc%2C+id+asc),defaults(cursorMark=AoI%2FCfCdhojwnYSm8J2Es%2FCdhKvwnYWo8J2FqfCdhYfwnYaE8J2HpPCdhqJVBg%3D%3D)})
gave us these docs: {id=87, str_dv_last=??}; {id=116, str_dv_last=?????}; {id=23, str_dv_last=???????????};
{id=53, str_dv_last=??????}; {id=126, str_dv_last=???????????????}; {id=21, str_dv_last=???????????};

   [junit4]   2> 414813 T2177 oasc.DistribCursorPagingTest.assertFullWalkNoDups SOLR-5652:
({params(q=*%3A*&fl=id%2Cstr_dv_last&rows=41&sort=str_dv_last+asc%2C+id+asc),defaults(cursorMark=AoI%2FDfSEm7v0jba69IqEqvSFmYX0hram9IeujPSLqIX0ibic9IeajfSAlKX0hZqwVQE%3D)})
gave us these docs: {id=2}; 
   [junit4]    > Throwable #1: java.lang.AssertionError: ({params(q=*%3A*&fl=id%2Cstr_dv_last&rows=41&sort=str_dv_last+asc%2C+id+asc),defaults(cursorMark=AoI%2FDfSEm7v0jba69IqEqvSFmYX0hram9IeujPSLqIX0ibic9IeajfSAlKX0hZqwVQE%3D)})
walk already seen: 2, don't know why; q=id:2 gives: {responseHeader={status=0,QTime=5},response={numFound=1,start=0,maxScore=4.3672957,docs=[SolrDocument{id=2,
int=-1269312747, int_last=-1269312747, int_first=-1269312747, int_dv_last=-1269312747, int_dv_first=-1269312747,
long=-7073619055149191639, long_last=-7073619055149191639, long_first=-7073619055149191639,
long_dv_last=-7073619055149191639, long_dv_first=-7073619055149191639, float=-5.1121048E7,
float_last=-5.1121048E7, float_first=-5.1121048E7, float_dv_last=-5.1121048E7, float_dv_first=-5.1121048E7,
double=1.209537768077775E9, double_last=1.209537768077775E9, double_first=1.209537768077775E9,
double_dv_last=1.209537768077775E9, double_dv_first=1.209537768077775E9, _version_=1458244495980101632}]}}
{noformat}
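
The failing check boils down to walking the cursor pages in order and asserting that no id is returned twice. Below is a minimal sketch of that idea, not the actual DistribCursorPagingTest code: the class and method names are hypothetical, and the pages hold only the first few ids of the first and fourth pages, plus the final page, from the log above.

```java
import java.util.Arrays;
import java.util.HashSet;
import java.util.List;
import java.util.Set;

public class WalkDupCheck {
    /** Returns the first id seen twice across the pages, or null if none repeats. */
    static Integer firstDuplicate(List<List<Integer>> pages) {
        Set<Integer> seen = new HashSet<>();
        for (List<Integer> page : pages) {
            for (Integer id : page) {
                // Set.add returns false when the id was already recorded
                if (!seen.add(id)) {
                    return id;
                }
            }
        }
        return null;
    }

    public static void main(String[] args) {
        // Abbreviated excerpts of the logged pages (hypothetical test data)
        List<List<Integer>> pages = Arrays.asList(
            Arrays.asList(2, 3, 9, 18, 35),   // start of the first page
            Arrays.asList(87, 116, 23),       // start of the fourth page
            Arrays.asList(2));                // the final page
        System.out.println("walk already seen: " + firstDuplicate(pages));
    }
}
```

With these excerpts the sketch reports id 2, matching the assertion message in the log.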

Also, I found a *trunk* failure on OS X using Oracle 1.7.0_25 JVM - one failure out of 875
trials:

{noformat}
797.log:   [junit4]   2> 33038 T10 oasc.DistribCursorPagingTest.doRandomSortsOnLargeIndex
SOLR-5652: Begining Loop over smallish num of docs
797.log:   [junit4]    > Throwable #1: java.lang.AssertionError: ({params(q=*%3A*&fl=id%2Clong_dv_last&rows=41&sort=long_dv_last+asc%2C+id+asc),defaults(cursorMark=AoIHQPUmIuYgRStcBA%3D%3D)})
walk already seen: 94, don't know why; q=id:94 gives: {responseHeader={status=0,QTime=8},response={numFound=1,start=0,maxScore=4.650658,docs=[SolrDocument{id=94,
int=-1526187839, int_last=-1526187839, int_first=-1526187839, int_dv_last=-1526187839, int_dv_first=-1526187839,
float=-4.343064E8, float_last=-4.343064E8, float_first=-4.343064E8, float_dv_last=-4.343064E8,
float_dv_first=-4.343064E8, double=2.4889970242543688E8, double_last=2.4889970242543688E8,
double_first=2.4889970242543688E8, double_dv_last=2.4889970242543688E8, double_dv_first=2.4889970242543688E8,
str=႗ၨတဟက႖ဘပၩရၢ၇ႃ, str_last=႗ၨတဟက႖ဘပၩရၢ၇ႃ,
str_first=႗ၨတဟက႖ဘပၩရၢ၇ႃ, str_dv_last=႗ၨတဟက႖ဘပၩရၢ၇ႃ,
str_dv_first=႗ၨတဟက႖ဘပၩရၢ၇ႃ, bin=1OWONkZ+pQ9N1WTd+bro5Ubqx8FFEGtqPDSD,
bin_last=1OWONkZ+pQ9N1WTd+bro5Ubqx8FFEGtqPDSD, bin_first=1OWONkZ+pQ9N1WTd+bro5Ubqx8FFEGtqPDSD,
bin_dv_last=1OWONkZ+pQ9N1WTd+bro5Ubqx8FFEGtqPDSD, bin_dv_first=1OWONkZ+pQ9N1WTd+bro5Ubqx8FFEGtqPDSD,
_version_=1458175223436148736}]}}
{noformat}
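
The cursorMark values in these logs are URL-encoded Base64 strings. As a debugging aid, here is a minimal sketch that turns one into a hex dump so the raw bytes of two marks can be compared. The class and method names are hypothetical, and how Solr lays out those bytes internally is not stated in this thread, so the sketch stops at the hex dump.

```java
import java.io.UnsupportedEncodingException;
import java.net.URLDecoder;
import java.util.Base64;

public class CursorMarkHex {
    /** URL-decodes the logged value, then Base64-decodes it to raw bytes. */
    static byte[] decode(String urlEncodedCursorMark) {
        try {
            String base64 = URLDecoder.decode(urlEncodedCursorMark, "UTF-8");
            return Base64.getDecoder().decode(base64);
        } catch (UnsupportedEncodingException e) {
            // UTF-8 is always available on a conforming JVM
            throw new IllegalStateException(e);
        }
    }

    /** Formats bytes as space-separated, two-digit lowercase hex. */
    static String toHex(byte[] bytes) {
        StringBuilder sb = new StringBuilder();
        for (byte b : bytes) {
            if (sb.length() > 0) {
                sb.append(' ');
            }
            sb.append(String.format("%02x", b & 0xff));
        }
        return sb.toString();
    }

    public static void main(String[] args) {
        // cursorMark from the trunk failure on OS X quoted above
        System.out.println(toHex(decode("AoIHQPUmIuYgRStcBA%3D%3D")));
    }
}
```

For the trunk failure's cursorMark this prints 13 bytes: 02 82 07 40 f5 26 22 e6 20 45 2b 5c 04.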

Also, I ran 1000 trials on Win7 on branch_4x using Oracle 1.6.0_45 JVM, and saw 0 failures.
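
To put the 0-out-of-1000 result in context, here is a rough sketch of how likely a clean run would be if the Win7 setup failed at the same rate as the OS X trunk run (1 in 875). That shared rate and the independence of trials are assumptions made only for illustration; nothing in this thread establishes them, and the class and method names are hypothetical.

```java
public class ZeroFailureOdds {
    /** Probability of seeing no failures in the given number of independent trials. */
    static double probZeroFailures(double perTrialFailureRate, int trials) {
        return Math.pow(1.0 - perTrialFailureRate, trials);
    }

    public static void main(String[] args) {
        // Assumed rate: the 1-in-875 figure observed on OS X trunk
        double rate = 1.0 / 875.0;
        System.out.println("P(0 failures in 1000 trials) = " + probZeroFailures(rate, 1000));
    }
}
```

Under those assumptions the probability comes out at about 0.32, so a clean 1000-trial run on its own would not show that the Win7/Java 6 combination is unaffected.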

> Heisenbug in DistribCursorPagingTest: "walk already seen ..."
> -------------------------------------------------------------
>
>                 Key: SOLR-5652
>                 URL: https://issues.apache.org/jira/browse/SOLR-5652
>             Project: Solr
>          Issue Type: Bug
>            Reporter: Hoss Man
>            Assignee: Hoss Man
>         Attachments: 129.log, 372.log, jenkins.thetaphi.de_Lucene-Solr-4.x-MacOSX_1200.log.txt, jenkins.thetaphi.de_Lucene-Solr-4.x-MacOSX_1217.log.txt
>
>
> Twice now, Uwe's Jenkins has encountered a "walk already seen ..." assertion failure from DistribCursorPagingTest that I've been unable to fathom, let alone reproduce (although sarowe was able to trigger a similar failure, with a non-reproducible seed, on his machine).
> Using this as a tracking issue to try and make sense of it.
> Summary of things noticed so far (in 3 failures):
> * So far only seen on http://jenkins.thetaphi.de & sarowe's mac
> * So far only seen on MacOSX
> * So far only seen on branch 4x
> * So far seen on both Java6 and Java7
> * fails occurred in first block of randomized testing:
> ** we've indexed a small number of randomized docs
> ** we're explicitly looping over every field and sorting in both directions
> * both fails came when doing a "desc" sort on one of the "\*_dv_last" or "\*_dv_first" fields (docValues=true, either sortMissingLast=true OR sortMissingFirst=true)
> ** sort on same field asc has always worked fine just before this (fields are in arbitrary order, but "asc" is always tried before "desc")
> ** sorting on some other random fields has sometimes been tried before this and worked
> (specifics of each failure seen in the wild recorded in comments)



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)