lucene-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Doron Cohen (JIRA)" <>
Subject [jira] Commented: (LUCENE-736) Sloppy Phrase Scoring Misbehavior
Date Thu, 19 Apr 2007 05:23:16 GMT


Doron Cohen commented on LUCENE-736:

Need to see if the parts of the test (in QueryUtils) that were disabled by LUCENE-730 (BooleanScorer2
sometimes falls back to BooleanScorer). One possibility is to have two versions of this -
a BooleanScoere version, and the rest - this issue (736) is about sloppy/exact phrase scoring,
so it would fall into the "rest", and so the test would still catch this.

> Sloppy Phrase Scoring Misbehavior
> ---------------------------------
>                 Key: LUCENE-736
>                 URL:
>             Project: Lucene - Java
>          Issue Type: Bug
>          Components: Search
>            Reporter: Doron Cohen
>         Assigned To: Doron Cohen
>            Priority: Minor
>         Attachments: perf-search-new.log, perf-search-orig.log, res-search-new2.log,
res-search-orig2.log, sloppy_phrase.patch2.txt, sloppy_phrase.patch3.txt, sloppy_phrase_java.patch.txt,
> This is an extension of
> In addition to abnormalities Yonik pointed out in 697, there seem to be other issues
with slopy phrase search and scoring.
> 1) A phrase with a repeated word would be detected in a document although it is not there.
> I.e. document = A B D C E , query = "B C B" would not find this document (as expected),
but query "B C B"~2 would find it. 
> I think that no matter how large the slop is, this document should not be a match.
> 2) A document containing both orders of a query, symmetrically, would score differently
for the queru and for its reveresed form.
> I.e. document = A B C B A would score differently for queries "B C"~2 and "C B"~2, although
it is symmetric to both.
> I will attach test cases that show both these problems and the one reported by Yonik
in 697. 

This message is automatically generated by JIRA.
You can reply to this email to add a comment to the issue online.

To unsubscribe, e-mail:
For additional commands, e-mail:

View raw message