lucene-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Robert Muir (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (LUCENE-3357) Unit and integration test cases for the new Similarities
Date Wed, 10 Aug 2011 12:24:27 GMT

    [ https://issues.apache.org/jira/browse/LUCENE-3357?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13082303#comment-13082303
] 

Robert Muir commented on LUCENE-3357:
-------------------------------------

Ok, here is what i did here for BM25:

in the case norms are omitted by the user, the formula behaves as if b=0 (no length normalization).
so this is always a possibility sims should handle, thoguh for EasySimilarity perhaps it should
just supply doclen=1 or something of that nature?

in the case norms are available, but sumTotalTermFreq is not (e.g. frequencies are omitted
by the user), I use a value of 1 for avg doc len... This is probably ok
since in the case of omitTF all the TF values will be 1 anyway.

> Unit and integration test cases for the new Similarities
> --------------------------------------------------------
>
>                 Key: LUCENE-3357
>                 URL: https://issues.apache.org/jira/browse/LUCENE-3357
>             Project: Lucene - Java
>          Issue Type: Sub-task
>          Components: core/query/scoring
>    Affects Versions: flexscoring branch
>            Reporter: David Mark Nemeskey
>            Assignee: David Mark Nemeskey
>            Priority: Minor
>              Labels: gsoc, gsoc2011, test
>             Fix For: flexscoring branch
>
>         Attachments: LUCENE-3357.patch, LUCENE-3357.patch, LUCENE-3357.patch, LUCENE-3357.patch,
LUCENE-3357.patch, LUCENE-3357.patch, LUCENE-3357.patch, LUCENE-3357.patch
>
>
> Write test cases to test the new Similarities added in [LUCENE-3220|https://issues.apache.org/jira/browse/LUCENE-3220].
Two types of test cases will be created:
>  * unit tests, in which mock statistics are provided to the Similarities and the score
is validated against hand calculations;
>  * integration tests, in which a small collection is indexed and then searched using
the Similarities.
> Performance tests will be performed in a separate issue.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: dev-help@lucene.apache.org


Mime
View raw message