lucene-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Hoss Man (Updated) (JIRA)" <j...@apache.org>
Subject [jira] [Updated] (SOLR-3124) explain output is confusing when using trie fields (or any field type where the indexed terms are not human readable)
Date Thu, 08 Mar 2012 23:07:59 GMT

     [ https://issues.apache.org/jira/browse/SOLR-3124?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]

Hoss Man updated SOLR-3124:
---------------------------

    Description: 
using the trunk example schema containing...

{noformat}
<fieldType name="tint" class="solr.TrieIntField" precisionStep="8" positionIncrementGap="0"/>
<dynamicField name="*_ti" type="tint"    indexed="true"  stored="true"/>
{noformat}

...and indexing the doc...

{noformat}
$ java -Ddata=args -jar post.jar '<add><doc><field name="id">HOSS</field><field
name="foo_ti">42</field></doc></add>'
{noformat}

...results in a query for [foo_ti:42|http://localhost:8983/solr/select?q=foo_ti:42&start=0&rows=10&wt=json&debug.explain.structured=true&debugQuery=true&indent=true]
producing the following debug output...

{noformat}
  "debug":{
    "rawquerystring":"foo_ti:42",
    "querystring":"foo_ti:42",
    "parsedquery":"foo_ti:42",
    "parsedquery_toString":"foo_ti:`\b\u0000\u0000\u0000*",
    "explain":{
      "HOSS":{
        "match":true,
        "value":3.6741486,
        "description":"weight(foo_ti:`\b\u0000\u0000\u0000* in 0) [DefaultSimilarity], result
of:",
        "details":[{
            "match":true,
            "value":3.6741486,
            "description":"fieldWeight in 0, product of:",
            "details":[{
                "match":true,
                "value":1.0,
                "description":"tf(freq=1.0), with freq of:",
                "details":[{
                    "match":true,
                    "value":1.0,
                    "description":"termFreq=1.0"}]},
              {
                "match":true,
                "value":3.6741486,
                "description":"idf(docFreq=1, maxDocs=29)"},
              {
                "match":true,
                "value":1.0,
                "description":"fieldNorm(doc=0)"}]}]}},
...
{noformat}

  was:
defType=edismax&boost=query($param)&param=specialties_ids:32&debugQuery=true

<str name="2H7DF">
6.351252 = (MATCH) boost(*:*,query(specialties_ids: #1;#0;#0;#0;#0;#0;#0;#0;#0; ,def=0.0)),
product of:
  1.0 = (MATCH) MatchAllDocsQuery, product of:
    1.0 = queryNorm
  6.351252 = query(specialties_ids: #1;#0;#0;#0;#0;#0;#0;#0;#0; ,def=0.0)=6.351252
</str><str name="X5PJW">
6.351252 = (MATCH) boost(*:*,query(specialties_ids: #1;#0;#0;#0;#0;#0;#0;#0;#0; ,def=0.0)),
product of:
  1.0 = (MATCH) MatchAllDocsQuery, product of:
    1.0 = queryNorm
  6.351252 = query(specialties_ids: #1;#0;#0;#0;#0;#0;#0;#0;#0; ,def=0.0)=6.351252
</str>




        Summary: explain output is confusing when using trie fields (or any field type where
the indexed terms are not human readable)  (was: explain output looks unreadable when using
boost and edismax - #0; ?)

generalizing summary & description since the issue actually has nothing to do with "boosting"
and clarifying exactly how to reproduce (the field types used matter)

Bill: the fundamental problem is that the code for generating explain information works with
the indexed terms in the queries, which for some field types is non-readable.  The Solr FieldType
classes know how to format those indexed terms as readable strings, but the code for generating
Explanation objects is at a lower level in lucene and doens't know about the schema at all.


                
> explain output is confusing when using trie fields (or any field type where the indexed
terms are not human readable)
> ---------------------------------------------------------------------------------------------------------------------
>
>                 Key: SOLR-3124
>                 URL: https://issues.apache.org/jira/browse/SOLR-3124
>             Project: Solr
>          Issue Type: Bug
>    Affects Versions: 3.5
>            Reporter: Bill Bell
>
> using the trunk example schema containing...
> {noformat}
> <fieldType name="tint" class="solr.TrieIntField" precisionStep="8" positionIncrementGap="0"/>
> <dynamicField name="*_ti" type="tint"    indexed="true"  stored="true"/>
> {noformat}
> ...and indexing the doc...
> {noformat}
> $ java -Ddata=args -jar post.jar '<add><doc><field name="id">HOSS</field><field
name="foo_ti">42</field></doc></add>'
> {noformat}
> ...results in a query for [foo_ti:42|http://localhost:8983/solr/select?q=foo_ti:42&start=0&rows=10&wt=json&debug.explain.structured=true&debugQuery=true&indent=true]
producing the following debug output...
> {noformat}
>   "debug":{
>     "rawquerystring":"foo_ti:42",
>     "querystring":"foo_ti:42",
>     "parsedquery":"foo_ti:42",
>     "parsedquery_toString":"foo_ti:`\b\u0000\u0000\u0000*",
>     "explain":{
>       "HOSS":{
>         "match":true,
>         "value":3.6741486,
>         "description":"weight(foo_ti:`\b\u0000\u0000\u0000* in 0) [DefaultSimilarity],
result of:",
>         "details":[{
>             "match":true,
>             "value":3.6741486,
>             "description":"fieldWeight in 0, product of:",
>             "details":[{
>                 "match":true,
>                 "value":1.0,
>                 "description":"tf(freq=1.0), with freq of:",
>                 "details":[{
>                     "match":true,
>                     "value":1.0,
>                     "description":"termFreq=1.0"}]},
>               {
>                 "match":true,
>                 "value":3.6741486,
>                 "description":"idf(docFreq=1, maxDocs=29)"},
>               {
>                 "match":true,
>                 "value":1.0,
>                 "description":"fieldNorm(doc=0)"}]}]}},
> ...
> {noformat}

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: dev-help@lucene.apache.org


Mime
View raw message