Mailing-List: contact dev-help@lucene.apache.org; run by ezmlm
Reply-To: dev@lucene.apache.org
Date: Thu, 8 Mar 2012 23:07:59 +0000 (UTC)
From: "Hoss Man (Updated) (JIRA)"
To: dev@lucene.apache.org
Message-ID: <948234856.41113.1331248079542.JavaMail.tomcat@hel.zones.apache.org>
In-Reply-To: <195692327.28796.1329002579613.JavaMail.tomcat@hel.zones.apache.org>
Subject: [jira] [Updated] (SOLR-3124) explain output is confusing when using trie fields (or any field type where the indexed terms are not human readable)

     [
https://issues.apache.org/jira/browse/SOLR-3124?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Hoss Man updated SOLR-3124:
---------------------------

    Description:
using the trunk example schema containing...

{noformat}
{noformat}

...and indexing the doc...

{noformat}
$ java -Ddata=args -jar post.jar 'HOSS42'
{noformat}

...results in a query for [foo_ti:42|http://localhost:8983/solr/select?q=foo_ti:42&start=0&rows=10&wt=json&debug.explain.structured=true&debugQuery=true&indent=true] producing the following debug output...

{noformat}
  "debug":{
    "rawquerystring":"foo_ti:42",
    "querystring":"foo_ti:42",
    "parsedquery":"foo_ti:42",
    "parsedquery_toString":"foo_ti:`\b\u0000\u0000\u0000*",
    "explain":{
      "HOSS":{
        "match":true,
        "value":3.6741486,
        "description":"weight(foo_ti:`\b\u0000\u0000\u0000* in 0) [DefaultSimilarity], result of:",
        "details":[{
            "match":true,
            "value":3.6741486,
            "description":"fieldWeight in 0, product of:",
            "details":[{
                "match":true,
                "value":1.0,
                "description":"tf(freq=1.0), with freq of:",
                "details":[{
                    "match":true,
                    "value":1.0,
                    "description":"termFreq=1.0"}]},
              {
                "match":true,
                "value":3.6741486,
                "description":"idf(docFreq=1, maxDocs=29)"},
              {
                "match":true,
                "value":1.0,
                "description":"fieldNorm(doc=0)"}]}]}},
  ...
{noformat}

  was:
defType=edismax&boost=query($param)&param=specialties_ids:32&debugQuery=true

6.351252 = (MATCH) boost(*:*,query(specialties_ids: #1;#0;#0;#0;#0;#0;#0;#0;#0; ,def=0.0)), product of:
  1.0 = (MATCH) MatchAllDocsQuery, product of:
    1.0 = queryNorm
  6.351252 = query(specialties_ids: #1;#0;#0;#0;#0;#0;#0;#0;#0; ,def=0.0)=6.351252

6.351252 = (MATCH) boost(*:*,query(specialties_ids: #1;#0;#0;#0;#0;#0;#0;#0;#0; ,def=0.0)), product of:
  1.0 = (MATCH) MatchAllDocsQuery, product of:
    1.0 = queryNorm
  6.351252 = query(specialties_ids: #1;#0;#0;#0;#0;#0;#0;#0;#0; ,def=0.0)=6.351252

        Summary: explain output is confusing when using trie fields (or any field type where the indexed terms are not human readable)  (was: explain output looks unreadable when using boost and edismax - #0; ?)

Generalizing the summary & description, since the issue actually has nothing to do with "boosting", and clarifying exactly how to reproduce it (the field types used matter).

Bill: the fundamental problem is that the code for generating explain information works with the indexed terms in the queries, which for some field types are not human readable. The Solr FieldType classes know how to format those indexed terms as readable strings, but the code for generating Explanation objects is at a lower level in Lucene and doesn't know about the schema at all.

> explain output is confusing when using trie fields (or any field type where the indexed terms are not human readable)
> ---------------------------------------------------------------------------------------------------------------------
>
>                 Key: SOLR-3124
>                 URL: https://issues.apache.org/jira/browse/SOLR-3124
>             Project: Solr
>          Issue Type: Bug
>    Affects Versions: 3.5
>            Reporter: Bill Bell
>
> using the trunk example schema containing...
> {noformat}
>
>
> {noformat}
> ...and indexing the doc...
> {noformat}
> $ java -Ddata=args -jar post.jar 'HOSS42'
> {noformat}
> ...results in a query for [foo_ti:42|http://localhost:8983/solr/select?q=foo_ti:42&start=0&rows=10&wt=json&debug.explain.structured=true&debugQuery=true&indent=true] producing the following debug output...
> {noformat}
>   "debug":{
>     "rawquerystring":"foo_ti:42",
>     "querystring":"foo_ti:42",
>     "parsedquery":"foo_ti:42",
>     "parsedquery_toString":"foo_ti:`\b\u0000\u0000\u0000*",
>     "explain":{
>       "HOSS":{
>         "match":true,
>         "value":3.6741486,
>         "description":"weight(foo_ti:`\b\u0000\u0000\u0000* in 0) [DefaultSimilarity], result of:",
>         "details":[{
>             "match":true,
>             "value":3.6741486,
>             "description":"fieldWeight in 0, product of:",
>             "details":[{
>                 "match":true,
>                 "value":1.0,
>                 "description":"tf(freq=1.0), with freq of:",
>                 "details":[{
>                     "match":true,
>                     "value":1.0,
>                     "description":"termFreq=1.0"}]},
>               {
>                 "match":true,
>                 "value":3.6741486,
>                 "description":"idf(docFreq=1, maxDocs=29)"},
>               {
>                 "match":true,
>                 "value":1.0,
>                 "description":"fieldNorm(doc=0)"}]}]}},
>   ...
> {noformat}

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: dev-help@lucene.apache.org
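
To illustrate the comment above about indexed terms not being human readable: the sketch below encodes the int 42 in 7-bit chunks behind a shift marker, which yields the same string that appears after "foo_ti:" in the debug output, and then decodes it back. It is a minimal Python sketch, assuming that trie int fields use this kind of prefix coding with a marker of 0x60 at shift 0. The function names and the constant are hypothetical and are not Lucene's or Solr's API.

```python
# Assumed marker for a 32-bit term at shift 0 (hypothetical constant name).
SHIFT_START_INT = 0x60


def int_to_prefix_coded(value: int, shift: int = 0) -> str:
    """Encode a signed 32-bit int as a marker char plus 7-bit chars."""
    n_chars = (31 - shift) // 7 + 1
    # Flip the sign bit so the encoded terms sort like the signed ints.
    sortable = ((value & 0xFFFFFFFF) ^ 0x80000000) >> shift
    chars = []
    for _ in range(n_chars):
        chars.append(chr(sortable & 0x7F))  # lowest 7 bits first
        sortable >>= 7
    return chr(SHIFT_START_INT + shift) + "".join(reversed(chars))


def prefix_coded_to_int(term: str) -> int:
    """Decode a term produced by int_to_prefix_coded back to an int."""
    shift = ord(term[0]) - SHIFT_START_INT
    sortable = 0
    for ch in term[1:]:
        sortable = (sortable << 7) | ord(ch)
    unsigned = ((sortable << shift) ^ 0x80000000) & 0xFFFFFFFF
    # Reinterpret the 32-bit pattern as a signed value.
    return unsigned - (1 << 32) if unsigned & 0x80000000 else unsigned


indexed = int_to_prefix_coded(42)
# Same characters as in "parsedquery_toString" above.
assert indexed == "`\b\u0000\u0000\u0000*"
assert prefix_coded_to_int(indexed) == 42
```

The explain code only ever sees the encoded string, while the decoding step is the kind of schema-aware formatting the comment attributes to the FieldType classes.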