From: "Mark Harwood (JIRA)"
To: java-dev@lucene.apache.org
Date: Sat, 25 Oct 2008 05:57:44 -0700 (PDT)
Subject: [jira] Commented: (LUCENE-1424) Add ConstantScorePrefixQuery and ConstantScoreWildcardQuery
Message-ID: <678369234.1224939464294.JavaMail.jira@brutus>
In-Reply-To: <2115535757.1224422583771.JavaMail.jira@brutus>

    [ https://issues.apache.org/jira/browse/LUCENE-1424?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12642615#action_12642615 ]

Mark Harwood commented on LUCENE-1424:
--------------------------------------

>> Are the score differences caused by the rewrite-to-BooleanQuery implementations ever "useful"?

So we need to consider what we are losing: TF, IDF, coordination, length norm, doc boosts.

I can only think of one use case, which relates to the coordination factor. Suppose you have a "category" field for a product, e.g. given Lucene docs for these books:

    Title: Lucene in Action
    Category: /Books/Computing/Languages/Java
              /Books/Computing/InformationRetrieval

    Title: The Long Tail
    Category: /Books/Business/Internet
              /Books/Computing

You might then use a wildcard search of /Books/Computing/* and "Lucene in Action" would rank higher than "The Long Tail", because a BooleanQuery would score a higher coordination factor, suggesting LIA got more hits under this "/Books/Computing.." category. There would still be the issue of IDF potentially skewing results, but the coordination factor is potentially useful here.

I think in general IDF tends to be useless for "auto-expanded" terms, e.g. wildcard, fuzzy etc. Incidentally, we still see that IDF issue in fuzzy queries ranking rare misspellings higher, but that's another issue (one I resolved in contrib's FuzzyLikeThisQuery).

I suppose one other consideration is for people who have created any doc boosts, e.g. trying to use this to boost by date.

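The coordination effect in the book example can be sketched with a small toy model. This is not Lucene code: the class `CoordSketch` and its helpers `expand` and `overlap` are hypothetical names, and TF, IDF, norms and boosts are ignored entirely. The only piece taken from Lucene is the shape of the formula, overlap divided by the number of clauses, which is what DefaultSimilarity's coord computes. The sketch also assumes a prefix of "/Books/Computing" without the trailing slash, so that the bare "/Books/Computing" category on "The Long Tail" is among the expanded terms.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.HashSet;
import java.util.List;
import java.util.Set;
import java.util.TreeSet;

// Toy model only: hypothetical names, no Lucene dependency.
class CoordSketch {

    // Same shape as DefaultSimilarity's coord: matched clauses / total clauses.
    static float coord(int overlap, int maxOverlap) {
        return overlap / (float) maxOverlap;
    }

    // Stand-in for prefix expansion: collect every distinct category term
    // in the "index" that starts with the prefix. Each term becomes one
    // clause of the rewritten boolean query.
    static List<String> expand(String prefix, List<Set<String>> docs) {
        Set<String> terms = new TreeSet<String>();
        for (Set<String> doc : docs) {
            for (String term : doc) {
                if (term.startsWith(prefix)) {
                    terms.add(term);
                }
            }
        }
        return new ArrayList<String>(terms);
    }

    // Number of expanded clauses that a document's category terms match.
    static int overlap(Set<String> docTerms, List<String> clauses) {
        int matched = 0;
        for (String clause : clauses) {
            if (docTerms.contains(clause)) {
                matched++;
            }
        }
        return matched;
    }

    public static void main(String[] args) {
        Set<String> lia = new HashSet<String>(Arrays.asList(
                "/Books/Computing/Languages/Java",
                "/Books/Computing/InformationRetrieval"));
        Set<String> longTail = new HashSet<String>(Arrays.asList(
                "/Books/Business/Internet",
                "/Books/Computing"));

        List<Set<String>> docs = new ArrayList<Set<String>>();
        docs.add(lia);
        docs.add(longTail);

        // Assumed prefix without trailing slash (see the note above).
        List<String> clauses = expand("/Books/Computing", docs);

        System.out.println("Lucene in Action coord: "
                + coord(overlap(lia, clauses), clauses.size()));
        System.out.println("The Long Tail coord:    "
                + coord(overlap(longTail, clauses), clauses.size()));
        // A constant score query would give both documents the same score.
    }
}
```

In this model "Lucene in Action" matches two of the three expanded clauses and "The Long Tail" matches one, so the boolean rewrite ranks the former higher, while a constant score rewrite would not distinguish them.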
I don't think any of these cases necessarily outweigh the benefit to be obtained from switching "wildcard/prefix to constant score queries".

Cheers,
Mark

> Add ConstantScorePrefixQuery and ConstantScoreWildcardQuery
> -----------------------------------------------------------
>
>                 Key: LUCENE-1424
>                 URL: https://issues.apache.org/jira/browse/LUCENE-1424
>             Project: Lucene - Java
>          Issue Type: New Feature
>            Reporter: Mark Miller
>            Assignee: Michael McCandless
>            Priority: Minor
>         Attachments: LUCENE-1424.patch
>
>
> If we want to be able to highlight these queries, they need to be added to Lucene core or contrib (solr's WildCardFilter can be used to create the ConstantScoreWildcardQuery). They are very useful anyway.

--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.