Return-Path: Delivered-To: apmail-lucene-java-user-archive@www.apache.org Received: (qmail 68585 invoked from network); 19 Apr 2007 23:46:29 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (140.211.11.2) by minotaur.apache.org with SMTP; 19 Apr 2007 23:46:29 -0000 Received: (qmail 30904 invoked by uid 500); 19 Apr 2007 23:46:29 -0000 Delivered-To: apmail-lucene-java-user-archive@lucene.apache.org Received: (qmail 30870 invoked by uid 500); 19 Apr 2007 23:46:28 -0000 Mailing-List: contact java-user-help@lucene.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: java-user@lucene.apache.org Delivered-To: mailing list java-user@lucene.apache.org Received: (qmail 30859 invoked by uid 99); 19 Apr 2007 23:46:28 -0000 Received: from herse.apache.org (HELO herse.apache.org) (140.211.11.133) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 19 Apr 2007 16:46:28 -0700 X-ASF-Spam-Status: No, hits=0.0 required=10.0 tests= X-Spam-Check-By: apache.org Received-SPF: neutral (herse.apache.org: local policy) Received: from [169.229.70.167] (HELO rescomp.berkeley.edu) (169.229.70.167) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 19 Apr 2007 16:46:21 -0700 Received: by rescomp.berkeley.edu (Postfix, from userid 1007) id 448CF5B773; Thu, 19 Apr 2007 16:46:01 -0700 (PDT) Received: from localhost (localhost [127.0.0.1]) by rescomp.berkeley.edu (Postfix) with ESMTP id 3ACEE7F403 for ; Thu, 19 Apr 2007 16:46:01 -0700 (PDT) Date: Thu, 19 Apr 2007 16:46:01 -0700 (PDT) From: Chris Hostetter To: java-user@lucene.apache.org Subject: Re: Document Boost In-Reply-To: <4627F377.5050108@affinitycircles.com> Message-ID: References: <10086883.post@talk.nabble.com> <359a92830704191418jc640f5fr6187490790f4e3f7@mail.gmail.com> <10088201.post@talk.nabble.com> <4627F377.5050108@affinitycircles.com> MIME-Version: 1.0 Content-Type: TEXT/PLAIN; charset=US-ASCII X-Virus-Checked: Checked by ClamAV on apache.org The full post Erick alluded too may be helpful... http://mail-archives.apache.org/mod_mbox/lucene-java-user/200609.mbox/%3CPine.LNX.4.58.0609271134380.32280@hal.rescomp.berkeley.edu%3E in general, if your goal is that words in the "metadata" of a document should be worth more then words in the "body" then you should have two fields: "metadata" and "body", you shoudl query for the same word in both fields, and your query should have a query boost on the "metadata" part. : way it seems to work, is that if you boost a field then you have to : actually specify that field in your query to benefit from that field correct ... you are saying "this documents field_X is better then other document's field_X" .. but that info only comes into play if you actually use "field_X" : ignored. I hacked around this by just adding the field's text to the : default search field n times where n is the boost for that field. I : seriously doubt that this the ideal way to do it, but I couldn't figure : out how to do it other than reforming all of my queries to search all : the fields. that is a feasible solution to the use case where "this doc's use of the word FOO in field BAR is more significant then the use of the word FOO in field BAR for any other docs" but 90% of the time those use cases can be better solved by making a BAR_significant field that only contains the significant words in BAR and querying both. for the other 10% of the time, you'll have to wait for Payload based queries to come along, so when you add FOO to BAR you can give it a payload that says how important it is. (or so the Query Payload people say ... i choose to believe them even though i don't understand them) -Hoss --------------------------------------------------------------------- To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org For additional commands, e-mail: java-user-help@lucene.apache.org