Return-Path: Delivered-To: apmail-lucene-java-user-archive@www.apache.org Received: (qmail 75425 invoked from network); 31 Jul 2007 18:39:03 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (140.211.11.2) by minotaur.apache.org with SMTP; 31 Jul 2007 18:39:03 -0000 Received: (qmail 98554 invoked by uid 500); 31 Jul 2007 18:38:56 -0000 Delivered-To: apmail-lucene-java-user-archive@lucene.apache.org Received: (qmail 98519 invoked by uid 500); 31 Jul 2007 18:38:56 -0000 Mailing-List: contact java-user-help@lucene.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: java-user@lucene.apache.org Delivered-To: mailing list java-user@lucene.apache.org Received: (qmail 98508 invoked by uid 99); 31 Jul 2007 18:38:56 -0000 Received: from Unknown (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 31 Jul 2007 11:38:56 -0700 X-ASF-Spam-Status: No, hits=2.0 required=10.0 tests=HTML_MESSAGE,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (athena.apache.org: domain of erickerickson@gmail.com designates 209.85.128.184 as permitted sender) Received: from [209.85.128.184] (HELO fk-out-0910.google.com) (209.85.128.184) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 31 Jul 2007 18:38:50 +0000 Received: by fk-out-0910.google.com with SMTP id z23so468780fkz for ; Tue, 31 Jul 2007 11:38:28 -0700 (PDT) DKIM-Signature: a=rsa-sha1; c=relaxed/relaxed; d=gmail.com; s=beta; h=domainkey-signature:received:received:message-id:date:from:to:subject:in-reply-to:mime-version:content-type:references; b=LsWEEgf+MHPX6W43OGU2z2Fdzb3NFkI9cgPlp046W+vwqYLCTA+cf8Yq08TvhWjkltf6FFHOOJjiMd2UbF5Z2WgviXtW+JwETpP+4GaJhcRVtCs9g+5cjG++6IN2zReWRbwn/YJsXrImv4/UWOTIPUKq4Vfi1Zap/s0KrhBqXcA= DomainKey-Signature: a=rsa-sha1; c=nofws; d=gmail.com; s=beta; h=received:message-id:date:from:to:subject:in-reply-to:mime-version:content-type:references; b=Sr1XV6G0D4dXtopqAk6zxhtMatngcTgSrHm7ktv7DBkn0vRlDH72boT5mropdsvZOoJLcD5QgCqgMXrhJk/JAhzGCeIj2CPXTffjN1+npkWJHTFn0NaTzqKmaHJ27JRX8KIzf5wL2Tdn0RXwiLd2je2N7LhHPdyGPeDb2EzNycE= Received: by 10.82.108.9 with SMTP id g9mr6026233buc.1185907108071; Tue, 31 Jul 2007 11:38:28 -0700 (PDT) Received: by 10.82.190.14 with HTTP; Tue, 31 Jul 2007 11:38:28 -0700 (PDT) Message-ID: <359a92830707311138r50dc8504jdac65c344dbdb2b4@mail.gmail.com> Date: Tue, 31 Jul 2007 14:38:28 -0400 From: "Erick Erickson" To: java-user@lucene.apache.org Subject: Re: Lucene Field score value In-Reply-To: MIME-Version: 1.0 Content-Type: multipart/alternative; boundary="----=_Part_34912_7860625.1185907108038" References: <359a92830707311122i54bc49d4y9a8f34b068aaca9e@mail.gmail.com> X-Virus-Checked: Checked by ClamAV on apache.org ------=_Part_34912_7860625.1185907108038 Content-Type: text/plain; charset=ISO-8859-1 Content-Transfer-Encoding: 7bit Content-Disposition: inline Boost the other three fields at search time. Boosting during index time expresses "this document's title is worth more than other doucments' titles". Boosting during search time expresses "I care about matches on this clause more than I do on other clauses". Will it help? How should I know? It's *your* application and *you* are the one who is dissatisfied with the current scoring . All I can say is try it and find out. You might consider using Luke to try various boosts without having to mess with too much code. Erick On 7/31/07, Askar Zaidi wrote: > > Boosting during Indexing or boosting during search ? > > I have 4 fields: > > {tags},{title},{summary},{contents} > > Typically a phrase occurs too many times in contents as compared to the > other fields. If I get the score of contents field , I can pass it through > an adjuster function which will bring the score down. Something like: > > public static double adjuster(double count){ > > double newCount; > newCount = 1/Math.exp(count); > System.out.println(newCount); > return newCount; > > } > > Do you mean I could boost the values of the other 3 fields ? Will that > help > ? Would there be a way to bring down the score of the contents field ? > > thanks, > AZ > > > On 7/31/07, Erick Erickson wrote: > > > > Wouldn't boosting handle this for you? > > > > On 7/31/07, Askar Zaidi wrote: > > > > > > To be more specific: > > > > > > I want to retrieve the scores of individual fields inside a document > so > > > that > > > I can manipulate the score of one field. This is the requirement of my > > > application. After the manipulation I can add these scores and then > show > > > the > > > total. > > > > > > thanks, > > > > > > AZ > > > > > > On 7/31/07, Askar Zaidi wrote: > > > > > > > > Hey guys, > > > > > > > > I was wondering if there is a way to retrieve score of a field in a > > > > document ? > > > > > > > > If my document looks like this: > > > > > > > > {itemID},{field 1},{field 2} > > > > > > > > I'd like to get score of individual fields 1 and 2 rather than the > > score > > > > of the entire document. > > > > > > > > Is it possible ? > > > > > > > > thanks, > > > > AZ > > > > > > > > > > ------=_Part_34912_7860625.1185907108038--