Date: Thu, 4 Apr 2013 15:26:52 -0700 (PDT)
From: Chris Hostetter
To: Lucene Users, paul_t100@fastmail.fm
Subject: Re: Why does index boosting a field to 2.0f on a document have such a dramatic effect

: At index time I boost the alias field of a small set of documents, setting the
: boost to 2.0f, which I thought meant equivalent to doubling the score this doc
: would get over another doc, everything else being equal.

1) You haven't shown us enough details to be certain, but based on the code you've provided it looks like you are adding a boost for *each* field instance named "alias" if the value of artistGuid is in your artistGuIdSet...

: if(artistGuIdSet.contains(artistGuid)) {
:     for(IndexableField indexablefield:doc.getFields())
:     {
:         if(indexablefield.name().equals(ArtistIndexField.ALIAS.getName()))
:         {
:             Field field = (Field)indexablefield;
:             field.setBoost(ARTIST_DOC_BOOST);

...so a doc with N values in the "alias" field is going to have that boost applied N times. Lucene multiplies the boosts of same-named field instances together, so the combined field boost is 2^N rather than 2.

2) Looking at the URL you mentioned

: http://search.musicbrainz.org/?type=artist&query=Jean&explain=true

...the debug explanation currently produced by that URL says...

6.4894321E10 = (MATCH) weight(alias:jean in 7610) [MusicbrainzSimilarity], result of:
  ...
  7.5161928E9 = fieldNorm(doc=7610)

You need to look at your "MusicbrainzSimilarity" class and its fieldNorm method to determine for certain why it's producing such large values. We have no idea how that's implemented.

-Hoss
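
To illustrate point 1, here is a minimal sketch in plain Java (no Lucene dependency) of how a per-instance boost compounds when a field name occurs several times in one document. It assumes the behaviour described in the TFIDFSimilarity documentation, where the boosts of same-named fields are multiplied together. The class BoostDemo and the method combinedBoost are hypothetical names used only for this illustration, and the sketch says nothing about what MusicbrainzSimilarity actually does with the resulting value.

```java
public class BoostDemo {

    // Hypothetical helper: models the combined index-time boost of a field
    // name that occurs `instances` times in one document, where every
    // instance was given `perInstanceBoost` via setBoost().
    static float combinedBoost(int instances, float perInstanceBoost) {
        float boost = 1.0f;
        for (int i = 0; i < instances; i++) {
            // Assumption: boosts of same-named field instances are multiplied.
            boost *= perInstanceBoost;
        }
        return boost;
    }

    public static void main(String[] args) {
        // Print how quickly a 2.0f boost grows with the number of alias values.
        for (int n : new int[] {1, 5, 10, 30}) {
            System.out.println(n + " alias values -> combined boost "
                    + combinedBoost(n, 2.0f));
        }
    }
}
```

If this is what is happening in your indexing code, boosting only one "alias" instance per document (rather than every instance) should keep the combined boost at 2.0f.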