Subject: RE: Sorting in lucene through Document boosting
Date: Mon, 15 Sep 2008 13:08:39 +0100
From: "Dragan Jotanovic" <Dragan.Jotanovic@DIOSPHERE.com>
To: <java-user@lucene.apache.org>

Thanks Chris.

I made a simple Similarity implementation:

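	// Ignore field length so the norm carries only the document boost.
	public float lengthNorm(String fieldName, int numTokens) {
		return 1f;
	}

	// Ignore term frequency so repeated matches do not change the score.
	public float tf(float freq) {
		return 1f;
	}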

My boost values are calculated simply by calling:
	document.setBoost(DefaultSimilarity.decodeNorm((byte)rank));

It works perfectly. I just need to check whether I gain anything from
this in terms of performance and resource consumption.
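
For anyone wondering why decodeNorm((byte)rank) gives usable boosts, here
is a minimal standalone sketch. It assumes the norm byte uses the 3-bit
mantissa / 5-bit exponent encoding that, as far as I know, Lucene's
SmallFloat applies; check this against your Lucene version. The names
NormBytes, encode, decode and boostForRank are made up for illustration
and are not Lucene API. It also assumes field boosts are left at their
default, so the stored norm equals the document boost.

```java
// Standalone sketch: re-implements an 8-bit float encoding (3 mantissa bits,
// 5 exponent bits) so that it runs without Lucene on the classpath.
final class NormBytes {

    // Exponent offset, shifted past the 3 mantissa bits.
    private static final int FLOOR = (63 - 15) << 3;

    /** Encodes a float into one byte, truncating the mantissa to 3 bits. */
    static byte encode(float f) {
        int bits = Float.floatToRawIntBits(f);
        int small = bits >> (24 - 3);
        if (small <= FLOOR) {
            // Zero or negative maps to 0; positive underflow maps to 1.
            return (bits <= 0) ? (byte) 0 : (byte) 1;
        }
        if (small >= FLOOR + 0x100) {
            return (byte) -1; // overflow: clamp to the largest byte value
        }
        return (byte) (small - FLOOR);
    }

    /** Decodes a byte back to the float it represents. */
    static float decode(byte b) {
        if (b == 0) {
            return 0.0f;
        }
        int bits = (b & 0xff) << (24 - 3);
        bits += (63 - 15) << 24;
        return Float.intBitsToFloat(bits);
    }

    /** Boost for a rank in 0..255, mirroring decodeNorm((byte) rank). */
    static float boostForRank(int rank) {
        return decode((byte) rank);
    }

    public static void main(String[] args) {
        // Arbitrary boosts can collapse to the same byte once encoded.
        System.out.println("1.0 and 1.1 collide: " + (encode(1.0f) == encode(1.1f)));

        // Boosts produced by decoding a byte survive re-encoding and stay ordered.
        for (int rank = 1; rank <= 255; rank++) {
            float boost = boostForRank(rank);
            if ((encode(boost) & 0xff) != rank || boost <= boostForRank(rank - 1)) {
                throw new IllegalStateException("rank " + rank + " is not preserved");
            }
        }
        System.out.println("all non-zero ranks round-trip in order");
    }
}
```

Under these assumptions the approach distinguishes at most 256 rank
values, and rank 0 decodes to a boost of 0, so it is probably best to
start ranks at 1.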


-----Original Message-----
From: Chris Hostetter [mailto:hossman_lucene@fucit.org]
Sent: Saturday, September 13, 2008 11:59 PM
To: java-user@lucene.apache.org
Subject: Re: Sorting in lucene through Document boosting


: I thought of setting boost value for documents at index time, with the
: value of my sort field, and then making custom Similarity class which
: would disregard Lucene scoring and take in evaluation only this document
: boost.

the general idea should work, but a few things to pay attention to...

1) document boosts are folded into the fieldNorm, so make sure you don't
"setOmitNorms(true)"

2) your lengthNorm function needs to return a constant

3) you'll need to adjust your boost values so that when the fieldNorms
are converted to the internal 'byte' representation they are still unique
... with some simple experimentation you can find an approach that helps
you generate a mapping from 1,2,3,4,5... to a,b,c,d,... where a<b<c<....


-Hoss

---------------------------------------------------------------------
To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-user-help@lucene.apache.org

