lucene-java-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Marcus Herou <marcus.he...@tailsweep.com>
Subject Re: exponential boosts
Date Thu, 23 Apr 2009 21:08:13 GMT
But perhaps one could use a FieldCache somehow ?

/M

On Thu, Apr 23, 2009 at 11:07 PM, Marcus Herou
<marcus.herou@tailsweep.com>wrote:

> Yes I have considered it for 30 minutes :)
>
> How do one apply that in the real world ?
>
> If the only thing I get access to is the actual docId would it not be
> really expensive to get the Document itself from the index and later use
> some field in it as external lookup in some optimized structure for this ?
>
> Example, pseudo:
>
> *public* *float* customScore(*int* doc, *float* subQueryScore, *float* valSrcScore)
>
> {
>         *Document document = indexSearcher.doc(doc);
>         float score = MyOptimalHashStructure.getScore(document.get("someId"));
>         return score**subQueryScore*;*
>
> }
>
> This would not scale well right ? I mean gathering scores through 100M docs
> would take some time I guess ? Or even 1M docs...
>
> Please push me in the right direction.
>
> Cheers
>
> //Marcus
>
>
>
>
>
>
>
> On Thu, Apr 23, 2009 at 10:58 PM, Doron Cohen <cdoronc@gmail.com> wrote:
>
>> >
>> > I think we are doing similar things, at least I am trying to implement
>> > document boosting with pagerank. Having issues of howto appky the
>> scoring
>> > of
>> > specific docs without actually reindex them. I feel something should be
>> > done
>> > at query time which looks at external data but do not know howto
>> implement
>> > that. Do you ?
>> >
>>
>> Have you considered CustomScoreQuery in o.a.l.search.function ? It should
>> allow
>> incorporating external scores.
>>
>> Doron
>>
>
>
>
> --
> Marcus Herou CTO and co-founder Tailsweep AB
> +46702561312
> marcus.herou@tailsweep.com
> http://www.tailsweep.com/
> http://blogg.tailsweep.com/
>



-- 
Marcus Herou CTO and co-founder Tailsweep AB
+46702561312
marcus.herou@tailsweep.com
http://www.tailsweep.com/
http://blogg.tailsweep.com/

Mime
  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message