incubator-lucy-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From goran kent <gorank...@gmail.com>
Subject Re: [lucy-user] Collapsing search results based on a field
Date Sat, 17 Sep 2011 05:46:12 GMT
On Sat, Sep 17, 2011 at 12:56 AM, Marvin Humphrey
<marvin@rectangular.com> wrote:
> On Fri, Sep 16, 2011 at 03:00:21PM +0200, goran kent wrote:
>> Any support for collapsing duplicate documents based on a field?
>
> I wrote a DedupingSearcher class for KinoSearch a while ago that did exactly
> this, and I'd be happy to contribute it to the ASF.  It will take some
> modernizing to get it compatible with Lucy, though.

Any possibility of squeezing that into your schedule?

>
>> Such a thing possible?
>
> The algorithm is to rerun the search if there is not sufficient diversity in
> the search results, adding exclusions to the query each time to suppress the
> unwanted hits.

ouch, that doesn't sound good for performance.  Am I right?

Mime
View raw message