incubator-lucy-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From goran kent <gorank...@gmail.com>
Subject [lucy-user] Collapsing search results based on a field
Date Fri, 16 Sep 2011 13:00:21 GMT
Hi,

Any support for collapsing duplicate documents based on a field?

For example, if the following search result is returned:

$hit->{site} = www.abc.com
$hit->{myscore} = 100
title1...

$hit->{site} = www.abc.com
$hit->{myscore} = 99
title2...

$hit->{site} = www.cnn.com
...

I may want to collapse (de-duplicate) the above so it shows:

$hit->{site} = www.abc.com
$hit->{myscore} = 100
title1...

$hit->{site} = www.cnn.com
...


ie, the lowest scoring (perhaps based on SortSpec/SortRule) results
which share the same content for the field 'title' should be
suppressed.

Such a thing possible?

gk

Mime
View raw message