lucene-java-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Anshum <ansh...@gmail.com>
Subject Re: Limiting search result for web search engine
Date Wed, 03 Feb 2010 01:57:16 GMT
Hi Mike,
Not really through queries, but you may do this by writing a custom
collector. You'd need some supporting data structure to mark/hash the
occurrence of a domain in your result set.

--
Anshum Gupta
Naukri Labs!
http://ai-cafe.blogspot.com

The facts expressed here belong to everybody, the opinions to me. The
distinction is yours to draw............


On Wed, Feb 3, 2010 at 6:56 AM, Mike Polzin <mikepolzin@yahoo.com> wrote:

> I am working on building a web search engine and I would like to build a
> reults page similar to what Google does. The functionality I am looking to
> include is what I refer to a "rolling up" sites, meaning that even if a
> particular site (defined by its base URL) has many relevent hits on various
> pages for the searches keywords, that site is only shown once in the results
> listing with a link to the most relevent hit on that site. What I do not
> want is to have one site dominate a search results page.
>
> Does it make sense to just do the search, get the hits list and then
> programatically remove the results which, although they meet the search
> criteria, are not as relevent? Is there a way to do this through queries?
>
> Thanks in advance!
>
> Mike
>
>
>

Mime
  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message