nutch-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Susam Pal" <>
Subject Re: site alias
Date Fri, 06 Jul 2007 07:32:26 GMT
I have faced this issue. I block the duplicate domain using the URL
filters. So only one domain is crawled by the bot and the other domain
is ignored.

Susam Pal

On 7/6/07, Nuther <> wrote:
> Hi,
> I was wondering if nutch has alias option
> Let's say we have two domains and that point on
> one site. How can I tell nutch that they pooint on that site? This is problem
> because there are a lot of duplicates in search results.
> Thanks.
> --
> Regards,
>  Nuther                

View raw message