nutch-user mailing list archives

From Markus Jelsma
Subject Re: Ranking of injected urls vs crawled urls
Date Tue, 06 Dec 2011 09:18:59 GMT
What kind of scoring are you using? 

> Hi,
> I am using Nutch/Solr to crawl and search some websites. A large number of
> URLs (all from the same domain) were injected directly, and the rest (from
> different domains) were obtained by crawling. The problem is that the
> injected URLs seem to rank higher than the others, even when they are a
> worse match for the query.
> In nutch-default.xml it looks like the default score for injected URLs is
> the same as for everything else. Is something else driving the scores of
> the crawled URLs below the default? How can I correct this?
> Thanks,
> Harris Rappaport
