nutch-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Markus Jelsma <markus.jel...@openindex.io>
Subject Re: Generator: 0 records selected for fetching, exiting ...
Date Thu, 08 Dec 2011 11:55:36 GMT
Strange. What happens without -topN ?

On Thursday 08 December 2011 03:50:20 Rafael Pappert wrote:
> Hello List,
> 
> my CrawlDb contains a few urls:
> 
> nutch readdb crawl/crawldb -stats
> CrawlDb statistics start: crawl/crawldb
> Statistics for CrawlDb: crawl/crawldb
> TOTAL urls:	1832
> retry 0:	1832
> min score:	1.0
> avg score:	1.0
> max score:	1.0
> status 1 (db_unfetched):	1832
> CrawlDb statistics: done
> 
> but the generator always return "0 records selected" even with the
> -noFilter -noNorm Parameter?
> 
> nutch generate crawl/crawldb crawl/segments -topN 100 -noNorm -noFilter
> Generator: starting at 2011-12-08 03:37:20
> Generator: Selecting best-scoring urls due for fetch.
> Generator: filtering: false
> Generator: normalizing: false
> Generator: topN: 100
> Generator: 0 records selected for fetching, exiting …
> 
> What prevents the generator from selecting urls for fetching?
> 
> Any hints?
> 
> Greets,
> Rafael.

-- 
Markus Jelsma - CTO - Openindex

Mime
View raw message