nutch-user mailing list archives

From Rafael Pappert ...@fwpsystems.com>
Subject Re: Generator: 0 records selected for fetching, exiting ...
Date Thu, 08 Dec 2011 21:56:57 GMT
I tried it with and without the -topN parameter … same result.
Now I removed the crawldb and did a new inject, and everything works fine …

I don't know why :(
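
For anyone hitting the same thing, the reset described above can be sketched roughly as follows. This is only a sketch under assumptions: the `urls` seed directory and the `crawldb.old` backup name do not come from this thread, and the old CrawlDb is moved aside here rather than removed so it can still be inspected later.

```shell
# Sketch only: move the old CrawlDb aside, then re-inject the seed list.
# The seed directory and the ".old" backup name are assumptions.
reset_crawldb() {
  crawl_dir="$1"
  seed_dir="$2"

  # Keep the old CrawlDb for later inspection instead of deleting it.
  if [ -d "$crawl_dir/crawldb" ]; then
    rm -rf "$crawl_dir/crawldb.old"
    mv "$crawl_dir/crawldb" "$crawl_dir/crawldb.old"
  fi

  # Only call Nutch if the launcher is actually on PATH.
  if command -v nutch >/dev/null 2>&1; then
    nutch inject "$crawl_dir/crawldb" "$seed_dir" &&
      nutch generate "$crawl_dir/crawldb" "$crawl_dir/segments" -topN 100
  else
    echo "nutch not found on PATH; skipped inject and generate" >&2
  fi
}
```

With the layout used in this thread it would be called as `reset_crawldb crawl urls`.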



On 8/Dec/2011, at 12:55, Markus Jelsma wrote:

> Strange. What happens without -topN ?
> 
> On Thursday 08 December 2011 03:50:20 Rafael Pappert wrote:
>> Hello List,
>> 
>> My CrawlDb contains a few URLs:
>> 
>> nutch readdb crawl/crawldb -stats
>> CrawlDb statistics start: crawl/crawldb
>> Statistics for CrawlDb: crawl/crawldb
>> TOTAL urls:	1832
>> retry 0:	1832
>> min score:	1.0
>> avg score:	1.0
>> max score:	1.0
>> status 1 (db_unfetched):	1832
>> CrawlDb statistics: done
>> 
>> but the generator always returns "0 records selected", even with the
>> -noFilter and -noNorm parameters.
>> 
>> nutch generate crawl/crawldb crawl/segments -topN 100 -noNorm -noFilter
>> Generator: starting at 2011-12-08 03:37:20
>> Generator: Selecting best-scoring urls due for fetch.
>> Generator: filtering: false
>> Generator: normalizing: false
>> Generator: topN: 100
>> Generator: 0 records selected for fetching, exiting …
>> 
>> What prevents the generator from selecting urls for fetching?
>> 
>> Any hints?
>> 
>> Greets,
>> Rafael.
> 
> -- 
> Markus Jelsma - CTO - Openindex

