I tried it with and without -topN Parameter … same result.
Now I removed the crawldb and did a new inject and ever things works fine …
don't know :(
On 8/Dec/ 2011, at 12:55 , Markus Jelsma wrote:
> Strange. What happens without -topN ?
>
> On Thursday 08 December 2011 03:50:20 Rafael Pappert wrote:
>> Hello List,
>>
>> my CrawlDb contains a few urls:
>>
>> nutch readdb crawl/crawldb -stats
>> CrawlDb statistics start: crawl/crawldb
>> Statistics for CrawlDb: crawl/crawldb
>> TOTAL urls: 1832
>> retry 0: 1832
>> min score: 1.0
>> avg score: 1.0
>> max score: 1.0
>> status 1 (db_unfetched): 1832
>> CrawlDb statistics: done
>>
>> but the generator always return "0 records selected" even with the
>> -noFilter -noNorm Parameter?
>>
>> nutch generate crawl/crawldb crawl/segments -topN 100 -noNorm -noFilter
>> Generator: starting at 2011-12-08 03:37:20
>> Generator: Selecting best-scoring urls due for fetch.
>> Generator: filtering: false
>> Generator: normalizing: false
>> Generator: topN: 100
>> Generator: 0 records selected for fetching, exiting …
>>
>> What prevents the generator from selecting urls for fetching?
>>
>> Any hints?
>>
>> Greets,
>> Rafael.
>
> --
> Markus Jelsma - CTO - Openindex
|