nutch-user mailing list archives

From Patrick Mézard
Subject Reconfiguring scoring plugin
Date Thu, 23 Jul 2020 09:09:09 GMT

I have crawled a first document set using a combination of the depth and opic scoring plugins.
I would like to add the similarity scoring plugin, but the crawldb scores would obviously have to
be updated for it to take effect in subsequent "generate" phases. Is there a recommended
approach to achieve this?

My current understanding is that, since the similarity plugin operates in the parse phase, I would
have to remove all parsed data from the segments, re-parse them, and run updatedb. Would that work?
Is there anything smarter?
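
To make the idea concrete, here is a rough sketch of the steps described above, written as a small Python helper that only builds the commands and does not run them. It rests on several assumptions that are not confirmed in this message: segments live on a local filesystem, the parse output sits in the usual `crawl_parse`, `parse_data` and `parse_text` subdirectories, the fetched content was stored so re-parsing is possible, and the similarity plugin has already been enabled in the configuration. The helper name `reparse_plan` and the example paths are hypothetical. Whether re-running updatedb on already-processed segments gives the desired scores is exactly the open question.

```python
from pathlib import PurePosixPath

# Subdirectories written by the parse step (assumption: standard Nutch 1.x
# segment layout on a local filesystem).
PARSE_DIRS = ("crawl_parse", "parse_data", "parse_text")


def reparse_plan(crawldb, segments, nutch="bin/nutch"):
    """Return the commands (as argv lists) for the re-parse idea.

    Hypothetical helper: it only builds the plan, nothing is executed.
    """
    plan = []
    for segment in segments:
        seg = PurePosixPath(segment)
        # 1. Remove the old parse output so the segment can be parsed again.
        plan.append(["rm", "-r"] + [str(seg / d) for d in PARSE_DIRS])
        # 2. Re-parse the segment; scoring filters active at parse time
        #    would get a chance to run here.
        plan.append([nutch, "parse", str(seg)])
    # 3. Feed the re-parsed segments back into the crawldb.
    plan.append([nutch, "updatedb", crawldb]
                + [str(PurePosixPath(s)) for s in segments])
    return plan


if __name__ == "__main__":
    # Example paths are placeholders, not taken from a real crawl.
    for argv in reparse_plan("crawl/crawldb", ["crawl/segments/20200701120000"]):
        print(" ".join(argv))
```

Printing the plan first makes it easy to review the destructive `rm` step before running anything against real segments.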

Patrick Mézard
