nutch-commits mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Apache Wiki <>
Subject [Nutch Wiki] Update of "DevelopmentCommandLineOptions" by JeffRitchie
Date Sat, 04 Mar 2006 21:47:49 GMT
Dear Wiki user,

You have subscribed to a wiki page or wiki category on "Nutch Wiki" for change notification.

The following page has been changed by JeffRitchie:

The comment on the change is:
New Page - Removed Options no longer in nutch script.

New page:
= Command Line Options of nutch-0.8-dev/bin/nutch =

See each entry for datails of the command arguments and options.

||["nutch-0.8-dev/bin/nutch crawl"]||One-step crawler for intranets||
||["nutch-0.8-dev/bin/nutch readdb"]||Read / dump crawldb||
||["nutch-0.8-dev/bin/nutch readlinkdb"]||Read / dump linkdb||
||["nutch-0.8-dev/bin/nutch inject"]||Inject new urls into the crawldb||
||["nutch-0.8-dev/bin/nutch generate"]||Generate new segments to fetch||
||["nutch-0.8-dev/bin/nutch fetch"]||Fetch a segment's pages||
||["nutch-0.8-dev/bin/nutch parse"]||Parse contents in one segment||
||["nutch-0.8-dev/bin/nutch segread"]||Read data in an existing segment||
||["nutch-0.8-dev/bin/nutch updatedb"]||Updates the crawldb from a segment||
||["nutch-0.8-dev/bin/nutch invertlinks"]||Create or update a linkdb from a segment or segments||
||["nutch-0.8-dev/bin/nutch index"]||Run the indexer on a segment's fetcher output||
||["nutch-0.8-dev/bin/nutch merge"]||Merge several segment indexes||
||["nutch-0.8-dev/bin/nutch dedup"]||Deletes duplicate documents in a set of segment indexes||
||["nutch-0.8-dev/bin/nutch plugin"]||Load a plugin and run one of its classes main()||
||["nutch-0.8-dev/bin/nutch server"]||Run a search server||
||                          ||                                               ||

View raw message