lucene-solr-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From alessio crisantemi <alessio.crisant...@gmail.com>
Subject Re: nutch and solr
Date Wed, 22 Feb 2012 21:05:32 GMT
thanks for your reply, but don't work.
the same message: can't convert empty path

and more: impossible find class org.apache.nutch.crawl.injector

..


Il giorno 22 febbraio 2012 06:14, tamanjit.bindra@yahoo.co.in <
tamanjit.bindra@yahoo.co.in> ha scritto:

> Try this command.
>
>  bin/nutch crawl urls/<folder name>/<url file>.txt -dir crawl/<folders
> name>
> -threads 10 -depth 2 -topN 1000
>
> Your folder structure will look like this:
>
> <nutch folder>-- urls -- <folder name>-- <url file>.txt
>                    |
>                    |
>                     -- crawl -- <folder name>
>
> The folder name will be for different domains. So for each domain folder in
> urls folder there has to be a corresponding folder (with the same name) in
> the crawl folder.
>
>
> --
> View this message in context:
> http://lucene.472066.n3.nabble.com/nutch-and-solr-tp3765166p3765607.html
> Sent from the Solr - User mailing list archive at Nabble.com.
>

Mime
  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message