nutch-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Andre Schild" <a.sch...@aarboard.ch>
Subject AW: How do I enable PDF/Word etc. parsing in nutch?
Date Wed, 04 May 2005 16:02:40 GMT
> One thing:
> 
> Create a <nutch_home>/nutch-site.xml instead of modifying 
> nutch-default.xml
> 

Another one:

put in higher value for http.content.limit in you config file,
otherwise downloads of larger PDF's will not work.

André



Mime
View raw message