nutch-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Jason Manfield <rarish...@yahoo.com>
Subject Re: How do I enable PDF/Word etc. parsing in nutch?
Date Mon, 02 May 2005 17:36:04 GMT
Which config file? Is it mime.types? Can you pl give me an example? I would like detect the
file type (e.g. pdf) and then apply my own doc conversion utility for further processing.
 
Thanks
 
Jason

EM <emilijan@cpuedge.com> wrote:
add it to the list of plugins in your config file

Jason Manfield wrote:

> 
>__________________________________________________
>Do You Yahoo!?
>Tired of spam? Yahoo! Mail has the best spam protection around 
>http://mail.yahoo.com 
> 
>

		
---------------------------------
Do you Yahoo!?
 Yahoo! Small Business - Try our new resources site! 
Mime
  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message