nutch-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Jason Manfield <>
Subject Re: How do I enable PDF/Word etc. parsing in nutch?
Date Mon, 02 May 2005 17:36:04 GMT
Which config file? Is it mime.types? Can you pl give me an example? I would like detect the
file type (e.g. pdf) and then apply my own doc conversion utility for further processing.

EM <> wrote:
add it to the list of plugins in your config file

Jason Manfield wrote:

>Do You Yahoo!?
>Tired of spam? Yahoo! Mail has the best spam protection around 

Do you Yahoo!?
 Yahoo! Small Business - Try our new resources site! 
  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message