nutch-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Seth Taylor" <>
Subject ASP Parser
Date Tue, 10 May 2005 15:53:18 GMT
I've recently just installed and configured Nutch from source.  From
what I've read by default, Nutch will parse text and html based
documents only.  I have a site I'm trying to crawl which is all asp
pages.  I put the asp mime type in the mime-type.xml document.  What
else do I need to do in order for Nutch to crawl asp pages?




  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message