nutch-user mailing list archives

From Andrzej Bialecki ...@getopt.org>
Subject Re: [nutch 0.5] frames
Date Thu, 07 Jul 2005 18:04:21 GMT
Philipp Suter wrote:
> does anybody know how to crawl frames? Or how to extend nutch to be able 
> to crawl frames? We are using the api.

The development version (available from SVN) should handle frames just 
fine, i.e. it should follow the src=... attributes in frames in order to 
retrieve the frame contents. Please download the nightly snapshot and 
try it out.
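
To illustrate what "following the src attribute" means, here is a rough
sketch in Python. It is not Nutch's parser code (Nutch is written in
Java); the names FrameLinkExtractor and extract_frame_urls are made up
for this example, and treating iframe the same as frame is my own
assumption. It only shows the general idea: find each frame tag, read
its src, and resolve it against the page URL so the crawler can fetch it.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin


class FrameLinkExtractor(HTMLParser):
    """Hypothetical helper: collects absolute URLs from frame src attributes."""

    def __init__(self, base_url):
        super().__init__()
        self.base_url = base_url
        self.frame_urls = []

    def handle_starttag(self, tag, attrs):
        # HTMLParser lowercases tag and attribute names for us.
        # Including "iframe" is an assumption, not something the thread states.
        if tag in ("frame", "iframe"):
            for name, value in attrs:
                if name == "src" and value:
                    # Resolve relative src values against the page URL.
                    self.frame_urls.append(urljoin(self.base_url, value))


def extract_frame_urls(html, base_url):
    """Return the frame URLs found in html, in document order."""
    parser = FrameLinkExtractor(base_url)
    parser.feed(html)
    parser.close()
    return parser.frame_urls
```

A crawler would then queue each returned URL for fetching, the same way
it queues ordinary links found in anchor tags.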


-- 
Best regards,
Andrzej Bialecki     <><
  ___. ___ ___ ___ _ _   __________________________________
[__ || __|__/|__||\/|  Information Retrieval, Semantic Web
___|||__||  \|  ||  |  Embedded Unix, System Integration
http://www.sigram.com  Contact: info at sigram dot com

