nutch-dev mailing list archives

From "Chris A. Mattmann (JIRA)" <j...@apache.org>
Subject [jira] Commented: (NUTCH-30) rss feed parser
Date Mon, 04 Apr 2005 17:17:16 GMT
     [ http://issues.apache.org/jira/browse/NUTCH-30?page=comments#action_62111 ]
     
Chris A. Mattmann commented on NUTCH-30:
----------------------------------------

Hi Folks,

One more comment on the parse-rss plugin that I've just attached: I also included a JUnit
test drawn from John X's JUnit test for the PDF parser. The RSS JUnit test parses a sample
RSS file and checks that it reads the correct number of outlinks, and the correct outlinks,
from the RSS file.
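
For readers without the attachment, the kind of check described above might look
roughly like the sketch below. It is not the attached test: it uses only the JDK's
DOM parser rather than Nutch's parser API or JUnit, and the class name
RssOutlinkCheck, the method extractOutlinks, and the inline sample feed are all
made up for illustration.

```java
import java.io.ByteArrayInputStream;
import java.nio.charset.StandardCharsets;
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;
import javax.xml.parsers.DocumentBuilder;
import javax.xml.parsers.DocumentBuilderFactory;
import org.w3c.dom.Document;
import org.w3c.dom.NodeList;

public class RssOutlinkCheck {

    // Hypothetical sample feed; the real test reads a sample RSS file instead.
    static final String SAMPLE_RSS =
        "<rss version=\"2.0\"><channel>"
        + "<title>Sample</title>"
        + "<link>http://example.com/</link>"
        + "<item><title>One</title><link>http://example.com/one</link></item>"
        + "<item><title>Two</title><link>http://example.com/two</link></item>"
        + "</channel></rss>";

    // Collects the text of every <link> element, in document order.
    static List<String> extractOutlinks(String rssXml) throws Exception {
        DocumentBuilder builder =
            DocumentBuilderFactory.newInstance().newDocumentBuilder();
        Document doc = builder.parse(
            new ByteArrayInputStream(rssXml.getBytes(StandardCharsets.UTF_8)));
        NodeList nodes = doc.getElementsByTagName("link");
        List<String> links = new ArrayList<>();
        for (int i = 0; i < nodes.getLength(); i++) {
            links.add(nodes.item(i).getTextContent().trim());
        }
        return links;
    }

    public static void main(String[] args) throws Exception {
        List<String> expected = Arrays.asList(
            "http://example.com/",
            "http://example.com/one",
            "http://example.com/two");
        List<String> actual = extractOutlinks(SAMPLE_RSS);

        // First check the number of outlinks, then the outlinks themselves.
        if (actual.size() != expected.size()) {
            throw new AssertionError("wrong number of outlinks: " + actual.size());
        }
        if (!actual.equals(expected)) {
            throw new AssertionError("wrong outlinks: " + actual);
        }
        System.out.println("outlinks ok");
    }
}
```

Checking the count before the contents keeps the failure message specific: a
wrong count points at missed or duplicated items, a wrong list at bad URL
extraction.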

Thanks,
  Chris


> rss feed parser
> ---------------
>
>          Key: NUTCH-30
>          URL: http://issues.apache.org/jira/browse/NUTCH-30
>      Project: Nutch
>         Type: Improvement
>   Components: fetcher
>     Reporter: Stefan Groschupf
>     Priority: Minor
>  Attachments: RSSParserPatch.txt, RSS_Parser.zip, parse-rss-patch.txt, parse-rss.zip
>
> A simple RSS feed parser supporting RSS and Atom:
> + version 0.3
> + version 0.9
> + version 1.0
> + version 2.0
> Conversion of the different RSS versions is done via XSLT.
> The XSLT was contributed by Frank Henze - Thanks!
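
To illustrate the XSLT approach mentioned in the description, here is a minimal
sketch that maps two feed formats onto one common list of links using the JDK's
built-in javax.xml.transform API. This is not Frank Henze's stylesheet, which is
not reproduced in this message; the stylesheet, the <links> output format, and
the class name FeedNormalizer are assumptions made for the example, and only
RSS 2.0 and Atom 0.3 are handled.

```java
import java.io.StringReader;
import java.io.StringWriter;
import javax.xml.transform.Transformer;
import javax.xml.transform.TransformerFactory;
import javax.xml.transform.stream.StreamResult;
import javax.xml.transform.stream.StreamSource;

public class FeedNormalizer {

    // Hypothetical stylesheet: emits <links><link>URL</link>...</links>
    // for RSS 2.0 items and Atom 0.3 entries alike.
    static final String STYLESHEET =
        "<xsl:stylesheet version=\"1.0\""
        + " xmlns:xsl=\"http://www.w3.org/1999/XSL/Transform\""
        + " xmlns:atom=\"http://purl.org/atom/ns#\""
        + " exclude-result-prefixes=\"atom\">"
        + "<xsl:output method=\"xml\" omit-xml-declaration=\"yes\"/>"
        + "<xsl:template match=\"/\">"
        + "<links>"
        // RSS 2.0 keeps the URL in the text of <link>.
        + "<xsl:for-each select=\"rss/channel/item/link\">"
        + "<link><xsl:value-of select=\".\"/></link>"
        + "</xsl:for-each>"
        // Atom 0.3 keeps the URL in the href attribute of <link>.
        + "<xsl:for-each select=\"atom:feed/atom:entry/atom:link\">"
        + "<link><xsl:value-of select=\"@href\"/></link>"
        + "</xsl:for-each>"
        + "</links>"
        + "</xsl:template>"
        + "</xsl:stylesheet>";

    // Applies the stylesheet to a feed and returns the normalized XML.
    static String normalize(String feedXml) throws Exception {
        Transformer transformer = TransformerFactory.newInstance()
            .newTransformer(new StreamSource(new StringReader(STYLESHEET)));
        StringWriter out = new StringWriter();
        transformer.transform(
            new StreamSource(new StringReader(feedXml)), new StreamResult(out));
        return out.toString();
    }

    public static void main(String[] args) throws Exception {
        String rss = "<rss version=\"2.0\"><channel><item>"
            + "<link>http://example.com/one</link>"
            + "</item></channel></rss>";
        String atom = "<feed version=\"0.3\" xmlns=\"http://purl.org/atom/ns#\">"
            + "<entry><link rel=\"alternate\" type=\"text/html\""
            + " href=\"http://example.com/two\"/></entry></feed>";
        System.out.println(normalize(rss));
        System.out.println(normalize(atom));
    }
}
```

Normalizing first means the code that collects outlinks only has to understand
one format, whichever feed version was fetched.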

-- 
This message is automatically generated by JIRA.
-
If you think it was sent incorrectly contact one of the administrators:
   http://issues.apache.org/jira/secure/Administrators.jspa
-
If you want more information on JIRA, or have a bug to report see:
   http://www.atlassian.com/software/jira

