nutch-dev mailing list archives

From "nutch.newbie (JIRA)" <>
Subject [jira] Commented: (NUTCH-443) allow parsers to return multiple Parse object, this will speed up the rss parser
Date Fri, 09 Feb 2007 20:22:06 GMT


nutch.newbie commented on NUTCH-443:


Frankly, my comments are about feedparser, and I must say I am grateful for the rss-plugin
and the hard work you put in. You decided to go with feedparser because you thought it was
the correct solution, so please don't take this personally.

According to SVN, the last update to feedparser was 12 months ago, plus there is no
Atom 1.0 support. This is how I would like to put it, and frankly it doesn't matter ..

1. The goal of Nutch is to be an open source alternative to Google.
2. You can't have a dead-end feedparser as your fundamental feed parsing solution when the
project has not moved for the last 12 months! Well, go figure why people think it's apache

Sorry I burst out like this. On one hand Nutch would like to preach that it is the alternative
to Google, and on the other hand it uses technology that is no longer active .. that's all.

> allow parsers to return multiple Parse object, this will speed up the rss parser
> --------------------------------------------------------------------------------
>                 Key: NUTCH-443
>                 URL:
>             Project: Nutch
>          Issue Type: New Feature
>          Components: fetcher
>    Affects Versions: 0.9.0
>            Reporter: Renaud Richardet
>            Priority: Minor
>             Fix For: 0.9.0
>         Attachments: NUTCH-443-draft-v1.patch, NUTCH-443-draft-v2.patch, parse-map-core-draft-v1.patch, parse-map-core-untested.patch, parsers.diff
> allow Parser#parse to return a Map<String,Parse>. This way, the RSS parser can return multiple parse objects, that will all be indexed separately. Advantage: no need to fetch all feed-items separately.
> see the discussion at
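
As a rough illustration of the proposal quoted above, here is a minimal sketch of a feed parser
that returns one parse per feed item, keyed by the item's URL. SimpleParse, FeedItem and
parseFeed are hypothetical names made up for this example; they are not the real Nutch Parse or
Parser types and not the code in the attached patches.

```java
import java.util.Arrays;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

public class MultiParseSketch {

    // Hypothetical stand-in for Nutch's Parse; NOT the real interface.
    static final class SimpleParse {
        final String title;
        final String text;

        SimpleParse(String title, String text) {
            this.title = title;
            this.text = text;
        }
    }

    // Hypothetical feed entry, as some feed library might hand it over.
    static final class FeedItem {
        final String link;
        final String title;
        final String description;

        FeedItem(String link, String title, String description) {
            this.link = link;
            this.title = title;
            this.description = description;
        }
    }

    // One fetched feed in, one parse per item out, keyed by the item's URL.
    // Each map entry could then be indexed as a separate document without
    // fetching the item's URL again.
    static Map<String, SimpleParse> parseFeed(List<FeedItem> items) {
        Map<String, SimpleParse> parses = new LinkedHashMap<>();
        for (FeedItem item : items) {
            parses.put(item.link, new SimpleParse(item.title, item.description));
        }
        return parses;
    }

    public static void main(String[] args) {
        List<FeedItem> items = Arrays.asList(
                new FeedItem("http://example.com/a", "First item", "Text of the first item"),
                new FeedItem("http://example.com/b", "Second item", "Text of the second item"));

        for (Map.Entry<String, SimpleParse> e : parseFeed(items).entrySet()) {
            System.out.println(e.getKey() + " -> " + e.getValue().title);
        }
    }
}
```

A LinkedHashMap is used here only to keep the items in feed order; whether ordering matters
for the real change is an assumption of this sketch.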

This message is automatically generated by JIRA.
You can reply to this email to add a comment to the issue online.
