nutch-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Lewis John Mcgibbney <lewis.mcgibb...@gmail.com>
Subject Re: RSS parser
Date Wed, 08 Feb 2012 10:54:13 GMT
Hi,

On Wed, Feb 8, 2012 at 8:44 AM, Michael Kazekin <
Michael.Kazekin@mediainsight.info> wrote:

>
> I tried your solution and got rid of "doesn't claim to support
> contentType" error indeed.
>

Maybe we should submit a patch for this indeed? Is it possible for you to
do this please?



> 2012-02-07 19:21:48,094 WARN  parse.ParseUtil - Unable to successfully
> parse content http://rss.sciam.com/sciam/earth-and-environment of type
> application/rss+xml
>
>
There is an issue for the feed plugin [1], can you please have a look
through and see if any of this looks familiar.
Thank you

[1] https://issues.apache.org/jira/browse/NUTCH-1053

Mime
  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message