Return-Path: X-Original-To: apmail-nutch-user-archive@www.apache.org Delivered-To: apmail-nutch-user-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id ACD9C9ED5 for ; Wed, 8 Feb 2012 10:54:50 +0000 (UTC) Received: (qmail 84910 invoked by uid 500); 8 Feb 2012 10:54:49 -0000 Delivered-To: apmail-nutch-user-archive@nutch.apache.org Received: (qmail 84694 invoked by uid 500); 8 Feb 2012 10:54:42 -0000 Mailing-List: contact user-help@nutch.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@nutch.apache.org Delivered-To: mailing list user@nutch.apache.org Received: (qmail 84686 invoked by uid 99); 8 Feb 2012 10:54:39 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 08 Feb 2012 10:54:39 +0000 X-ASF-Spam-Status: No, hits=1.5 required=5.0 tests=HTML_MESSAGE,RCVD_IN_DNSWL_LOW,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (athena.apache.org: domain of lewis.mcgibbney@gmail.com designates 209.85.160.182 as permitted sender) Received: from [209.85.160.182] (HELO mail-gy0-f182.google.com) (209.85.160.182) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 08 Feb 2012 10:54:34 +0000 Received: by ghbg15 with SMTP id g15so175359ghb.27 for ; Wed, 08 Feb 2012 02:54:13 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=gamma; h=mime-version:in-reply-to:references:date:message-id:subject:from:to :content-type; bh=dcEFo9/a0XW6856xbi0U3mcdqznExIX0BKGt6pGISnA=; b=La6NaXr7TuxdWmE5RDpMR6qmyGfsUrWKL6QoMM1OccI8YBxYP+BlFzafc/PvYRwIAv eH0fzKRV39dTFgWcy22lWGE3JnVaDby610vHloxWfxxvyN/9wCJuWWlAG1huLI+ea9u6 ml3yEbDn0XTBqKHJSxd+zQ/d7ME3M4Vzxv3Rs= MIME-Version: 1.0 Received: by 10.236.155.226 with SMTP id j62mr36195797yhk.49.1328698453907; Wed, 08 Feb 2012 02:54:13 -0800 (PST) Received: by 10.236.195.68 with HTTP; Wed, 8 Feb 2012 02:54:13 -0800 (PST) In-Reply-To: <4F3235F2.800@mediainsight.info> References: <4F2FD14A.6060708@mediainsight.info> <1328627247202-3722778.post@n3.nabble.com> <4F3235F2.800@mediainsight.info> Date: Wed, 8 Feb 2012 10:54:13 +0000 Message-ID: Subject: Re: RSS parser From: Lewis John Mcgibbney To: user@nutch.apache.org Content-Type: multipart/alternative; boundary=20cf303b3c875bd86104b871b762 --20cf303b3c875bd86104b871b762 Content-Type: text/plain; charset=ISO-8859-1 Hi, On Wed, Feb 8, 2012 at 8:44 AM, Michael Kazekin < Michael.Kazekin@mediainsight.info> wrote: > > I tried your solution and got rid of "doesn't claim to support > contentType" error indeed. > Maybe we should submit a patch for this indeed? Is it possible for you to do this please? > 2012-02-07 19:21:48,094 WARN parse.ParseUtil - Unable to successfully > parse content http://rss.sciam.com/sciam/earth-and-environment of type > application/rss+xml > > There is an issue for the feed plugin [1], can you please have a look through and see if any of this looks familiar. Thank you [1] https://issues.apache.org/jira/browse/NUTCH-1053 --20cf303b3c875bd86104b871b762--