Date: Thu, 21 Aug 2014 23:54:50 +0200
Subject: Problem with indexing RSS feeds in ElasticSearch
From: Rene Nederhand
To: user@manifoldcf.apache.org

Hi everyone,

I have a fresh installation of ManifoldCF 1.6.1 (from the binary distribution) and am trying to index RSS feeds in Elasticsearch (1.3.2). I created a job that ingests several feeds. However, it seems that some feeds are parsed but their items never end up in the index: ManifoldCF fetches the items but does not send them to Elasticsearch (ES). When I replace the ES output connector with a file-based output connector, the items _are_ stored as HTML files, as they should be.

Examples:

Working:
http://www.nu.nl/feeds/rss/algemeen.rss

Not working:
http://www.hrpraktijk.nl/nieuws/feed
http://www.penoactueel.nl/RSS/Feed/Laatste-nieuws-van-POactueel/

What could be the reason for this behaviour? Is there a solution? Should I create a ticket?

Thanks,
Rene