From: fhueske
To: issues@flink.incubator.apache.org
Reply-To: issues@flink.incubator.apache.org
Subject: [GitHub] incubator-flink pull request: enable CSV Reader to ignore invalid ...
Date: Fri, 14 Nov 2014 20:02:38 +0000 (UTC)

Github user fhueske commented on a diff in the pull request:

    https://github.com/apache/incubator-flink/pull/201#discussion_r20382587

    --- Diff: flink-runtime/src/main/java/org/apache/flink/runtime/operators/DataSourceTask.java ---
    @@ -192,8 +192,9 @@ public void invoke() throws Exception {
         // as long as there is data to read
         while (!this.taskCanceled && !format.reachedEnd()) {
             // build next pair and ship pair if it is valid
    -        if ((record = format.nextRecord(record)) != null) {
    -            output.collect(record);
    +        OT r;
    --- End diff --

    These modifications change Flink's behavior for every type of InputFormat, not just CSVInputFormats. If we want to allow skipping invalid lines, this is not the right place to add the feature.
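
To illustrate the point of the comment, here is a minimal sketch of keeping the skip-invalid-lines logic inside the CSV format itself, so that the generic read loop stays the same for every format. All names below (SimpleInputFormat, LenientCsvFormat, readAll, the lenient flag) are hypothetical stand-ins invented for this example. They are not Flink's actual classes or the code of the pull request, and the two-integer line layout is an assumption.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Iterator;
import java.util.List;

public class LenientCsvSketch {

    /** Hypothetical stand-in for an input format; NOT Flink's actual interface. */
    interface SimpleInputFormat<T> {
        boolean reachedEnd();

        /** Returns the next record, or null if no valid record is left. */
        T nextRecord();
    }

    /** Parses lines of the assumed form "int,int"; can skip lines that fail to parse. */
    static final class LenientCsvFormat implements SimpleInputFormat<int[]> {
        private final Iterator<String> lines;
        private final boolean lenient; // hypothetical switch, off means fail fast

        LenientCsvFormat(List<String> lines, boolean lenient) {
            this.lines = lines.iterator();
            this.lenient = lenient;
        }

        @Override
        public boolean reachedEnd() {
            return !lines.hasNext();
        }

        @Override
        public int[] nextRecord() {
            while (lines.hasNext()) {
                String line = lines.next();
                try {
                    return parse(line);
                } catch (IllegalArgumentException e) {
                    if (!lenient) {
                        throw e; // strict mode: surface the bad line to the caller
                    }
                    // lenient mode: drop this line and try the next one
                }
            }
            return null; // input exhausted without finding another valid line
        }

        private static int[] parse(String line) {
            String[] fields = line.split(",");
            if (fields.length != 2) {
                throw new IllegalArgumentException("Expected 2 fields: " + line);
            }
            // NumberFormatException is a subclass of IllegalArgumentException
            return new int[] {
                Integer.parseInt(fields[0].trim()),
                Integer.parseInt(fields[1].trim())
            };
        }
    }

    /** Generic read loop: identical for every format, unaware of CSV leniency. */
    static <T> List<T> readAll(SimpleInputFormat<T> format) {
        List<T> out = new ArrayList<>();
        while (!format.reachedEnd()) {
            T record = format.nextRecord();
            if (record != null) {
                out.add(record);
            }
        }
        return out;
    }

    public static void main(String[] args) {
        List<String> input = Arrays.asList("1,2", "not,a number", "3", "4,5");
        for (int[] r : readAll(new LenientCsvFormat(input, true))) {
            System.out.println(r[0] + "," + r[1]);
        }
    }
}
```

With the lenient flag set, the two malformed lines in the sample input are dropped inside the format and only the two valid records reach the loop. With the flag unset, the first malformed line raises an exception. Either way the generic loop is untouched, so other formats keep their existing behavior.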