From: fhueske
To: issues@flink.incubator.apache.org
Reply-To: issues@flink.incubator.apache.org
Subject: [GitHub] incubator-flink pull request: enable CSV Reader to ignore invalid ...
Message-Id: <20141119204859.4066B945303@tyr.zones.apache.org>
Date: Wed, 19 Nov 2014 20:48:59 +0000 (UTC)

Github user fhueske commented on a diff in the pull request:

    https://github.com/apache/incubator-flink/pull/201#discussion_r20606237

    --- Diff: flink-core/src/main/java/org/apache/flink/api/common/io/GenericCsvInputFormat.java ---
    @@ -269,6 +283,21 @@ public void open(FileInputSplit split) throws IOException {
     	protected boolean parseRecord(Object[] holders, byte[] bytes, int offset, int numBytes) throws ParseException {
    +		if (commentPrefix != null) {
    +			//check record for comments
    --- End diff --

    Your logic checks for comments anywhere in a line. It would even identify comments which are contained in an escaped String.
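One possible way to address the concern in the review comment is to treat a record as a comment only when it begins with the comment prefix, so that a prefix occurring later in the line (for example inside a quoted field) is left alone. The following is a minimal sketch of that idea, not the code from the pull request: the class `CommentPrefixCheck`, the method `startsWithPrefix`, and the assumption that the prefix is available as a byte array are all hypothetical and are not part of Flink's API. Only the `bytes`, `offset`, and `numBytes` parameters mirror the `parseRecord` signature shown in the diff.

```java
import java.nio.charset.StandardCharsets;

// Hypothetical helper for illustration only; not part of Flink.
final class CommentPrefixCheck {

    /**
     * Returns true only if the record stored in bytes[offset, offset + numBytes)
     * begins with the given prefix. A prefix that appears later in the record,
     * such as inside a quoted field, is deliberately ignored.
     */
    static boolean startsWithPrefix(byte[] bytes, int offset, int numBytes, byte[] prefix) {
        // No prefix configured, or record too short to contain it.
        if (prefix == null || prefix.length == 0 || numBytes < prefix.length) {
            return false;
        }
        // Compare the first prefix.length bytes of the record with the prefix.
        for (int i = 0; i < prefix.length; i++) {
            if (bytes[offset + i] != prefix[i]) {
                return false;
            }
        }
        return true;
    }

    public static void main(String[] args) {
        byte[] prefix = "#".getBytes(StandardCharsets.UTF_8);
        byte[] comment = "# a comment line".getBytes(StandardCharsets.UTF_8);
        byte[] data = "1,\"contains # inside quotes\",3".getBytes(StandardCharsets.UTF_8);

        System.out.println(startsWithPrefix(comment, 0, comment.length, prefix)); // true
        System.out.println(startsWithPrefix(data, 0, data.length, prefix));       // false
    }
}
```

Whether leading whitespace before the prefix should still count as a comment is a separate design decision that this sketch does not cover.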