Date: Thu, 13 Sep 2012 17:07:07 +1100 (NCT)
From: "Cheolsoo Park (JIRA)"
To: pig-dev@hadoop.apache.org
Subject: [jira] [Updated] (PIG-2909) Add a new option for ignoring corrupted files to AvroStorage load func

     [ https://issues.apache.org/jira/browse/PIG-2909?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Cheolsoo Park updated PIG-2909:
-------------------------------

    Attachment: PIG-2909-2.patch

> Add a new option for ignoring corrupted files to AvroStorage load func
> ----------------------------------------------------------------------
>
>                 Key: PIG-2909
>                 URL: https://issues.apache.org/jira/browse/PIG-2909
>             Project: Pig
>          Issue Type: Improvement
>          Components: piggybank
>    Affects Versions: 0.10.0
>            Reporter: Cheolsoo Park
>            Assignee: Cheolsoo Park
>         Attachments: PIG-2909-2.patch, PIG-2909-avro_test_files.tar.gz, PIG-2909.patch
>
>
> Currently, AvroStorage load fails with AvroRuntimeException when encountering corrupted input files. For example,
> {code}
> ERROR 2997: Unable to recreate exception from backed error: java.io.IOException: org.apache.avro.AvroRuntimeException: java.io.IOException: Invalid sync!
> at org.apache.pig.piggybank.storage.avro.AvroStorage.getNext(AvroStorage.java:283)
> {code}
> But it is not always desirable to fail the Pig job for bad files. It is sometimes more useful to skip them and continue.
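
The skip-and-continue behaviour requested in the issue can be illustrated with a minimal, self-contained sketch. This is not the attached patch and it does not use the Avro or Pig APIs: the class `SkipCorruptSketch`, the `FakeFile` stand-in, and the `ignoreBadFiles` flag are all hypothetical names chosen for illustration. It also assumes one particular policy, namely that records read before the corruption point are kept and the rest of that file is dropped; the real option may behave differently.

```java
import java.io.IOException;
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

class SkipCorruptSketch {

    /** Hypothetical stand-in for one input file; not an Avro or Pig type. */
    static final class FakeFile {
        final String name;
        final List<String> records;
        final int corruptAt; // index at which reading fails, or -1 for a clean file

        FakeFile(String name, int corruptAt, String... records) {
            this.name = name;
            this.corruptAt = corruptAt;
            this.records = Arrays.asList(records);
        }
    }

    /**
     * Reads every record from every file. When a file turns out to be corrupt,
     * either fail the whole load (ignoreBadFiles == false, the behaviour the
     * issue describes today) or skip the remainder of that file and continue.
     */
    static List<String> loadAll(List<FakeFile> files, boolean ignoreBadFiles)
            throws IOException {
        List<String> out = new ArrayList<>();
        for (FakeFile file : files) {
            try {
                for (int i = 0; i < file.records.size(); i++) {
                    if (i == file.corruptAt) {
                        // Simulates the runtime failure raised on a bad block.
                        throw new RuntimeException("Invalid sync!");
                    }
                    out.add(file.records.get(i));
                }
            } catch (RuntimeException e) {
                if (!ignoreBadFiles) {
                    // Fail the job, wrapping the cause as the stack trace shows.
                    throw new IOException(e);
                }
                // Assumed policy: log, drop the rest of this file, move on.
                System.err.println("Skipping rest of " + file.name + ": " + e.getMessage());
            }
        }
        return out;
    }

    public static void main(String[] args) throws IOException {
        List<FakeFile> files = Arrays.asList(
                new FakeFile("a.avro", -1, "a1", "a2"),
                new FakeFile("b.avro", 1, "b1", "b2", "b3"), // corrupt at second record
                new FakeFile("c.avro", -1, "c1"));
        System.out.println(loadAll(files, true)); // prints [a1, a2, b1, c1]
    }
}
```

With `ignoreBadFiles` set to `false`, the same input makes `loadAll` throw an `IOException` at the corrupt record, which corresponds to the failure mode quoted in the issue description.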