Return-Path: X-Original-To: apmail-pig-dev-archive@www.apache.org Delivered-To: apmail-pig-dev-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id A761173D9 for ; Thu, 14 Jul 2011 16:43:26 +0000 (UTC) Received: (qmail 3380 invoked by uid 500); 14 Jul 2011 16:43:25 -0000 Delivered-To: apmail-pig-dev-archive@pig.apache.org Received: (qmail 3337 invoked by uid 500); 14 Jul 2011 16:43:25 -0000 Mailing-List: contact dev-help@pig.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: dev@pig.apache.org Delivered-To: mailing list dev@pig.apache.org Received: (qmail 2587 invoked by uid 500); 14 Jul 2011 16:43:24 -0000 Delivered-To: apmail-hadoop-pig-dev@hadoop.apache.org Received: (qmail 2567 invoked by uid 99); 14 Jul 2011 16:43:24 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 14 Jul 2011 16:43:24 +0000 X-ASF-Spam-Status: No, hits=-2000.0 required=5.0 tests=ALL_TRUSTED,T_RP_MATCHES_RCVD X-Spam-Check-By: apache.org Received: from [140.211.11.116] (HELO hel.zones.apache.org) (140.211.11.116) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 14 Jul 2011 16:43:21 +0000 Received: from hel.zones.apache.org (hel.zones.apache.org [140.211.11.116]) by hel.zones.apache.org (Postfix) with ESMTP id 17864574AA for ; Thu, 14 Jul 2011 16:43:00 +0000 (UTC) Date: Thu, 14 Jul 2011 16:43:00 +0000 (UTC) From: "Dmitriy V. Ryaboy (JIRA)" To: pig-dev@hadoop.apache.org Message-ID: <1032823521.14042.1310661780093.JavaMail.tomcat@hel.zones.apache.org> In-Reply-To: <1843783547.42319.1309135307526.JavaMail.tomcat@hel.zones.apache.org> Subject: [jira] [Updated] (PIG-2143) Improvements for PigStorage MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 X-Virus-Checked: Checked by ClamAV on apache.org [ https://issues.apache.org/jira/browse/PIG-2143?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dmitriy V. Ryaboy updated PIG-2143: ----------------------------------- Attachment: PIG-2143.2.diff Thanks for the reviews. Uploading a patch that fixes the repeated deserialization (nice catch!), adjusts whitespace, and makes the piggybank stuff shallow deprecated proxies for the builtins. I am not sure if loading the schema when it was created but isn't being requested is a good idea.. can see arguments both ways. I do think we should allow loading with a different delimiter than that set in the schema. > Improvements for PigStorage > --------------------------- > > Key: PIG-2143 > URL: https://issues.apache.org/jira/browse/PIG-2143 > Project: Pig > Issue Type: Improvement > Reporter: Dmitriy V. Ryaboy > Assignee: Dmitriy V. Ryaboy > Fix For: 0.10 > > Attachments: PIG-2143.2.diff, PIG-2143.diff > > > I'd like to propose that we allow for a greater degree of customization in PigStorage. > An incomplete list features that we might want to add: > - flag to tell it to overwrite existing output if it exists > - flag to tell it to compress output using gzip|bzip|lzo (currently this can be achieved by setting the directory name to end in .gz or .bz2, which is a bit awkward) > - flag to tell it to store the schema and header (perhaps by merging in PigStorageSchema work?) -- This message is automatically generated by JIRA. For more information on JIRA, see: http://www.atlassian.com/software/jira