Return-Path: Delivered-To: apmail-pig-dev-archive@www.apache.org Received: (qmail 6255 invoked from network); 7 Mar 2011 20:53:24 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (140.211.11.3) by minotaur.apache.org with SMTP; 7 Mar 2011 20:53:24 -0000 Received: (qmail 13947 invoked by uid 500); 7 Mar 2011 20:53:24 -0000 Delivered-To: apmail-pig-dev-archive@pig.apache.org Received: (qmail 13914 invoked by uid 500); 7 Mar 2011 20:53:24 -0000 Mailing-List: contact dev-help@pig.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: dev@pig.apache.org Delivered-To: mailing list dev@pig.apache.org Received: (qmail 13863 invoked by uid 500); 7 Mar 2011 20:53:24 -0000 Delivered-To: apmail-hadoop-pig-dev@hadoop.apache.org Received: (qmail 13846 invoked by uid 99); 7 Mar 2011 20:53:24 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 07 Mar 2011 20:53:23 +0000 X-ASF-Spam-Status: No, hits=-2000.0 required=5.0 tests=ALL_TRUSTED,T_RP_MATCHES_RCVD X-Spam-Check-By: apache.org Received: from [140.211.11.116] (HELO hel.zones.apache.org) (140.211.11.116) by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 07 Mar 2011 20:53:21 +0000 Received: from hel.zones.apache.org (hel.zones.apache.org [140.211.11.116]) by hel.zones.apache.org (Postfix) with ESMTP id E9F1A39CC19 for ; Mon, 7 Mar 2011 20:52:59 +0000 (UTC) Date: Mon, 7 Mar 2011 20:52:59 +0000 (UTC) From: "Thejas M Nair (JIRA)" To: pig-dev@hadoop.apache.org Message-ID: <2001127382.2387.1299531179954.JavaMail.tomcat@hel.zones.apache.org> Subject: [jira] Commented: (PIG-1618) Switch to new parser generator technology MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 X-Virus-Checked: Checked by ClamAV on apache.org [ https://issues.apache.org/jira/browse/PIG-1618?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13003588#comment-13003588 ] Thejas M Nair commented on PIG-1618: ------------------------------------ The new parser changes also fixes one behavior seen in earlier versions including 0.8 . With the changes in trunk - flatten of bag with null schema will result in a null schema. For example, in 0.8 - {code} grunt> describe g; g: {group: bytearray,a: {null}} grunt> f = foreach g generate $0 , flatten(a); grunt> describe f; f: {group: bytearray,bytearray} {code} in trunk with new parser changes - {code} grunt> describe g; g: {group: bytearray,a: {(null)}} grunt> f = foreach g generate $0 , flatten(a); grunt> describe f; Schema for f unknown. {code} > Switch to new parser generator technology > ----------------------------------------- > > Key: PIG-1618 > URL: https://issues.apache.org/jira/browse/PIG-1618 > Project: Pig > Issue Type: Improvement > Affects Versions: 0.8.0 > Reporter: Alan Gates > Assignee: Xuefu Zhang > Fix For: 0.9.0 > > Attachments: NewParser-1.patch, NewParser-10.patch, NewParser-11.patch, NewParser-12.patch, NewParser-13.2.patch, NewParser-13.patch, NewParser-14.patch, NewParser-15.patch, NewParser-18.patch, NewParser-19.3.patch, NewParser-19.patch, NewParser-2.patch, NewParser-3.patch, NewParser-3.patch, NewParser-4.patch, NewParser-5.patch, NewParser-6.patch, NewParser-7.patch, NewParser-8.patches, NewParser-9.patch, antlr-3.2.jar, javadoc.patch > > > There are many bugs in Pig related to the parser, particularly to bad error messages. After review of Java CC we feel these will be difficult to address using that tool. Also, the .jjt files used by JavaCC are hard to understand and maintain. > ANTLR is being reviewed as the most likely choice to move to, but other parsers will be reviewed as well. > This JIRA will act as an umbrella issue for other parser issues. -- This message is automatically generated by JIRA. For more information on JIRA, see: http://www.atlassian.com/software/jira