pig-dev mailing list archives

From "Olga Natkovich (JIRA)" <j...@apache.org>
Subject [jira] Commented: (PIG-1188) Padding nulls to the input tuple according to input schema
Date Wed, 02 Mar 2011 21:11:36 GMT

    [ https://issues.apache.org/jira/browse/PIG-1188?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13001650#comment-13001650 ]

Olga Natkovich commented on PIG-1188:
-------------------------------------

The remaining work for Pig 0.9 on this issue is to make sure that the same behavior applies
whenever a schema is provided during load, regardless of whether type information is
provided.

We want the untyped case to behave the way the typed case currently does.

I believe the way we agreed to do this is by adding a foreach regardless of whether type information
is available.

> Padding nulls to the input tuple according to input schema
> ----------------------------------------------------------
>
>                 Key: PIG-1188
>                 URL: https://issues.apache.org/jira/browse/PIG-1188
>             Project: Pig
>          Issue Type: Bug
>          Components: impl
>    Affects Versions: 0.6.0
>            Reporter: Daniel Dai
>            Assignee: Daniel Dai
>             Fix For: 0.9.0
>
>
> Currently, the number of fields in the input tuple is determined by the data. When we
> have a schema, we should generate input data according to the schema, padding with nulls if necessary.
> Here is one example:
> Pig script:
> {code}
> a = load '1.txt' as (a0, a1);
> dump a;
> {code}
> Input file:
> {code}
> 1       2
> 1       2       3
> 1
> {code}
> Current result:
> {code}
> (1,2)
> (1,2,3)
> (1)
> {code}
> Desired result:
> {code}
> (1,2)
> (1,2)
> (1, null)
> {code}
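
The desired result above can be modeled outside Pig with a short sketch. This is
only an illustration of the semantics described in the issue, not Pig's implementation:
the function name pad_to_schema and the use of Python's None to stand in for a Pig null
are assumptions made for the example. Fields are kept as strings because the schema in
the example is untyped.

```python
def pad_to_schema(fields, schema_width):
    """Return a tuple with exactly schema_width fields.

    Extra fields beyond the schema are dropped; missing fields are
    filled with None, which stands in for a Pig null here.
    """
    trimmed = list(fields[:schema_width])
    trimmed.extend([None] * (schema_width - len(trimmed)))
    return tuple(trimmed)


# The three input rows from 1.txt, already split on the delimiter.
rows = [["1", "2"], ["1", "2", "3"], ["1"]]

# The schema (a0, a1) declares two fields.
result = [pad_to_schema(row, 2) for row in rows]

for record in result:
    print(record)
```

Each record ends up with exactly two fields: the second row loses its third field and
the third row gains a trailing null, matching the desired result in the issue.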

-- 
This message is automatically generated by JIRA.
-
For more information on JIRA, see: http://www.atlassian.com/software/jira

        
