hadoop-pig-dev mailing list archives

From "Richard Ding (JIRA)" <j...@apache.org>
Subject [jira] Commented: (PIG-1188) Padding nulls to the input tuple according to input schema
Date Tue, 16 Feb 2010 18:27:28 GMT

    [ https://issues.apache.org/jira/browse/PIG-1188?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12834362#action_12834362 ]

Richard Ding commented on PIG-1188:
-----------------------------------

Actually, Pig already pads the input tuple with nulls according to the input schema when
the schema declares data types.

For example, given the Pig script:

{code}
a = load '1.txt' as (a0:int, a1:int);
dump a;
{code}

and the input file:

{code}
1       2
1       2       3
1
{code}

the result is:

{code}
(1,2)
(1,2)
(1, null)
{code}
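
The behaviour shown above (extra fields dropped, missing fields filled with null) can be sketched outside Pig as follows. This is only an illustrative Python sketch, not Pig's loader code: the function name `conform_to_schema` is hypothetical, and it assumes tab-separated fields and a schema of two int columns as in the script above.

```python
def conform_to_schema(fields, schema_len):
    """Return a tuple with exactly schema_len fields.

    Extra fields are dropped; missing fields are padded with None
    (standing in for Pig's null). Hypothetical helper for illustration.
    """
    kept = [int(f) for f in fields[:schema_len]]   # cast to int, as in (a0:int, a1:int)
    kept.extend([None] * (schema_len - len(kept)))  # pad short rows with nulls
    return tuple(kept)


# The three input lines from the example, assumed to be tab-separated.
lines = ["1\t2", "1\t2\t3", "1"]
rows = [conform_to_schema(line.split("\t"), 2) for line in lines]

for row in rows:
    print(row)
```

Every row comes out with the same number of fields as the declared schema, which is the property the issue asks for.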


> Padding nulls to the input tuple according to input schema
> ----------------------------------------------------------
>
>                 Key: PIG-1188
>                 URL: https://issues.apache.org/jira/browse/PIG-1188
>             Project: Pig
>          Issue Type: Bug
>          Components: impl
>    Affects Versions: 0.6.0
>            Reporter: Daniel Dai
>            Assignee: Richard Ding
>             Fix For: 0.7.0
>
>
> Currently, the number of fields in the input tuple is determined by the data. When we
> have a schema, we should generate input data according to the schema, padding with nulls if necessary.
> Here is one example:
> Pig script:
> {code}
> a = load '1.txt' as (a0, a1);
> dump a;
> {code}
> Input file:
> {code}
> 1       2
> 1       2       3
> 1
> {code}
> Current result:
> {code}
> (1,2)
> (1,2,3)
> (1)
> {code}
> Desired result:
> {code}
> (1,2)
> (1,2)
> (1, null)
> {code}

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

