pig-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Mridul Muralidharan (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (PIG-1627) Flattening of bags with unknown schemas produces wrong schema
Date Fri, 08 Apr 2011 08:37:05 GMT

    [ https://issues.apache.org/jira/browse/PIG-1627?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13017347#comment-13017347

Mridul Muralidharan commented on PIG-1627:

bytearray vs unknown schema use is always confusing.
The description in https://issues.apache.org/jira/browse/PIG-1876, for example, indicates
that unknown schema implies it should be bytearray (desc starts with : "Currently Pig map
type is untyped, which means map value is always of bytearray(ie. unknown) type." ..), while
this JIRA seems to indicate it is not the case !

I have seen varying interpretations of what bytearray is supposed to mean in the jira's, pig
docs and pig source code over the last 3+ years, not to mention in the various ilist's and
user source codebass - some clarity in this regard would be good and less confusing.

> Flattening of bags with unknown schemas produces wrong schema
> -------------------------------------------------------------
>                 Key: PIG-1627
>                 URL: https://issues.apache.org/jira/browse/PIG-1627
>             Project: Pig
>          Issue Type: Bug
>          Components: impl
>    Affects Versions: 0.7.0
>            Reporter: Alan Gates
>            Assignee: Daniel Dai
>             Fix For: 0.9.0
> The following should produce an unknown schema:
> {code}
> A = load '/Users/gates/test/data/studenttab10';
> B = group A by $0;
> C = foreach B generate flatten(A);
> describe C;
> {code}
> Instead it gives
> {code}
> C: {bytearray}
> {code}

This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

View raw message