pig-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Gianmarco De Francisci Morales (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (PIG-2743) Output Schema
Date Fri, 08 Jun 2012 14:31:23 GMT

    [ https://issues.apache.org/jira/browse/PIG-2743?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13291801#comment-13291801

Gianmarco De Francisci Morales commented on PIG-2743:

The alternative option would be to prepend the rank to the tuple (akin to line numbers).
The advantage would be you always know where your rank field will end up (i.e. $0).
But I have no strong opinion on it.
Anybody else cares to comment?
> Output Schema
> -------------
>                 Key: PIG-2743
>                 URL: https://issues.apache.org/jira/browse/PIG-2743
>             Project: Pig
>          Issue Type: Sub-task
>            Reporter: Allan AvendaƱo
>            Assignee: Allan AvendaƱo
> For the rank operator, I was considering the following schema:
> E.g.
> A = load 'data' as (x:int,y:chararray,z:int,rz:chararray);
> C = rank A by x;
> So the output schema could be: 
> C: {x: int,y: chararray,z: int,rz: chararray,A::rank: int}
> In general 
> {<schema_of_working_alias>,<alias>::rank#int}

This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira


View raw message