pig-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Jonathan Coveney (Commented) (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (PIG-2600) Better Map support
Date Wed, 21 Mar 2012 16:59:40 GMT

    [ https://issues.apache.org/jira/browse/PIG-2600?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13234499#comment-13234499
] 

Jonathan Coveney commented on PIG-2600:
---------------------------------------

Hey Prashant, thanks again for your work on this. Took a look at the patch, looks good, though
I'm wondering if VALUELIST and VALUESET shouldn't return bags? I guess it depends how someone
plans to use it and you could convert easily from one to the other, but yeah... hmm. I think
the big question is how it'd be used. The awkward part about having it as a Tuple is that
it'd be impossible to do anything with the elements short of having a UDF to read them, whereas
if it's a bag, then we know the type of the keys (whatever it is in the Map), and it seems
more workable to me. This idea applies to all of the methods, really.
                
> Better Map support
> ------------------
>
>                 Key: PIG-2600
>                 URL: https://issues.apache.org/jira/browse/PIG-2600
>             Project: Pig
>          Issue Type: Improvement
>            Reporter: Jonathan Coveney
>            Assignee: Prashant Kommireddi
>             Fix For: 0.11
>
>         Attachments: PIG-2600.patch, PIG-2600_2.patch, PIG-2600_3.patch
>
>
> It would be nice if Pig played better with Maps. To that end, I'd like to add a lot of
utility around Maps.
> - TOBAG should take a Map and output {(key, value)}
> - TOMAP should take a Bag in that same form and make a map.
> - KEYSET should return the set of keys.
> - VALUESET should return the set of values.
> - VALUELIST should return the List of values (no deduping).
> - INVERSEMAP would return a Map of values => the set of keys that refer to that Key
> This would all be pretty easy. A more substantial piece of work would be to make Pig
support non-String keys (this is especially an issue since UDFs and whatnot probably assume
that they are all Integers). Not sure if it is worth it.
> I'd love to hear other things that would be useful for people!

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

Mime
View raw message