incubator-crunch-dev mailing list archives

From "Gabriel Reid (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (CRUNCH-97) Add helpers for parsing PCollection<String> instances
Date Tue, 16 Oct 2012 19:03:03 GMT

    [ https://issues.apache.org/jira/browse/CRUNCH-97?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13477263#comment-13477263 ]

Gabriel Reid commented on CRUNCH-97:
------------------------------------

Looks very cool. It would be good to add some javadoc to the Parse.Builder class methods,
just to make it clear that they basically all map to java.util.Scanner parameters.
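
For reference, the java.util.Scanner settings in question are mainly the delimiter, locale,
and radix. The sketch below uses plain java.util.Scanner only, not the Parse.Builder API from
the patch; the class name ScannerParseSketch, the method parseLine, and the sample input are
hypothetical and exist purely for illustration.

```java
import java.util.Locale;
import java.util.Scanner;

public class ScannerParseSketch {

    /**
     * Parses one delimited line of the assumed form "name,count,weight"
     * into a String, an Integer, and a Double, in that order.
     */
    public static Object[] parseLine(String line, String delimiterRegex, Locale locale, int radix) {
        try (Scanner scanner = new Scanner(line)) {
            scanner.useDelimiter(delimiterRegex); // regex that separates tokens
            scanner.useLocale(locale);            // controls decimal and grouping separators
            scanner.useRadix(radix);              // default base for integer tokens
            String name = scanner.next();
            int count = scanner.nextInt();
            double weight = scanner.nextDouble();
            return new Object[] { name, count, weight };
        }
    }

    public static void main(String[] args) {
        Object[] fields = parseLine("widget,42,3.5", ",", Locale.US, 10);
        System.out.println(fields[0] + " " + fields[1] + " " + fields[2]);
    }
}
```

Javadoc on each builder method could simply name the Scanner setting it corresponds to, so
users know where to look for the exact parsing rules.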

Also, what do you think of putting this in a new package, org.apache.crunch.lib.text?
I'm not totally sure about that either way, but it seems that everything else in o.a.c.lib
is pretty generic, and this is specific to string data.
                
> Add helpers for parsing PCollection<String> instances
> -----------------------------------------------------
>
>                 Key: CRUNCH-97
>                 URL: https://issues.apache.org/jira/browse/CRUNCH-97
>             Project: Crunch
>          Issue Type: Improvement
>          Components: Core
>            Reporter: Josh Wills
>            Assignee: Josh Wills
>             Fix For: 0.4.0
>
>         Attachments: CRUNCH-97.patch
>
>
> We should make it a bit easier to parse delimited text files into specific data types
> (e.g., ints, floats, etc.) or combinations of types, e.g., pairs of strings and ints, a Tuple3
> of booleans, etc.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira
