incubator-crunch-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Matthias Friedrich (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (CRUNCH-97) Add helpers for parsing PCollection<String> instances
Date Wed, 12 Dec 2012 20:26:20 GMT

    [ https://issues.apache.org/jira/browse/CRUNCH-97?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13530291#comment-13530291
] 

Matthias Friedrich commented on CRUNCH-97:
------------------------------------------

Like Gabriel, I think a text parsing library would be a useful addition and I'm sorry for
being unable to offer a design that covers all use cases. We need more feedback, perhaps someone
else has a good idea, but I guess that'll only happen when it's part of the release or at
least part of the source tree. How about adding it to contrib or add a beta marker to the
javadoc and ask for feedback? This way we can still change it in case we come up with the
perfect design (whatever that is) and users get a fair warning that it's not stable yet.
                
> Add helpers for parsing PCollection<String> instances
> -----------------------------------------------------
>
>                 Key: CRUNCH-97
>                 URL: https://issues.apache.org/jira/browse/CRUNCH-97
>             Project: Crunch
>          Issue Type: Improvement
>          Components: Core
>            Reporter: Josh Wills
>            Assignee: Josh Wills
>             Fix For: 0.5.0
>
>         Attachments: CRUNCH-97.patch, CRUNCH-97-take2.patch, CRUNCH-97-Tokenizer-v1.patch,
CRUNCH-97v3.patch, CRUNCH-97v4.patch
>
>
> We should make it a bit easier to parse delimited text files into specific data types
(e.g., ints, floats, etc.) or combinations of types-- e.g., pairs of strings and ints, a Tuple3
of booleans, etc.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

Mime
View raw message