beam-issues mailing list archives

From "ASF GitHub Bot (Jira)" <>
Subject [jira] [Work logged] (BEAM-10124) ContextualTextIO - An IO that provides metadata about the line.
Date Mon, 05 Oct 2020 17:50:00 GMT


ASF GitHub Bot logged work on BEAM-10124:

                Author: ASF GitHub Bot
            Created on: 05/Oct/20 17:49
            Start Date: 05/Oct/20 17:49
    Worklog Time Spent: 10m 
      Work Description: lukecwik merged pull request #12924:


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:

Issue Time Tracking

            Worklog Id:     (was: 495477)
    Remaining Estimate: 1,320h 40m  (was: 1,320h 50m)
            Time Spent: 23h 20m  (was: 23h 10m)

> ContextualTextIO - An IO that provides metadata about the line.
> ---------------------------------------------------------------
>                 Key: BEAM-10124
>                 URL:
>             Project: Beam
>          Issue Type: New Feature
>          Components: io-ideas
>            Reporter: Reza ardeshir rokni
>            Assignee: Reza ardeshir rokni
>            Priority: P0
>             Fix For: 2.26.0
>   Original Estimate: 1,344h
>          Time Spent: 23h 20m
>  Remaining Estimate: 1,320h 40m
> Two important source IOs handle text files: FileIO and TextIO. When the requirements
> go beyond the scope of TextIO, we ask the end user to fall back on FileIO and build
> the rest on their own.
> We want to improve this experience by creating a more feature-rich TextIO that returns,
> along with each line of data, contextual information about that line: for example, the
> file it came from and the ordinal position of the line within the file.
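
The idea of returning each line together with its context can be sketched in a few lines of plain Python. This is only an illustration of the concept, not Beam's ContextualTextIO API: the names `LineWithContext` and `read_with_context` are hypothetical, and the zero-based line numbering is an assumption.

```python
import io
from dataclasses import dataclass
from typing import Iterable, Iterator


@dataclass(frozen=True)
class LineWithContext:
    """One line of text plus where it came from (hypothetical record type)."""
    file_name: str
    line_number: int  # zero-based ordinal position within the file (assumption)
    text: str


def read_with_context(file_name: str, lines: Iterable[str]) -> Iterator[LineWithContext]:
    """Yield every line with its source file name and ordinal position."""
    for line_number, raw in enumerate(lines):
        # Strip only the line terminator, keep other whitespace intact.
        yield LineWithContext(file_name, line_number, raw.rstrip("\r\n"))


if __name__ == "__main__":
    sample = io.StringIO("first\nsecond\n")
    for record in read_with_context("example.txt", sample):
        print(record)
```

A single-process reader like this gets ordinal positions for free; a distributed reader that splits one file across workers would need extra bookkeeping to recover them, which this sketch does not attempt.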
> Another area that we would like to address is more formal support for CSV files, by
> allowing compliance with RFC 4180. Specifically, the RFC allows line breaks (CRLF) to
> appear inside fields that are enclosed in double quotes.
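
To see why quoted line breaks matter, the following minimal sketch uses Python's standard `csv` module (not Beam) to parse a record whose quoted field contains a CRLF. The helper name `read_csv_records` and the sample data are hypothetical; the point is only that physical lines and logical records stop lining up once RFC 4180 quoting is allowed.

```python
import csv
import io
from typing import Iterator, List, TextIO, Tuple


def read_csv_records(handle: TextIO) -> Iterator[Tuple[int, List[str]]]:
    """Yield (record_number, fields); a record may span several physical lines."""
    # csv.reader keeps reading physical lines until the open quote is closed.
    for record_number, fields in enumerate(csv.reader(handle)):
        yield record_number, fields


if __name__ == "__main__":
    data = 'a,"line one\r\nline two",c\r\nd,e,f\r\n'
    # newline="" disables newline translation so the embedded CRLF survives.
    handle = io.StringIO(data, newline="")
    for number, fields in read_csv_records(handle):
        print(number, fields)
    # A naive split on line breaks sees more "lines" than there are records.
    print(len(data.splitlines()), "physical lines")
```

Splitting the input purely on line terminators would cut the first record in half, so a reader that honors the RFC has to track quote state across line boundaries.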

This message was sent by Atlassian Jira
