arrow-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Antoine Pitrou (JIRA)" <j...@apache.org>
Subject [jira] [Created] (ARROW-3656) [C++] Allow whitespace in numeric CSV fields
Date Tue, 30 Oct 2018 17:06:00 GMT
Antoine Pitrou created ARROW-3656:
-------------------------------------

             Summary: [C++] Allow whitespace in numeric CSV fields
                 Key: ARROW-3656
                 URL: https://issues.apache.org/jira/browse/ARROW-3656
             Project: Apache Arrow
          Issue Type: Improvement
          Components: C++
    Affects Versions: 0.11.0
            Reporter: Antoine Pitrou
            Assignee: Antoine Pitrou


Pandas allows whitespace before and after numbers in CSV files, but Arrow doesn't:
{code:python}
>>> s = b"a,b,c\n12 , 34 , 56\n"
>>> pd.read_csv(io.BytesIO(s))
    a   b   c
0  12  34  56
>>> csv.read_csv(io.BytesIO(s)).to_pandas()
        a        b       c
0  b'12 '  b' 34 '  b' 56'
{code}




--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

Mime
View raw message