arrow-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Miki Tebeka (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (ARROW-488) [Python] Implement conversion between integer coded as floating points with NaN to an Arrow integer type
Date Thu, 09 Mar 2017 15:19:38 GMT

    [ https://issues.apache.org/jira/browse/ARROW-488?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15903199#comment-15903199
] 

Miki Tebeka commented on ARROW-488:
-----------------------------------

Is the dtype still integer? I see that Pandas changes the dtype once you add a nan:

{noformat}
In [1]: import pandas as pd

In [2]: import numpy as np

In [3]: s = pd.Series([1,2,3])

In [4]: s
Out[4]: 
0    1
1    2
2    3
dtype: int64

In [5]: s[1] = np.nan

In [6]: s
Out[6]: 
0    1.0
1    NaN
2    3.0
dtype: float64
{noformat}

> [Python] Implement conversion between integer coded as floating points with NaN to an
Arrow integer type
> --------------------------------------------------------------------------------------------------------
>
>                 Key: ARROW-488
>                 URL: https://issues.apache.org/jira/browse/ARROW-488
>             Project: Apache Arrow
>          Issue Type: New Feature
>          Components: Python
>            Reporter: Wes McKinney
>
> For example: if pandas has casted integer data to float, this would enable the integer
data to be recovered (so long as the values fall in the ~2^53 floating point range for exact
integer representation)



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

Mime
View raw message