arrow-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Panagiotis Mavrogiorgos (JIRA)" <j...@apache.org>
Subject [jira] [Created] (ARROW-1188) Segfault when trying to serialize a DataFrame with Null-only Categorical Column
Date Thu, 06 Jul 2017 13:29:00 GMT
Panagiotis Mavrogiorgos created ARROW-1188:
----------------------------------------------

             Summary: Segfault when trying to serialize a DataFrame with Null-only Categorical
Column
                 Key: ARROW-1188
                 URL: https://issues.apache.org/jira/browse/ARROW-1188
             Project: Apache Arrow
          Issue Type: Bug
            Reporter: Panagiotis Mavrogiorgos


{code}
import pandas as pd
import feather
print("Pandas: %s" % pd.__version__)
print("Feather: %s" % feather.version.version)
df = pd.DataFrame({"A": (1, 2, 3), "B": ("a", "b", "c")})   # if we use this line everything
works fine
df = pd.DataFrame({"A": (1, 2, 3), "B": (None, None, None)})
df = df.assign(B=df.B.astype("category"))
#feather.write_dataframe(df, "/tmp/foo.feather")   # this segfaults!
{code}

{code}
Pandas: 0.19.2
Feather: 0.4.0
{code}



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

Mime
View raw message