arrow-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Marco Neumann (JIRA)" <j...@apache.org>
Subject [jira] [Created] (ARROW-1083) Object categoricals are not serialized when only None is present
Date Fri, 02 Jun 2017 08:17:04 GMT
Marco Neumann created ARROW-1083:
------------------------------------

             Summary: Object categoricals are not serialized when only None is present
                 Key: ARROW-1083
                 URL: https://issues.apache.org/jira/browse/ARROW-1083
             Project: Apache Arrow
          Issue Type: Bug
          Components: Python
    Affects Versions: 0.4.0
            Reporter: Marco Neumann
            Priority: Minor


The following code sample fails with {{pyarrow.lib.ArrowNotImplementedError: NotImplemented:
unhandled type}} but should not:

{noformat}
import pandas as pd
import pyarrow as pa
import pyarrow.parquet as pq

df = pd.DataFrame({'x': [None]})
df['x'] = df['x'].astype('category')

table = pa.Table.from_pandas(df)
buf = pa.InMemoryOutputStream()

pq.write_table(table, buf)
{noformat}



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

Mime
View raw message