arrow-dev mailing list archives

From "Uwe L. Korn (JIRA)" <>
Subject [jira] [Updated] (ARROW-377) Python: Add support for conversion of Pandas.Categorical
Date Tue, 15 Nov 2016 21:10:59 GMT


Uwe L. Korn updated ARROW-377:
    Priority: Minor  (was: Major)

> Python: Add support for conversion of Pandas.Categorical
> --------------------------------------------------------
>                 Key: ARROW-377
>                 URL:
>             Project: Apache Arrow
>          Issue Type: New Feature
>            Reporter: Uwe L. Korn
>            Priority: Minor
>              Labels: newbie
> At the moment, conversion of {{pandas.Categorical}} columns fails with {{ArrowException: Invalid: only handle 1-dimensional arrays}}. As a better alternative, we should provide one of the following solutions:
>  * Convert the categorical column to a string (Pandas type {{object}}) column, then use the conversion routines for strings. Add metadata to the Arrow column recording that it was initially a Pandas categorical column, so that on a roundtrip it becomes a categorical column again.
>  * Implement the conversion of the column to a dictionary-encoded Arrow column. This is the preferred solution but may be more complicated to implement, as certain requirements have not yet been implemented.
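
To make the two options above concrete, here is a minimal pure-Python sketch. It uses neither pandas nor Arrow: the `Column` class, the `pandas_type` metadata key, and all function names are hypothetical and only illustrate the idea, not how the conversion is or will be implemented in Arrow.

```python
from dataclasses import dataclass, field


@dataclass
class Column:
    """Hypothetical stand-in for an Arrow column: values plus key/value metadata."""
    values: list
    metadata: dict = field(default_factory=dict)


def dictionary_encode(values):
    """Second option: split values into unique categories and integer codes.

    Missing values (None) get the code -1, mirroring how a categorical
    typically marks missing entries.
    """
    categories = []
    positions = {}
    codes = []
    for value in values:
        if value is None:
            codes.append(-1)
            continue
        if value not in positions:
            positions[value] = len(categories)
            categories.append(value)
        codes.append(positions[value])
    return categories, codes


def categorical_to_string_column(categories, codes):
    """First option: materialize the strings and tag the column's origin."""
    values = [None if code < 0 else categories[code] for code in codes]
    # "pandas_type" is an assumed metadata key, chosen for this sketch only.
    return Column(values, {"pandas_type": "categorical"})


def restore(column):
    """On the way back, re-encode only if the metadata says it was categorical."""
    if column.metadata.get("pandas_type") == "categorical":
        return dictionary_encode(column.values)
    return column.values


raw = ["a", "b", None, "a"]
categories, codes = dictionary_encode(raw)
column = categorical_to_string_column(categories, codes)
roundtrip = restore(column)
```

Note that the first option pays for materializing every string and rebuilds the categories in order of first appearance on the way back, so the original category order may be lost. The second option keeps the categories and codes as they are, which is one reason the dictionary-encoded representation is the preferred one.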

This message was sent by Atlassian JIRA
