arrow-commits mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From w...@apache.org
Subject arrow git commit: ARROW-355: Add tests for serialising arrays of empty strings to Parquet
Date Tue, 01 Nov 2016 18:25:08 GMT
Repository: arrow
Updated Branches:
  refs/heads/master d4148759a -> c7db80e72


ARROW-355: Add tests for serialising arrays of empty strings to Parquet

Depends on https://issues.apache.org/jira/browse/PARQUET-759

Author: Uwe L. Korn <uwelk@xhochy.com>

Closes #190 from xhochy/ARROW-355 and squashes the following commits:

e5099ce [Uwe L. Korn] ARROW-355: Add tests for serialising arrays of empty strings to Parquet


Project: http://git-wip-us.apache.org/repos/asf/arrow/repo
Commit: http://git-wip-us.apache.org/repos/asf/arrow/commit/c7db80e7
Tree: http://git-wip-us.apache.org/repos/asf/arrow/tree/c7db80e7
Diff: http://git-wip-us.apache.org/repos/asf/arrow/diff/c7db80e7

Branch: refs/heads/master
Commit: c7db80e729c4b3e984c3ef5630ccbff43f3042b8
Parents: d414875
Author: Uwe L. Korn <uwelk@xhochy.com>
Authored: Tue Nov 1 14:25:01 2016 -0400
Committer: Wes McKinney <wes.mckinney@twosigma.com>
Committed: Tue Nov 1 14:25:01 2016 -0400

----------------------------------------------------------------------
 python/pyarrow/tests/test_parquet.py | 8 ++++++--
 1 file changed, 6 insertions(+), 2 deletions(-)
----------------------------------------------------------------------


http://git-wip-us.apache.org/repos/asf/arrow/blob/c7db80e7/python/pyarrow/tests/test_parquet.py
----------------------------------------------------------------------
diff --git a/python/pyarrow/tests/test_parquet.py b/python/pyarrow/tests/test_parquet.py
index 0f9f2e4..922ad3a 100644
--- a/python/pyarrow/tests/test_parquet.py
+++ b/python/pyarrow/tests/test_parquet.py
@@ -73,7 +73,8 @@ def test_pandas_parquet_2_0_rountrip(tmpdir):
         'datetime': np.arange("2016-01-01T00:00:00.001", size,
                               dtype='datetime64[ms]'),
         'str': [str(x) for x in range(size)],
-        'str_with_nulls': [None] + [str(x) for x in range(size - 2)] + [None]
+        'str_with_nulls': [None] + [str(x) for x in range(size - 2)] + [None],
+        'empty_str': [''] * size
     })
     filename = tmpdir.join('pandas_rountrip.parquet')
     arrow_table = A.from_pandas_dataframe(df, timestamps_to_ms=True)
@@ -98,7 +99,10 @@ def test_pandas_parquet_1_0_rountrip(tmpdir):
         'int64': np.arange(size, dtype=np.int64),
         'float32': np.arange(size, dtype=np.float32),
         'float64': np.arange(size, dtype=np.float64),
-        'bool': np.random.randn(size) > 0
+        'bool': np.random.randn(size) > 0,
+        'str': [str(x) for x in range(size)],
+        'str_with_nulls': [None] + [str(x) for x in range(size - 2)] + [None],
+        'empty_str': [''] * size
     })
     filename = tmpdir.join('pandas_rountrip.parquet')
     arrow_table = A.from_pandas_dataframe(df)


Mime
View raw message