arrow-commits mailing list archives

From w...@apache.org
Subject arrow git commit: ARROW-829: Don't deactivate Parquet dictionary encoding on column-wis…
Date Sat, 15 Apr 2017 16:43:25 GMT
Repository: arrow
Updated Branches:
  refs/heads/master 4d2ac871c -> edb8252c7


ARROW-829: Don't deactivate Parquet dictionary encoding on column-wise compression

Change-Id: Icae5494babc7cbac2e1c3e405e440ff42b2b6ae5

Author: Uwe L. Korn <uwelk@xhochy.com>

Closes #546 from xhochy/ARROW-829 and squashes the following commits:

7962877 [Uwe L. Korn] ARROW-829: Don't deactivate Parquet dictionary encoding on column-wise
compression


Project: http://git-wip-us.apache.org/repos/asf/arrow/repo
Commit: http://git-wip-us.apache.org/repos/asf/arrow/commit/edb8252c
Tree: http://git-wip-us.apache.org/repos/asf/arrow/tree/edb8252c
Diff: http://git-wip-us.apache.org/repos/asf/arrow/diff/edb8252c

Branch: refs/heads/master
Commit: edb8252c7534b787cb4dc0234080765e9bd6a045
Parents: 4d2ac87
Author: Uwe L. Korn <uwelk@xhochy.com>
Authored: Sat Apr 15 12:43:20 2017 -0400
Committer: Wes McKinney <wes.mckinney@twosigma.com>
Committed: Sat Apr 15 12:43:20 2017 -0400

----------------------------------------------------------------------
 python/manylinux1/build_arrow.sh | 2 +-
 python/pyarrow/_parquet.pyx      | 2 --
 2 files changed, 1 insertion(+), 3 deletions(-)
----------------------------------------------------------------------


http://git-wip-us.apache.org/repos/asf/arrow/blob/edb8252c/python/manylinux1/build_arrow.sh
----------------------------------------------------------------------
diff --git a/python/manylinux1/build_arrow.sh b/python/manylinux1/build_arrow.sh
index 8bc4e60..3df3225 100755
--- a/python/manylinux1/build_arrow.sh
+++ b/python/manylinux1/build_arrow.sh
@@ -72,7 +72,7 @@ for PYTHON in ${PYTHON_VERSIONS}; do
     echo "=== (${PYTHON}) Test the existence of optional modules ==="
     $PIPI_IO -r requirements.txt
     PATH="$PATH:$(cpython_path $PYTHON)/bin" $PYTHON_INTERPRETER -c "import pyarrow.parquet"
-    PATH="$PATH:$(cpython_path $PYTHON)/bin" $PYTHON_INTERPRETER -c "import pyarrow.jemalloc"
+    PATH="$PATH:$(cpython_path $PYTHON)/bin" $PYTHON_INTERPRETER -c "import pyarrow._jemalloc"
 
     echo "=== (${PYTHON}) Tag the wheel with manylinux1 ==="
     mkdir -p repaired_wheels/

http://git-wip-us.apache.org/repos/asf/arrow/blob/edb8252c/python/pyarrow/_parquet.pyx
----------------------------------------------------------------------
diff --git a/python/pyarrow/_parquet.pyx b/python/pyarrow/_parquet.pyx
index b7358a6..dafcdaf 100644
--- a/python/pyarrow/_parquet.pyx
+++ b/python/pyarrow/_parquet.pyx
@@ -539,8 +539,6 @@ cdef class ParquetWriter:
             check_compression_name(self.compression)
             props.compression(compression_from_name(self.compression))
         elif self.compression is not None:
-            # Deactivate dictionary encoding by default
-            props.disable_dictionary()
             for column, codec in self.compression.iteritems():
                 check_compression_name(codec)
                 props.compression(column, compression_from_name(codec))
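
For illustration only, the effect of the _parquet.pyx hunk can be modeled in plain Python. This is a rough sketch, not pyarrow code: WriterPropsSketch and apply_compression are hypothetical stand-ins for the writer-properties builder and the branch shown in the diff, the codec list is an assumption, and so is the default of dictionary encoding being enabled. It shows that passing a per-column compression mapping no longer switches dictionary encoding off as a side effect.

```python
# Hypothetical model of the compression branch in ParquetWriter; not pyarrow's API.
VALID_CODECS = {"NONE", "SNAPPY", "GZIP", "LZO", "BROTLI"}  # assumed codec list


class WriterPropsSketch:
    """Stand-in for the writer-properties builder (names are illustrative)."""

    def __init__(self):
        self.dictionary_enabled = True  # assumed builder default
        self.default_codec = "NONE"
        self.column_codecs = {}


def check_compression_name(name):
    # Reject codec names outside the assumed list.
    if name.upper() not in VALID_CODECS:
        raise ValueError("Unsupported compression: " + name)


def apply_compression(props, compression):
    if isinstance(compression, str):
        # A single codec name applies to every column.
        check_compression_name(compression)
        props.default_codec = compression.upper()
    elif compression is not None:
        # Before this commit, dictionary encoding was disabled at this point.
        # After it, only the per-column codecs are set.
        for column, codec in compression.items():
            check_compression_name(codec)
            props.column_codecs[column] = codec.upper()
    return props


props = apply_compression(WriterPropsSketch(), {"a": "snappy", "b": "gzip"})
print(props.dictionary_enabled, props.column_codecs)
```

With the two removed lines still in place, the same call would have left dictionary_enabled set to False for every column, even though the caller only asked for per-column codecs.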

