impala-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Huaisi Xu (Code Review)" <ger...@cloudera.org>
Subject [Impala-CR](cdh5-2.4.0 5.6.x) CDH-41243: Parquet scanner regression on wide tables
Date Fri, 24 Jun 2016 21:51:49 GMT
Huaisi Xu has uploaded a new change for review.

  http://gerrit.cloudera.org:8080/3491

Change subject: CDH-41243: Parquet scanner regression on wide tables
......................................................................

CDH-41243: Parquet scanner regression on wide tables

IMPALA-2473 introduced a check that prevent row batches growing
beyond 8MB, but it has a corner case that when an empty row
batch is larger than 8MB, it returns this row batch immediately
after it materialize one row, essentailly setting batch_size=1.

Revert "IMPALA-2473: reduce scanner memory usage"

This reverts commit effb60488b9dda63e318256aa404673c3e4b17e7.

Change-Id: Ic94d818275709bfeb6880e26f45a0dd16ed07af4
---
M be/src/exec/data-source-scan-node.cc
M be/src/exec/hdfs-parquet-scanner.cc
M be/src/exec/hdfs-scanner.cc
M be/src/exec/hdfs-table-sink.cc
M be/src/exec/hdfs-table-sink.h
M be/src/runtime/row-batch.h
M testdata/workloads/functional-query/queries/QueryTest/nested-types-tpch.test
7 files changed, 26 insertions(+), 81 deletions(-)


  git pull ssh://gerrit.cloudera.org:29418/Impala refs/changes/91/3491/1
-- 
To view, visit http://gerrit.cloudera.org:8080/3491
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-MessageType: newchange
Gerrit-Change-Id: Ic94d818275709bfeb6880e26f45a0dd16ed07af4
Gerrit-PatchSet: 1
Gerrit-Project: Impala
Gerrit-Branch: cdh5-2.4.0_5.6.x
Gerrit-Owner: Huaisi Xu <hxu@cloudera.com>

Mime
View raw message