impala-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Huaisi Xu (Code Review)" <ger...@cloudera.org>
Subject [Impala-CR](cdh5-2.3.0 5.5.x) CDH-41243: Parquet scanner regression on wide tables
Date Tue, 28 Jun 2016 16:36:23 GMT
Huaisi Xu has submitted this change and it was merged.

Change subject: CDH-41243: Parquet scanner regression on wide tables
......................................................................


CDH-41243: Parquet scanner regression on wide tables

IMPALA-2473 introduced a check that prevent row batches growing
beyond 8MB, but it has a corner case that when an empty row
batch is larger than 8MB, it returns this row batch immediately
after it materialize one row, essentailly setting batch_size=1.

Revert "IMPALA-2473: reduce scanner memory usage"

This reverts commit 1635c0a8738daef1b283cb457fbd3bca227aa0b1.

Change-Id: If6728ed8facd305682d7dfd58f1210fa294bb232
Reviewed-on: http://gerrit.cloudera.org:8080/3484
Reviewed-by: Huaisi Xu <hxu@cloudera.com>
Tested-by: Huaisi Xu <hxu@cloudera.com>
---
M be/src/exec/data-source-scan-node.cc
M be/src/exec/hdfs-parquet-scanner.cc
M be/src/exec/hdfs-scanner.cc
M be/src/exec/hdfs-table-sink.cc
M be/src/exec/hdfs-table-sink.h
M be/src/runtime/row-batch.h
M testdata/workloads/functional-query/queries/QueryTest/nested-types-tpch.test
7 files changed, 26 insertions(+), 81 deletions(-)

Approvals:
  Huaisi Xu: Looks good to me, approved; Verified



-- 
To view, visit http://gerrit.cloudera.org:8080/3484
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-MessageType: merged
Gerrit-Change-Id: If6728ed8facd305682d7dfd58f1210fa294bb232
Gerrit-PatchSet: 2
Gerrit-Project: Impala
Gerrit-Branch: cdh5-2.3.0_5.5.x
Gerrit-Owner: Huaisi Xu <hxu@cloudera.com>
Gerrit-Reviewer: Huaisi Xu <hxu@cloudera.com>
Gerrit-Reviewer: Tim Armstrong <tarmstrong@cloudera.com>

Mime
View raw message