impala-reviews mailing list archives

From "Jim Apple (Code Review)" <>
Subject [Impala-ASF-CR] IMPALA-6070: Parallel data load.
Date Wed, 18 Oct 2017 23:17:45 GMT
Jim Apple has posted comments on this change.

Change subject: IMPALA-6070: Parallel data load.

Patch Set 1:

Commit Message:
PS1, Line 9: This commit loads functional-query, TPC-H data, and TPC-DS data in parallel.
nit: Can you wrap this at the red line provided by Gerrit? I think it is at 72 characters.
Emacs will wrap it for you at the right place with M-q (fill-paragraph), if you choose.
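
One way to check the wrapping before uploading is sketched below. The helper name long_lines is made up for illustration and is not part of the Impala tree; the 72-character limit is the one guessed at in the comment above.

```shell
#!/usr/bin/env bash
# Hypothetical helper: print "<line number>: <text>" for every stdin line
# that is longer than 72 characters.
long_lines() {
  awk 'length($0) > 72 { printf "%d: %s\n", NR, $0 }'
}

# Intended use (not executed here):
#   git log -1 --format=%B | long_lines
```

No output means every line of the commit message fits within the limit.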
PS1, Line 12: minuites
nit: minutes
File testdata/bin/
PS1, Line 480:   run-step-backgroundable "Loading functional-query data" load-functional-query.log
Could add a comment about what you decided to background and what you decided not to, and
File testdata/bin/
PS1, Line 75:   HADOOP_HEAPSIZE="1024" hive --service hiveserver2 > ${LOGDIR}/hive-server2.out 2>&1 &
> I'm currently testing to see if 512 is enough.
This looks like it will also increase HADOOP_HEAPSIZE when not doing a parallel load, which
is a shame. Do you see a way around that?
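
A minimal sketch of one possible way around it, assuming the script can tell whether a parallel load was requested. PARALLEL_LOAD and hs2_heap_env are hypothetical names chosen for illustration; the patch may detect or name this differently.

```shell
#!/usr/bin/env bash
# Hypothetical switch: "1" when the data load runs steps in parallel.
PARALLEL_LOAD="${PARALLEL_LOAD:-0}"

# Print the environment override for HiveServer2 during a parallel load,
# and print nothing for a serial load so the existing default is kept.
hs2_heap_env() {
  if [ "${PARALLEL_LOAD}" = "1" ]; then
    echo "HADOOP_HEAPSIZE=1024"
  fi
}

# Intended use (not executed here):
#   env $(hs2_heap_env) hive --service hiveserver2 \
#     > "${LOGDIR}/hive-server2.out" 2>&1 &
```

With this shape, a serial load starts HiveServer2 exactly as before, and only the parallel path asks for the larger heap.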
File testdata/bin/
PS1, Line 53: 
nit: only one empty line, to match context
PS1, Line 84:   RUN_STEP_PIDS=()
Do you want to reset MSGS, too?
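
To illustrate the concern, here is a sketch of the pattern with both arrays cleared together. The names run_step_backgroundable, run_step_wait_all, and RUN_STEP_MSGS are assumptions made for this example and may not match the patch.

```shell
#!/usr/bin/env bash
RUN_STEP_PIDS=()
RUN_STEP_MSGS=()   # assumed name for the per-step descriptions

# Start a step in the background and remember its PID and description.
run_step_backgroundable() {
  local msg="$1"; shift
  "$@" &
  RUN_STEP_PIDS+=("$!")
  RUN_STEP_MSGS+=("${msg}")
}

# Wait for every recorded step and report the ones that failed. Both arrays
# are cleared afterwards so a later batch does not pair new PIDs with stale
# messages.
run_step_wait_all() {
  local i status=0
  for i in "${!RUN_STEP_PIDS[@]}"; do
    if ! wait "${RUN_STEP_PIDS[$i]}"; then
      echo "FAILED: ${RUN_STEP_MSGS[$i]}" >&2
      status=1
    fi
  done
  RUN_STEP_PIDS=()
  RUN_STEP_MSGS=()
  return "${status}"
}
```

If only the PID array were reset, the indexes of the two arrays would drift apart on the second batch and a failure would be reported under the wrong description.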


Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I836c4e1586f229621c102c4f4ba22ce7224ab9ac
Gerrit-Change-Number: 8320
Gerrit-PatchSet: 1
Gerrit-Owner: Philip Zeyliger <>
Gerrit-Reviewer: Jim Apple <>
Gerrit-Reviewer: Joe McDonnell <>
Gerrit-Reviewer: Philip Zeyliger <>
Gerrit-Comment-Date: Wed, 18 Oct 2017 23:17:45 +0000
Gerrit-HasComments: Yes
