impala-reviews mailing list archives

From "Tim Armstrong (Code Review)" <>
Subject [Impala-ASF-CR] IMPALA-3909: Populate min/max statistics in Parquet writer
Date Fri, 27 Jan 2017 21:47:56 GMT
Tim Armstrong has posted comments on this change.

Change subject: IMPALA-3909: Populate min/max statistics in Parquet writer

Patch Set 7: Code-Review+1

File be/src/exec/

PS6, Line 178: ProcessValue
> Marcel had suggested that name, but I'm good with either. Marcel, do you ha
That's fine then, no need to keep renaming it :)

Line 389:   virtual bool ProcessValue(void* value, int64_t* bytes_needed) {
> Done, though it has the same number of lines, but now uses two return state
It doesn't make a big difference in this case - we just tend to use the early-return pattern.

File tests/query_test/

Line 325:     self.execute_query("drop table %s" % qualified_table_name)
Not needed - it should be dropped with the unique_database

Line 434:   def test_write_statistics_multiple_row_groups(self, vector, unique_database):

PS7, Line 446: num_lines

Line 447:     query = "create table %s like %s stored as parquet" % \
A while back someone who was more up-to-date on Python suggested that it was better to use
.format() instead of % for string formatting. E.g.

I don't feel strongly but thought I should mention it.
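
As a rough sketch of the .format() suggestion for Line 447, the two styles could be compared as below. The table names are hypothetical placeholders and are not taken from the patch; only the query template comes from the reviewed line.

```python
# Hypothetical names, used only for illustration (not from the patch).
qualified_table_name = "test_db.target_table"
source_table = "functional.alltypes"

# %-style formatting, as on the reviewed line.
query_percent = "create table %s like %s stored as parquet" % (
    qualified_table_name, source_table)

# Equivalent str.format() call with positional fields.
query_format = "create table {0} like {1} stored as parquet".format(
    qualified_table_name, source_table)

# Both styles build the same query string.
assert query_percent == query_format
```

Both forms yield the same string, so the choice here is stylistic.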

Line 465:       assert l.max < r.min
Maybe this should be <=? E.g. consider two row groups that only have one value for that
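
A minimal sketch of why <= may be the safer comparison on Line 465, assuming the case meant is two adjacent row groups that each hold the same single value. The Stats tuple and the ranges_ordered helper are hypothetical stand-ins, not the test's actual helpers.

```python
from collections import namedtuple

# Hypothetical stand-in for the min/max statistics of one row group.
Stats = namedtuple("Stats", ["min", "max"])


def ranges_ordered(row_groups, strict):
    """Return True if each row group's max precedes the next group's min.

    With strict=True the comparison is l.max < r.min; otherwise l.max <= r.min.
    """
    for l, r in zip(row_groups, row_groups[1:]):
        ok = l.max < r.min if strict else l.max <= r.min
        if not ok:
            return False
    return True


# Two row groups that both contain only the value 5.
row_groups = [Stats(min=5, max=5), Stats(min=5, max=5)]

strict_result = ranges_ordered(row_groups, strict=True)
relaxed_result = ranges_ordered(row_groups, strict=False)
```

Under that assumption the strict check rejects correctly sorted data, because l.max equals r.min, while the <= check accepts it.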

Line 467:     self.execute_query("drop table %s" % qualified_target_table)
Not needed - it should be dropped with the unique_database

Line 469:   def test_write_statistics_float_infinity(self, vector, unique_database):
Didn't think of this - good catch.

Gerrit-MessageType: comment
Gerrit-Change-Id: I8368ee58daa50c07a3b8ef65be70203eb941f619
Gerrit-PatchSet: 7
Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-Owner: Lars Volker <>
Gerrit-Reviewer: Lars Volker <>
Gerrit-Reviewer: Marcel Kornacker <>
Gerrit-Reviewer: Michael Brown <>
Gerrit-Reviewer: Mostafa Mokhtar <>
Gerrit-Reviewer: Tim Armstrong <>
Gerrit-Reviewer: Zoltan Ivanfi <>
Gerrit-HasComments: Yes