hive-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Gautam <>
Subject ORC file sort order ..
Date Sat, 09 Apr 2016 00:53:50 GMT

           This might be too obvious a question but I haven't found a way
to validate ordering in an ORC file. I need each file to be ordered by a
column, Is there a sure shot way of ensuring the sort order in an ORC file
is as I expect it?

The closest i'v come to is using the hive --orcfiledump --rowindex <col_id>
which prints that columns min/max values in the index. But that is still
not saying if the data within the stripes is sorted.


View raw message