hive-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Gautam <gautamkows...@gmail.com>
Subject ORC file sort order ..
Date Sat, 09 Apr 2016 00:53:50 GMT
Hey,

           This might be too obvious a question but I haven't found a way
to validate ordering in an ORC file. I need each file to be ordered by a
column, Is there a sure shot way of ensuring the sort order in an ORC file
is as I expect it?

The closest i'v come to is using the hive --orcfiledump --rowindex <col_id>
which prints that columns min/max values in the index. But that is still
not saying if the data within the stripes is sorted.

Cheers,
-Gautam.

Mime
View raw message