hive-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From sreejesh s <sreejesh...@yahoo.com>
Subject Benefit of ORC format storing Sum, Min, Max...
Date Fri, 29 May 2015 23:01:43 GMT
Hi,
I am new to Hive, please help me understand the benefit of ORC file format storing Sum, Min,
Max values.Whenever we try to find a sum of values in a particular column, it still runs the
MapReduce job.
select sum(col1) from orctable;select sum(col1) from txttable;
For a sample file with around 100 records, i dint see any difference in performance running
the above queries .. Please let me know what am i missing...

Thanks


Mime
View raw message