hive-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Manhee Jo>
Subject loading data from HDFS or local file to
Date Wed, 22 Jul 2009 07:25:24 GMT
Hi all,

What really happens when a huge file (e.g. some tens of TB) is "LOADed DATA 
INTO TABLE"? Does hive need to scan the entire file before processing 
anything even very simple (e.g. select)?
If so, are there any solutions to decrease the number of disk access? Is 
partitioning a way to do it?

Many Thanks,

View raw message