hadoop-common-commits mailing list archives

From Apache Wiki <wikidi...@apache.org>
Subject [Hadoop Wiki] Update of "Hive/AdminManual/Configuration" by JoydeepSensarma
Date Mon, 21 Jun 2010 05:28:07 GMT
Dear Wiki user,

You have subscribed to a wiki page or wiki category on "Hadoop Wiki" for change notification.

The "Hive/AdminManual/Configuration" page has been changed by JoydeepSensarma.


  ||hive.partition.name||The name of the Hive partition. This is passed to the user scripts through the script operator.|| ||
  ||hive.alias||The alias being processed. This is also passed to the user scripts through the script operator.|| ||
+ === Temporary Folders ===
+ Hive uses temporary folders both on the machine running the Hive client and on the default
HDFS instance. These folders store per-query temporary and intermediate data sets
and are normally cleaned up by the Hive client when the query finishes. If the Hive client
terminates abnormally, however, some data may be left behind. The configuration details
are as follows:
+  * On the HDFS cluster, the folder defaults to ''/tmp/hive-<username>'' and is controlled
by the configuration variable ''hive.exec.scratchdir''.
+  * On the client machine, the folder is hardcoded to ''/tmp/<username>''.
+ Note that when writing data to a table or partition, Hive first writes to a temporary location
on the target table's filesystem (using ''hive.exec.scratchdir'' as the temporary location) and
then moves the data to the target table. This applies in all cases, whether tables are stored
in HDFS (the normal case) or in file systems such as S3 or even NFS.
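As a rough illustration of the defaults described above, the following Python sketch computes the two temporary locations for a given user. It is not part of Hive: the function name `temp_folders` and the `conf` dictionary are hypothetical, and the path templates are assumed to be literal renderings of the defaults stated in the text.

```python
import getpass

# Defaults as described in the text above (assumed to be literal path templates).
DEFAULT_HDFS_SCRATCH_TEMPLATE = "/tmp/hive-{user}"
LOCAL_TMP_TEMPLATE = "/tmp/{user}"  # hardcoded per the text, so not configurable


def temp_folders(user=None, conf=None):
    """Return the (hdfs_scratch_dir, local_tmp_dir) pair for a user.

    `conf` is a hypothetical dict of configuration overrides; only the
    key 'hive.exec.scratchdir' is consulted.
    """
    user = user or getpass.getuser()
    conf = conf or {}
    # An explicit hive.exec.scratchdir setting replaces the HDFS default.
    hdfs_dir = conf.get(
        "hive.exec.scratchdir",
        DEFAULT_HDFS_SCRATCH_TEMPLATE.format(user=user),
    )
    # The client-side folder ignores configuration entirely.
    local_dir = LOCAL_TMP_TEMPLATE.format(user=user)
    return hdfs_dir, local_dir
```

Under these assumptions, overriding ''hive.exec.scratchdir'' changes only the HDFS-side location; the client-side folder stays at its hardcoded path, which is where to look for leftovers after an abnormal client termination.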
+ === Log Files ===
+ The Hive client produces logs and history files on the client machine. Please see [[http://wiki.apache.org/hadoop/Hive/GettingStarted#Error_Logs|Error Logs]] for configuration details.
