hadoop-common-commits mailing list archives

From Apache Wiki <wikidi...@apache.org>
Subject [Hadoop Wiki] Update of "Hive/LanguageManual/Cli" by JohnSichi
Date Wed, 31 Mar 2010 21:07:43 GMT
Dear Wiki user,

You have subscribed to a wiki page or wiki category on "Hadoop Wiki" for change notification.

The "Hive/LanguageManual/Cli" page has been changed by JohnSichi.
http://wiki.apache.org/hadoop/Hive/LanguageManual/Cli?action=diff&rev1=13&rev2=14

--------------------------------------------------

  
  === Hive Resources ===
  
- Hive can manage the addition of resources to a session where those resources need to be
made available at query execution time. Currently the only supported resource is the FILE
type. Any locally accessible file can be added to the session. Once a file is added to a session,
hive query can refer to this file by its name (in map/reduce/transform clauses) and this file
is available locally at execution time on the entire hadoop cluster. Hive uses Hadoop's Distributed
Cache to distribute the added files to all the machines in the cluster at query execution
time. 
+ Hive can manage the addition of resources to a session where those resources need to be
made available at query execution time. Any locally accessible file can be added to the session.
Once a file is added to a session, hive query can refer to this file by its name (in map/reduce/transform
clauses) and this file is available locally at execution time on the entire hadoop cluster.
Hive uses Hadoop's Distributed Cache to distribute the added files to all the machines in
the cluster at query execution time. 
  
  Usage:
   {{{
-    ADD FILE[S] <filepath1> [<filepath2>]*
+    ADD { FILE[S] | JAR[S] | ARCHIVE[S] } <filepath1> [<filepath2>]*
-    LIST FILE[S] [<filepath1> <filepath2> ..]
+    LIST { FILE[S] | JAR[S] | ARCHIVE[S] } [<filepath1> <filepath2> ..]
-    DELETE FILE[S] [<filepath1> <filepath2> ..]
+    DELETE { FILE[S] | JAR[S] | ARCHIVE[S] } [<filepath1> <filepath2> ..]
   }}}
  
+  * FILE resources are just added to the distributed cache.  Typically, this might be something
like a transform script to be executed.
+  * JAR resources are also added to the Java classpath.  This is required in order to reference
objects they contain, such as UDFs.
+  * ARCHIVE resources are automatically unarchived as part of distributing them.
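
As an illustration of the FILE case described above, here is a minimal sketch of the kind of transform script a session might add. It is not part of the wiki change; the script name `upper_first.py` and the choice to upper-case the first column are hypothetical. It assumes the usual streaming convention in which rows arrive on standard input as tab-separated lines and are written back the same way.

```python
import sys


def transform(lines):
    """Yield each tab-separated input row with its first column upper-cased."""
    for line in lines:
        # Strip the trailing newline, then split the row into columns.
        fields = line.rstrip("\n").split("\t")
        fields[0] = fields[0].upper()
        yield "\t".join(fields)


if __name__ == "__main__":
    # Rows are read from stdin and emitted on stdout, one per line.
    for row in transform(sys.stdin):
        print(row)
```

Under these assumptions, the script would be registered with something like `ADD FILE /tmp/upper_first.py` (a hypothetical path) and then referred to by its file name in a map/reduce/transform clause, since added files are available locally at execution time.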
  
  Example:
   {{{
