tez-commits mailing list archives

From bi...@apache.org
Subject git commit: TEZ-1464. Update INSTALL.txt (bikas)
Date Wed, 20 Aug 2014 01:21:58 GMT
Repository: tez
Updated Branches:
  refs/heads/master 57bd9f7e3 -> 4b52bc923


TEZ-1464. Update INSTALL.txt (bikas)


Project: http://git-wip-us.apache.org/repos/asf/tez/repo
Commit: http://git-wip-us.apache.org/repos/asf/tez/commit/4b52bc92
Tree: http://git-wip-us.apache.org/repos/asf/tez/tree/4b52bc92
Diff: http://git-wip-us.apache.org/repos/asf/tez/diff/4b52bc92

Branch: refs/heads/master
Commit: 4b52bc923012566affddb1aa56d924481b13b392
Parents: 57bd9f7
Author: Bikas Saha <bikas@apache.org>
Authored: Tue Aug 19 18:21:56 2014 -0700
Committer: Bikas Saha <bikas@apache.org>
Committed: Tue Aug 19 18:21:56 2014 -0700

----------------------------------------------------------------------
 INSTALL.txt | 38 ++++++++++++++++++++++----------------
 1 file changed, 22 insertions(+), 16 deletions(-)
----------------------------------------------------------------------


http://git-wip-us.apache.org/repos/asf/tez/blob/4b52bc92/INSTALL.txt
----------------------------------------------------------------------
diff --git a/INSTALL.txt b/INSTALL.txt
index cd29fce..9e49b17 100644
--- a/INSTALL.txt
+++ b/INSTALL.txt
@@ -2,7 +2,7 @@ How to use TEZ
 =======================
 
 Tez provides an ApplicationMaster that can run any arbritary DAG of tasks. It also
-provides a translation layer to run MR or MRR jobs using the MR APIs. This translation
+provides a translation layer to run MR jobs using the MR APIs. This translation
 layer is not fully feature compatible so if you do see any issues with running your
 existing MR jobs on TEZ, please file jiras.
 
@@ -20,10 +20,10 @@ Install/Deploy Instructions
      "hadoop dfs -copyFromLocal tez-dist/target/tez-0.5.0-SNAPSHOT-archive.tar.gz /apps/tez-0.5.0-SNAPSHOT/"
    - tez-site.xml configuration 
      - Set tez.lib.uris to point to the tar.gz uploaded to HDFS. Assuming the steps mentioned so far were followed,
-       set tez.lib.uris to "${fs.default.name}/apps/tez-0.5.0-SNAPSHOT/tez-0.5.0-SNAPSHOT.tar.gz"
+       set tez.lib.uris to "${fs.defaultFS}/apps/tez-0.5.0-SNAPSHOT/tez-0.5.0-SNAPSHOT.tar.gz"
      - Ensure tez.use.cluster.hadoop-libs is not set in tez-site.xml, or if it is set, the value should be false
-4) Optional: If running existing MapReduce jobs on Tez. Modify mapred-site.xml to change "mapreduce.framework.name" property from its
-   default value of "yarn" to "yarn-tez"
+4) Optional: If running existing MapReduce jobs on Tez. Modify mapred-site.xml to change
+"mapreduce.framework.name" property from its default value of "yarn" to "yarn-tez"
 5) Configure the client node to include the tez-libraries in the hadoop classpath
    - Extract the tez tarball created in step 2 to a local directory - (assuming TEZ_JARS is where the files will be decompressed for the next steps)
      "tar -xvzf tez-dist/target/tez-0.5.0-SNAPSHOT.tar.gz -C $TEZ_JARS"
@@ -33,14 +33,7 @@ Install/Deploy Instructions
        "export HADOOP_CLASSPATH=${TEZ_CONF_DIR}:${TEZ_JARS}/*:${TEZ_JARS}/lib/*"
      - Please note the "*" which is an important requirement when setting up classpaths for directories containing jar files.
 
-6) Submit a MR job as you normally would using something like:
-
-$HADOOP_PREFIX/bin/hadoop jar hadoop-mapreduce-client-jobclient-2.2.0-tests.jar sleep -mt 1 -rt 1 -m 1 -r 1
-
-This will use the TEZ DAG ApplicationMaster to run the MR job. This can be
-verified by looking at the AM's logs from the YARN ResourceManager UI.
-
-7) There is a basic example of using an MRR job in the tez-examples.jar. Refer to OrderedWordCount.java
+6) There is a basic example of a Tez job in the tez-examples.jar. Refer to OrderedWordCount.java
 in the source code. To run this example:
 
 $HADOOP_PREFIX/bin/hadoop jar tez-examples.jar orderedwordcount <input> <output>
@@ -61,14 +54,27 @@ set -DUSE_TEZ_SESSION=true
 
 $HADOOP_PREFIX/bin/hadoop jar tez-tests.jar testorderedwordcount -DUSE_TEZ_SESSION=true <input1> <output1> <input2> <output2>
 
+7) To test MR jobs you can submit an MR job as you normally would using something like:
 
-Alternate machanism to setup Tez to use Hadoop libraries from the cluster.
-Step 3 changes as follows. Also subsequent steps would use tez-dist/target/tez-0.5.0-SNAPSHOT-minimal.tar.gz instead of tez-dist/target/tez-0.5.0-SNAPSHOT.tar.gz
+$HADOOP_PREFIX/bin/hadoop jar hadoop-mapreduce-client-jobclient-2.2.0-tests.jar sleep -mt 1 -rt 1 -m 1 -r 1
+
+This will use the TEZ DAG ApplicationMaster to run the MR job. This can be verified by looking at
+the AM's logs from the YARN ResourceManager UI. This needs mapred-site.xml to have "mapreduce.framework.name"
+set to "yarn-tez"
+
+
+Hadoop Installation dependent Install/Deploy Instructions
+=========================================================
+The above install instructions use Tez with pre-packaged Hadoop libraries included in the package and is the
+recommended method for installation. If its needed to make Tez use the existing cluster Hadoop libraries then
+follow this alternate machanism to setup Tez to use Hadoop libraries from the cluster.
+Step 3 above changes as follows. Also subsequent steps would use tez-dist/target/tez-0.5.0-SNAPSHOT-minimal.tar.gz instead of tez-dist/target/tez-0.5.0-SNAPSHOT.tar.gz
    - A tez build without Hadoop dependencies will be available at tez-dist/target/tez-0.5.0-SNAPSHOT-minimal.tar.gz
    - Assuming that the tez jars are put in /apps/ on HDFS, the command would be
      "hadoop dfs -mkdir /apps/tez-0.5.0-SNAPSHOT"
      "hadoop dfs -copyFromLocal tez-dist/target/tez-0.5.0-SNAPSHOT-archive-minimal.tar.gz /apps/tez-0.5.0-SNAPSHOT"
    - tez-site.xml configuration
      - Set tez.lib.uris to point to the paths in HDFS containing the tez jars. Assuming the steps mentioned so far were followed,
-     set tez.lib.uris to "${fs.default.name}/apps/tez-0.5.0-SNAPSHOT/tez-0.5.0-SNAPSHOT-minimal.tar.gz
-     - Also set tez.use.cluster.hadoop-libs to true
+     set tez.lib.uris to "${fs.defaultFS}/apps/tez-0.5.0-SNAPSHOT/tez-0.5.0-SNAPSHOT-minimal.tar.gz
+     - set tez.use.cluster.hadoop-libs to true
+
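For illustration only, and not part of the commit: a minimal tez-site.xml sketch for the default (pre-packaged Hadoop libraries) setup that step 3 of the patched INSTALL.txt describes. The two property names and the tez.lib.uris value are copied from the diff above; the surrounding <configuration>/<property> layout is assumed to follow the usual Hadoop *-site.xml convention. The path is taken as written in INSTALL.txt and may need adjusting to match the archive name actually uploaded to HDFS.

```xml
<?xml version="1.0"?>
<configuration>
  <!-- Value copied from the patched INSTALL.txt; adjust the path if the
       archive uploaded to HDFS has a different file name. -->
  <property>
    <name>tez.lib.uris</name>
    <value>${fs.defaultFS}/apps/tez-0.5.0-SNAPSHOT/tez-0.5.0-SNAPSHOT.tar.gz</value>
  </property>
  <!-- Optional: INSTALL.txt only asks that this be unset or false here. -->
  <property>
    <name>tez.use.cluster.hadoop-libs</name>
    <value>false</value>
  </property>
</configuration>
```

For the cluster-library variant described under "Hadoop Installation dependent Install/Deploy Instructions", the same sketch would presumably point tez.lib.uris at the -minimal.tar.gz path given there and set tez.use.cluster.hadoop-libs to true.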

