hadoop-common-commits mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From jia...@apache.org
Subject [6/9] hadoop git commit: YARN-3285. (Backport YARN-3168) Convert branch-2 .apt.vm files of YARN to markdown. Contributed by Masatake Iwasaki
Date Wed, 04 Mar 2015 00:45:56 GMT
http://git-wip-us.apache.org/repos/asf/hadoop/blob/aafe5713/hadoop-yarn-project/hadoop-yarn/hadoop-yarn-site/src/site/apt/ResourceManagerRestart.apt.vm
----------------------------------------------------------------------
diff --git a/hadoop-yarn-project/hadoop-yarn/hadoop-yarn-site/src/site/apt/ResourceManagerRestart.apt.vm b/hadoop-yarn-project/hadoop-yarn/hadoop-yarn-site/src/site/apt/ResourceManagerRestart.apt.vm
deleted file mode 100644
index a08c19d..0000000
--- a/hadoop-yarn-project/hadoop-yarn/hadoop-yarn-site/src/site/apt/ResourceManagerRestart.apt.vm
+++ /dev/null
@@ -1,298 +0,0 @@
-~~ Licensed under the Apache License, Version 2.0 (the "License");
-~~ you may not use this file except in compliance with the License.
-~~ You may obtain a copy of the License at
-~~
-~~   http://www.apache.org/licenses/LICENSE-2.0
-~~
-~~ Unless required by applicable law or agreed to in writing, software
-~~ distributed under the License is distributed on an "AS IS" BASIS,
-~~ WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
-~~ See the License for the specific language governing permissions and
-~~ limitations under the License. See accompanying LICENSE file.
-
-  ---
-  ResourceManager Restart
-  ---
-  ---
-  ${maven.build.timestamp}
-
-ResourceManager Restart
-
-%{toc|section=1|fromDepth=0}
-
-* {Overview}
-
-  ResourceManager is the central authority that manages resources and schedules
-  applications running atop of YARN. Hence, it is potentially a single point of
-  failure in a Apache YARN cluster.
-
-  This document gives an overview of ResourceManager Restart, a feature that
-  enhances ResourceManager to keep functioning across restarts and also makes
-  ResourceManager down-time invisible to end-users.
-
-  ResourceManager Restart feature is divided into two phases:
-
-  ResourceManager Restart Phase 1 (Non-work-preserving RM restart):
-  Enhance RM to persist application/attempt state
-  and other credentials information in a pluggable state-store. RM will reload
-  this information from state-store upon restart and re-kick the previously
-  running applications. Users are not required to re-submit the applications.
-
-  ResourceManager Restart Phase 2 (Work-preserving RM restart):
-  Focus on re-constructing the running state of ResourceManager by combining
-  the container statuses from NodeManagers and container requests from ApplicationMasters
-  upon restart. The key difference from phase 1 is that previously running applications
-  will not be killed after RM restarts, and so applications won't lose its work
-  because of RM outage.
-
-* {Feature}
-
-** Phase 1: Non-work-preserving RM restart
-
-  As of Hadoop 2.4.0 release, only ResourceManager Restart Phase 1 is implemented which
-  is described below.
-
-  The overall concept is that RM will persist the application metadata
-  (i.e. ApplicationSubmissionContext) in
-  a pluggable state-store when client submits an application and also saves the final status
-  of the application such as the completion state (failed, killed, finished) 
-  and diagnostics when the application completes. Besides, RM also saves
-  the credentials like security keys, tokens to work in a secure environment.
-  Any time RM shuts down, as long as the required information (i.e.application metadata
-  and the alongside credentials if running in a secure environment) is available
-  in the state-store, when RM restarts, it can pick up the application metadata
-  from the state-store and re-submit the application. RM won't re-submit the
-  applications if they were already completed (i.e. failed, killed, finished)
-  before RM went down.
-
-  NodeManagers and clients during the down-time of RM will keep polling RM until 
-  RM comes up. When RM becomes alive, it will send a re-sync command to
-  all the NodeManagers and ApplicationMasters it was talking to via heartbeats.
-  As of Hadoop 2.4.0 release, the behaviors for NodeManagers and ApplicationMasters to handle this command
-  are: NMs will kill all its managed containers and re-register with RM. From the
-  RM's perspective, these re-registered NodeManagers are similar to the newly joining NMs. 
-  AMs(e.g. MapReduce AM) are expected to shutdown when they receive the re-sync command.
-  After RM restarts and loads all the application metadata, credentials from state-store
-  and populates them into memory, it will create a new
-  attempt (i.e. ApplicationMaster) for each application that was not yet completed
-  and re-kick that application as usual. As described before, the previously running
-  applications' work is lost in this manner since they are essentially killed by
-  RM via the re-sync command on restart.
-
-** Phase 2: Work-preserving RM restart
-
-  As of Hadoop 2.6.0, we further enhanced RM restart feature to address the problem 
-  to not kill any applications running on YARN cluster if RM restarts.
-
-  Beyond all the groundwork that has been done in Phase 1 to ensure the persistency
-  of application state and reload that state on recovery, Phase 2 primarily focuses
-  on re-constructing the entire running state of YARN cluster, the majority of which is
-  the state of the central scheduler inside RM which keeps track of all containers' life-cycle,
-  applications' headroom and resource requests, queues' resource usage etc. In this way,
-  RM doesn't need to kill the AM and re-run the application from scratch as it is
-  done in Phase 1. Applications can simply re-sync back with RM and
-  resume from where it were left off.
-
-  RM recovers its runing state by taking advantage of the container statuses sent from all NMs.
-  NM will not kill the containers when it re-syncs with the restarted RM. It continues
-  managing the containers and send the container statuses across to RM when it re-registers.
-  RM reconstructs the container instances and the associated applications' scheduling status by
-  absorbing these containers' information. In the meantime, AM needs to re-send the
-  outstanding resource requests to RM because RM may lose the unfulfilled requests when it shuts down.
-  Application writers using AMRMClient library to communicate with RM do not need to
-  worry about the part of AM re-sending resource requests to RM on re-sync, as it is
-  automatically taken care by the library itself.
-
-* {Configurations}
-
-** Enable RM Restart.
-
-*--------------------------------------+--------------------------------------+
-|| Property                            || Value                                |
-*--------------------------------------+--------------------------------------+
-| <<<yarn.resourcemanager.recovery.enabled>>> | |
-| | <<<true>>> |
-*--------------------------------------+--------------------------------------+ 
-
-
-** Configure the state-store for persisting the RM state.
-
-
-*--------------------------------------*--------------------------------------+
-|| Property                            || Description                        |
-*--------------------------------------+--------------------------------------+
-| <<<yarn.resourcemanager.store.class>>> | |
-| | The class name of the state-store to be used for saving application/attempt |
-| | state and the credentials. The available state-store implementations are  |
-| | <<<org.apache.hadoop.yarn.server.resourcemanager.recovery.ZKRMStateStore>>> |
-| | , a ZooKeeper based state-store implementation and  |
-| | <<<org.apache.hadoop.yarn.server.resourcemanager.recovery.FileSystemRMStateStore>>> |
-| | , a Hadoop FileSystem based state-store implementation like HDFS and local FS. |
-| | <<<org.apache.hadoop.yarn.server.resourcemanager.recovery.LeveldbRMStateStore>>>, |
-| | a LevelDB based state-store implementation. |
-| | The default value is set to |
-| | <<<org.apache.hadoop.yarn.server.resourcemanager.recovery.FileSystemRMStateStore>>>. |
-*--------------------------------------+--------------------------------------+ 
-
-** How to choose the state-store implementation.
-
-    <<ZooKeeper based state-store>>: User is free to pick up any storage to set up RM restart,
-    but must use ZooKeeper based state-store to support RM HA. The reason is that only ZooKeeper
-    based state-store supports fencing mechanism to avoid a split-brain situation where multiple
-    RMs assume they are active and can edit the state-store at the same time.
-
-    <<FileSystem based state-store>>: HDFS and local FS based state-store are supported. 
-    Fencing mechanism is not supported.
-
-    <<LevelDB based state-store>>: LevelDB based state-store is considered more light weight than HDFS and ZooKeeper
-    based state-store. LevelDB supports better atomic operations, fewer I/O ops per state update,
-    and far fewer total files on the filesystem. Fencing mechanism is not supported.
-
-** Configurations for Hadoop FileSystem based state-store implementation.
-
-    Support both HDFS and local FS based state-store implementation. The type of file system to
-    be used is determined by the scheme of URI. e.g. <<<hdfs://localhost:9000/rmstore>>> uses HDFS as the storage and
-    <<<file:///tmp/yarn/rmstore>>> uses local FS as the storage. If no
-    scheme(<<<hdfs://>>> or <<<file://>>>) is specified in the URI, the type of storage to be used is
-    determined by <<<fs.defaultFS>>> defined in <<<core-site.xml>>>.
-
-    Configure the URI where the RM state will be saved in the Hadoop FileSystem state-store.
-
-*--------------------------------------+--------------------------------------+
-|| Property                            || Description                        |
-*--------------------------------------+--------------------------------------+
-| <<<yarn.resourcemanager.fs.state-store.uri>>> | |
-| | URI pointing to the location of the FileSystem path where RM state will |
-| | be stored (e.g. hdfs://localhost:9000/rmstore). |
-| | Default value is <<<${hadoop.tmp.dir}/yarn/system/rmstore>>>. |
-| | If FileSystem name is not provided, <<<fs.default.name>>> specified in |
-| | <<conf/core-site.xml>> will be used. |
-*--------------------------------------+--------------------------------------+ 
-
-    Configure the retry policy state-store client uses to connect with the Hadoop
-    FileSystem.
-
-*--------------------------------------+--------------------------------------+
-|| Property                            || Description                        |
-*--------------------------------------+--------------------------------------+
-| <<<yarn.resourcemanager.fs.state-store.retry-policy-spec>>> | |
-| | Hadoop FileSystem client retry policy specification. Hadoop FileSystem client retry | 
-| | is always enabled. Specified in pairs of sleep-time and number-of-retries | 
-| | i.e. (t0, n0), (t1, n1), ..., the first n0 retries sleep t0 milliseconds on |
-| | average, the following n1 retries sleep t1 milliseconds on average, and so on. |
-| | Default value is (2000, 500) |
-*--------------------------------------+--------------------------------------+ 
-
-** Configurations for ZooKeeper based state-store implementation.
-  
-    Configure the ZooKeeper server address and the root path where the RM state is stored.
-
-*--------------------------------------+--------------------------------------+
-|| Property                            || Description                        |
-*--------------------------------------+--------------------------------------+
-| <<<yarn.resourcemanager.zk-address>>> | |
-| | Comma separated list of Host:Port pairs. Each corresponds to a ZooKeeper server |
-| | (e.g. "127.0.0.1:3000,127.0.0.1:3001,127.0.0.1:3002") to be used by the RM |
-| | for storing RM state. |
-*--------------------------------------+--------------------------------------+
-| <<<yarn.resourcemanager.zk-state-store.parent-path>>> | |
-| | The full path of the root znode where RM state will be stored. |
-| | Default value is /rmstore. |
-*--------------------------------------+--------------------------------------+
-
-    Configure the retry policy state-store client uses to connect with the ZooKeeper server.
-
-*--------------------------------------+--------------------------------------+
-|| Property                            || Description                        |
-*--------------------------------------+--------------------------------------+
-| <<<yarn.resourcemanager.zk-num-retries>>> | |
-| | Number of times RM tries to connect to ZooKeeper server if the connection is lost. |
-| | Default value is 500. |
-*--------------------------------------+--------------------------------------+
-| <<<yarn.resourcemanager.zk-retry-interval-ms>>> |
-| | The interval in milliseconds between retries when connecting to a ZooKeeper server. |
-| | Default value is 2 seconds. |
-*--------------------------------------+--------------------------------------+
-| <<<yarn.resourcemanager.zk-timeout-ms>>> | |
-| | ZooKeeper session timeout in milliseconds. This configuration is used by  |
-| | the ZooKeeper server to determine when the session expires. Session expiration |
-| | happens when the server does not hear from the client (i.e. no heartbeat) within the session |
-| | timeout period specified by this configuration. Default |
-| | value is 10 seconds |
-*--------------------------------------+--------------------------------------+
-
-    Configure the ACLs to be used for setting permissions on ZooKeeper znodes.
-
-*--------------------------------------+--------------------------------------+
-|| Property                            || Description                        |
-*--------------------------------------+--------------------------------------+
-| <<<yarn.resourcemanager.zk-acl>>> | |
-| | ACLs to be used for setting permissions on ZooKeeper znodes. Default value is <<<world:anyone:rwcda>>> |
-*--------------------------------------+--------------------------------------+
-
-** Configurations for LevelDB based state-store implementation.
-
-*--------------------------------------+--------------------------------------+
-|| Property                            || Description                        |
-*--------------------------------------+--------------------------------------+
-| <<<yarn.resourcemanager.leveldb-state-store.path>>> | |
-| | Local path where the RM state will be stored. |
-| | Default value is <<<${hadoop.tmp.dir}/yarn/system/rmstore>>> |
-*--------------------------------------+--------------------------------------+
-
-
-**  Configurations for work-preserving RM recovery.
-
-*--------------------------------------+--------------------------------------+
-|| Property                            || Description                        |
-*--------------------------------------+--------------------------------------+
-| <<<yarn.resourcemanager.work-preserving-recovery.scheduling-wait-ms>>> | |
-| | Set the amount of time RM waits before allocating new |
-| | containers on RM work-preserving recovery. Such wait period gives RM a chance | 
-| | to settle down resyncing with NMs in the cluster on recovery, before assigning|
-| |  new containers to applications.|
-*--------------------------------------+--------------------------------------+
-
-* {Notes}
-
-  ContainerId string format is changed if RM restarts with work-preserving recovery enabled.
-  It used to be such format:
-
-   Container_\{clusterTimestamp\}_\{appId\}_\{attemptId\}_\{containerId\}, e.g. Container_1410901177871_0001_01_000005.
-
-  It is now changed to:
-
-   Container_<<e\{epoch\}>>_\{clusterTimestamp\}_\{appId\}_\{attemptId\}_\{containerId\}, e.g. Container_<<e17>>_1410901177871_0001_01_000005.
- 
-  Here, the additional epoch number is a
-  monotonically increasing integer which starts from 0 and is increased by 1 each time
-  RM restarts. If epoch number is 0, it is omitted and the containerId string format
-  stays the same as before.
-
-* {Sample configurations}
-
-   Below is a minimum set of configurations for enabling RM work-preserving restart using ZooKeeper based state store.
-
-+---+
-  <property>
-    <description>Enable RM to recover state after starting. If true, then 
-    yarn.resourcemanager.store.class must be specified</description>
-    <name>yarn.resourcemanager.recovery.enabled</name>
-    <value>true</value>
-  </property>
-
-  <property>
-    <description>The class to use as the persistent store.</description>
-    <name>yarn.resourcemanager.store.class</name>
-    <value>org.apache.hadoop.yarn.server.resourcemanager.recovery.ZKRMStateStore</value>
-  </property>
-
-  <property>
-    <description>Comma separated list of Host:Port pairs. Each corresponds to a ZooKeeper server
-    (e.g. "127.0.0.1:3000,127.0.0.1:3001,127.0.0.1:3002") to be used by the RM for storing RM state.
-    This must be supplied when using org.apache.hadoop.yarn.server.resourcemanager.recovery.ZKRMStateStore
-    as the value for yarn.resourcemanager.store.class</description>
-    <name>yarn.resourcemanager.zk-address</name>
-    <value>127.0.0.1:2181</value>
-  </property>
-+---+
\ No newline at end of file

http://git-wip-us.apache.org/repos/asf/hadoop/blob/aafe5713/hadoop-yarn-project/hadoop-yarn/hadoop-yarn-site/src/site/apt/SecureContainer.apt.vm
----------------------------------------------------------------------
diff --git a/hadoop-yarn-project/hadoop-yarn/hadoop-yarn-site/src/site/apt/SecureContainer.apt.vm b/hadoop-yarn-project/hadoop-yarn/hadoop-yarn-site/src/site/apt/SecureContainer.apt.vm
deleted file mode 100644
index 0365bf7..0000000
--- a/hadoop-yarn-project/hadoop-yarn/hadoop-yarn-site/src/site/apt/SecureContainer.apt.vm
+++ /dev/null
@@ -1,176 +0,0 @@
-~~ Licensed under the Apache License, Version 2.0 (the "License");
-~~ you may not use this file except in compliance with the License.
-~~ You may obtain a copy of the License at
-~~
-~~   http://www.apache.org/licenses/LICENSE-2.0
-~~
-~~ Unless required by applicable law or agreed to in writing, software
-~~ distributed under the License is distributed on an "AS IS" BASIS,
-~~ WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
-~~ See the License for the specific language governing permissions and
-~~ limitations under the License. See accompanying LICENSE file.
-
-  ---
-  YARN Secure Containers
-  ---
-  ---
-  ${maven.build.timestamp}
-
-YARN Secure Containers
-
-%{toc|section=1|fromDepth=0|toDepth=3}
-
-* {Overview}
-
-  YARN containers in a secure cluster use the operating system facilities to offer
-  execution isolation for containers. Secure containers execute under the credentials
-  of the job user. The operating system enforces access restriction for the container.
-  The container must run as the use that submitted the application.
-  
-  Secure Containers work only in the context of secured YARN clusters.
-  
-  ** Container isolation requirements
-  
-    The container executor must access the local files and directories needed by the 
-    container such as jars, configuration files, log files, shared objects etc. Although
-    it is launched by the NodeManager, the container should not have access to the 
-    NodeManager private files and configuration. Container running applications 
-    submitted by different users should be isolated and unable to access each other
-    files and directories. Similar requirements apply to other system non-file securable 
-    objects like named pipes, critical sections, LPC queues, shared memory etc.
-    
-    
-  ** Linux Secure Container Executor
-
-    On Linux environment the secure container executor is the <<<LinuxContainerExecutor>>>.
-    It uses an external program called the <<container-executor>>> to launch the container.
-    This program has the <<<setuid>>> access right flag set which allows it to launch 
-    the container with the permissions of the YARN application user.
-    
-  *** Configuration
-
-      The configured directories for <<<yarn.nodemanager.local-dirs>>> and 
-      <<<yarn.nodemanager.log-dirs>>> must be owned by the configured NodeManager user
-      (<<<yarn>>>) and group (<<<hadoop>>>). The permission set on these directories must
-      be <<<drwxr-xr-x>>>.
-      
-      The <<<container-executor>>> program must be owned by <<<root>>> and have the
-      permission set <<<---sr-s--->>>.
-
-      To configure the <<<NodeManager>>> to use the <<<LinuxContainerExecutor>>> set the following 
-      in the <<conf/yarn-site.xml>>:
-
-+---+
-<property>
-  <name>yarn.nodemanager.container-executor.class</name>
-  <value>org.apache.hadoop.yarn.server.nodemanager.LinuxContainerExecutor</value>
-</property>
-
-<property>
-  <name>yarn.nodemanager.linux-container-executor.group</name>
-  <value>hadoop</value>
-</property>
-+---+
-
-      Additionally the LCE requires the <<<container-executor.cfg>>> file, which is read by the
-      <<<container-executor>>> program. 
-
-+---+
-yarn.nodemanager.linux-container-executor.group=#configured value of yarn.nodemanager.linux-container-executor.group
-banned.users=#comma separated list of users who can not run applications
-allowed.system.users=#comma separated list of allowed system users
-min.user.id=1000#Prevent other super-users
-+---+
-
-   
-  ** Windows Secure Container Executor (WSCE)
-  
-    The Windows environment secure container executor is the <<<WindowsSecureContainerExecutor>>>.
-    It uses the Windows S4U infrastructure to launch the container as the 
-    YARN application user. The WSCE requires the presense of the <<<hadoopwinutilsvc>>>  service. This services
-    is hosted by <<<%HADOOP_HOME%\bin\winutils.exe>>> started with the <<<service>>> command line argument. This
-    service offers some privileged operations that require LocalSystem authority so that the NM is not required
-    to run the entire JVM and all the NM code in an elevated context. The NM interacts with the <<<hadoopwintulsvc>>>
-    service by means of Local RPC (LRPC) via calls JNI to the RCP client hosted in <<<hadoop.dll>>>.
-    
-  *** Configuration
-  
-      To configure the <<<NodeManager>>> to use the <<<WindowsSecureContainerExecutor>>> 
-      set the following in the <<conf/yarn-site.xml>>:
-
-+---+
-<property>
-  <name>yarn.nodemanager.container-executor.class</name>
-  <value>org.apache.hadoop.yarn.server.nodemanager.WindowsSecureContainerExecutor</value>
-</property>
-
-<property>
-  <name>yarn.nodemanager.windows-secure-container-executor.group</name>
-  <value>yarn</value>
-</property>
-+---+
-  *** wsce-site.xml
-  
-      The hadoopwinutilsvc uses <<<%HADOOP_HOME%\etc\hadoop\wsce_site.xml>>> to configure access to the privileged operations.
-
-+---+
-  <property>
-    <name>yarn.nodemanager.windows-secure-container-executor.impersonate.allowed</name>
-    <value>HadoopUsers</value>
-  </property>
-  
-  <property>
-    <name>yarn.nodemanager.windows-secure-container-executor.impersonate.denied</name>
-    <value>HadoopServices,Administrators</value>
-  </property>
-  
-  <property>
-    <name>yarn.nodemanager.windows-secure-container-executor.allowed</name>
-    <value>nodemanager</value>
-  </property>
-
-  <property>
-    <name>yarn.nodemanager.windows-secure-container-executor.local-dirs</name>
-    <value>nm-local-dir, nm-log-dirs</value>
-  </property>
-
-  <property>
-    <name>yarn.nodemanager.windows-secure-container-executor.job-name</name>
-    <value>nodemanager-job-name</value>
-  </property>  
-+---+
-      
-      <<<yarn.nodemanager.windows-secure-container-executor.allowed>>> should contain the name of the service account running the 
-      nodemanager. This user will be allowed to access the hadoopwintuilsvc functions.
-      
-      <<<yarn.nodemanager.windows-secure-container-executor.impersonate.allowed>>> should contain the users that are allowed to create
-      containers in the cluster. These users will be allowed to be impersonated by hadoopwinutilsvc.
-      
-      <<<yarn.nodemanager.windows-secure-container-executor.impersonate.denied>>> should contain users that are explictly forbiden from
-      creating containers. hadoopwinutilsvc will refuse to impersonate these users.
-
-      <<<yarn.nodemanager.windows-secure-container-executor.local-dirs>>> should contain the nodemanager local dirs. hadoopwinutilsvc will
-      allow only file operations under these directories. This should contain the same values as <<<${yarn.nodemanager.local-dirs}, ${yarn.nodemanager.log-dirs}>>> 
-      but note that hadoopwinutilsvc XML configuration processing does not do substitutions so the value must be the final value. All paths 
-      must be absolute and no environment variable substitution will be performed. The paths are compared LOCAL_INVARIANT case insensitive string comparison,
-      the file path validated must start with one of the paths listed in local-dirs configuration. Use comma as path separator:<<<,>>>
-
-      <<<yarn.nodemanager.windows-secure-container-executor.job-name>>> should contain an Windows NT job name that all containers should be added to. 
-      This configuration is optional. If not set, the container is not added to a global NodeManager job. Normally this should be set to the job that the NM is assigned to, 
-      so that killing the NM kills also all containers. Hadoopwinutilsvc will not attempt to create this job, the job must exists when the container is launched.
-      If the value is set and the job does not exists, container launch will fail with error 2 <<<The system cannot find the file specified>>>.
-      Note that this global NM job is not related to the container job, which always gets created for each container and is named after the container ID.
-      This setting controls a global job that spans all containers and the parent NM, and as such it requires nested jobs. 
-      Nested jobs are available only post Windows 8 and Windows Server 2012.
-      
-  *** Useful Links
-    
-    * {{{http://msdn.microsoft.com/en-us/magazine/cc188757.aspx}Exploring S4U Kerberos Extensions in Windows Server 2003}}
-    
-    * {{{http://msdn.microsoft.com/en-us/library/windows/desktop/hh448388(v=vs.85).aspx}Nested Jobs}}
-
-    * {{{https://issues.apache.org/jira/browse/YARN-1063}Winutils needs ability to create task as domain user}}
-    
-    * {{{https://issues.apache.org/jira/browse/YARN-1972}Implement secure Windows Container Executor}}
-
-    * {{{https://issues.apache.org/jira/browse/YARN-2198}Remove the need to run NodeManager as privileged account for Windows Secure Container Executor}}
\ No newline at end of file

http://git-wip-us.apache.org/repos/asf/hadoop/blob/aafe5713/hadoop-yarn-project/hadoop-yarn/hadoop-yarn-site/src/site/apt/TimelineServer.apt.vm
----------------------------------------------------------------------
diff --git a/hadoop-yarn-project/hadoop-yarn/hadoop-yarn-site/src/site/apt/TimelineServer.apt.vm b/hadoop-yarn-project/hadoop-yarn/hadoop-yarn-site/src/site/apt/TimelineServer.apt.vm
deleted file mode 100644
index 6c8203c..0000000
--- a/hadoop-yarn-project/hadoop-yarn/hadoop-yarn-site/src/site/apt/TimelineServer.apt.vm
+++ /dev/null
@@ -1,260 +0,0 @@
-~~ Licensed under the Apache License, Version 2.0 (the "License");
-~~ you may not use this file except in compliance with the License.
-~~ You may obtain a copy of the License at
-~~
-~~   http://www.apache.org/licenses/LICENSE-2.0
-~~
-~~ Unless required by applicable law or agreed to in writing, software
-~~ distributed under the License is distributed on an "AS IS" BASIS,
-~~ WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
-~~ See the License for the specific language governing permissions and
-~~ limitations under the License. See accompanying LICENSE file.
-
-  ---
-  YARN Timeline Server
-  ---
-  ---
-  ${maven.build.timestamp}
-
-YARN Timeline Server
-
-%{toc|section=1|fromDepth=0|toDepth=3}
-
-* Overview
-
-  Storage and retrieval of applications' current as well as historic
-  information in a generic fashion is solved in YARN through the Timeline
-  Server (previously also called Generic Application History Server). This
-  serves two responsibilities:
-
-  ** Generic information about completed applications
-  
-    Generic information includes application level data like queue-name, user
-    information etc in the ApplicationSubmissionContext, list of
-    application-attempts that ran for an application, information about each
-    application-attempt, list of containers run under each application-attempt,
-    and information about each container. Generic data is stored by
-    ResourceManager to a history-store (default implementation on a file-system)
-    and used by the web-UI to display information about completed applications.
-
-  ** Per-framework information of running and completed applications
-
-    Per-framework information is completely specific to an application or
-    framework. For example, Hadoop MapReduce framework can include pieces of
-    information like number of map tasks, reduce tasks, counters etc.
-    Application developers can publish the specific information to the Timeline
-    server via TimelineClient from within a client, the ApplicationMaster
-    and/or the application's containers. This information is then queryable via
-    REST APIs for rendering by application/framework specific UIs. 
-
-* Current Status
-
-  Timeline sever is a work in progress. The basic storage and retrieval of
-  information, both generic and framework specific, are in place. Timeline
-  server doesn't work in secure mode yet. The generic information and the
-  per-framework information are today collected and presented separately and
-  thus are not integrated well together. Finally, the per-framework information
-  is only available via RESTful APIs, using JSON type content - ability to
-  install framework specific UIs in YARN isn't supported yet.
-
-* Basic Configuration
-
-  Users need to configure the Timeline server before starting it. The simplest
-  configuration you should add in <<<yarn-site.xml>>> is to set the hostname of
-  the Timeline server:
-
-+---+
-<property>
-  <description>The hostname of the Timeline service web application.</description>
-  <name>yarn.timeline-service.hostname</name>
-  <value>0.0.0.0</value>
-</property>
-+---+
-
-* Advanced Configuration
-
-  In addition to the hostname, admins can also configure whether the service is
-  enabled or not, the ports of the RPC and the web interfaces, and the number
-  of RPC handler threads.
-
-+---+
-
-<property>
-  <description>Address for the Timeline server to start the RPC server.</description>
-  <name>yarn.timeline-service.address</name>
-  <value>${yarn.timeline-service.hostname}:10200</value>
-</property>
-
-<property>
-  <description>The http address of the Timeline service web application.</description>
-  <name>yarn.timeline-service.webapp.address</name>
-  <value>${yarn.timeline-service.hostname}:8188</value>
-</property>
-
-<property>
-  <description>The https address of the Timeline service web application.</description>
-  <name>yarn.timeline-service.webapp.https.address</name>
-  <value>${yarn.timeline-service.hostname}:8190</value>
-</property>
-
-<property>
-  <description>Handler thread count to serve the client RPC requests.</description>
-  <name>yarn.timeline-service.handler-thread-count</name>
-  <value>10</value>
-</property>
-
-<property>
-  <description>Enables cross-origin support (CORS) for web services where
-  cross-origin web response headers are needed. For example, javascript making
-  a web services request to the timeline server.</description>
-  <name>yarn.timeline-service.http-cross-origin.enabled</name>
-  <value>false</value>
-</property>
-
-<property>
-  <description>Comma separated list of origins that are allowed for web
-  services needing cross-origin (CORS) support. Wildcards (*) and patterns
-  allowed</description>
-  <name>yarn.timeline-service.http-cross-origin.allowed-origins</name>
-  <value>*</value>
-</property>
-
-<property>
-  <description>Comma separated list of methods that are allowed for web
-  services needing cross-origin (CORS) support.</description>
-  <name>yarn.timeline-service.http-cross-origin.allowed-methods</name>
-  <value>GET,POST,HEAD</value>
-</property>
-
-<property>
-  <description>Comma separated list of headers that are allowed for web
-  services needing cross-origin (CORS) support.</description>
-  <name>yarn.timeline-service.http-cross-origin.allowed-headers</name>
-  <value>X-Requested-With,Content-Type,Accept,Origin</value>
-</property>
-
-<property>
-  <description>The number of seconds a pre-flighted request can be cached
-  for web services needing cross-origin (CORS) support.</description>
-  <name>yarn.timeline-service.http-cross-origin.max-age</name>
-  <value>1800</value>
-</property>
-+---+
-
-* Generic-data related Configuration
-
-  Users can specify whether the generic data collection is enabled or not, and
-  also choose the storage-implementation class for the generic data. There are
-  more configurations related to generic data collection, and users can refer
-  to <<<yarn-default.xml>>> for all of them.
-
-+---+
-<property>
-  <description>Indicate to ResourceManager as well as clients whether
-  history-service is enabled or not. If enabled, ResourceManager starts
-  recording historical data that Timelien service can consume. Similarly,
-  clients can redirect to the history service when applications
-  finish if this is enabled.</description>
-  <name>yarn.timeline-service.generic-application-history.enabled</name>
-  <value>false</value>
-</property>
-
-<property>
-  <description>Store class name for history store, defaulting to file system
-  store</description>
-  <name>yarn.timeline-service.generic-application-history.store-class</name>
-  <value>org.apache.hadoop.yarn.server.applicationhistoryservice.FileSystemApplicationHistoryStore</value>
-</property>
-+---+
-
-* Per-framework-date related Configuration
-
-  Users can specify whether per-framework data service is enabled or not,
-  choose the store implementation for the per-framework data, and tune the
-  retention of the per-framework data. There are more configurations related to
-  per-framework data service, and users can refer to <<<yarn-default.xml>>> for
-  all of them.
-
-+---+
-<property>
-  <description>Indicate to clients whether Timeline service is enabled or not.
-  If enabled, the TimelineClient library used by end-users will post entities
-  and events to the Timeline server.</description>
-  <name>yarn.timeline-service.enabled</name>
-  <value>true</value>
-</property>
-
-<property>
-  <description>Store class name for timeline store.</description>
-  <name>yarn.timeline-service.store-class</name>
-  <value>org.apache.hadoop.yarn.server.timeline.LeveldbTimelineStore</value>
-</property>
-
-<property>
-  <description>Enable age off of timeline store data.</description>
-  <name>yarn.timeline-service.ttl-enable</name>
-  <value>true</value>
-</property>
-
-<property>
-  <description>Time to live for timeline store data in milliseconds.</description>
-  <name>yarn.timeline-service.ttl-ms</name>
-  <value>604800000</value>
-</property>
-+---+
-
-* Running Timeline server
-
-  Assuming all the aforementioned configurations are set properly, admins can
-  start the Timeline server/history service with the following command:
-
-+---+
-  $ yarn timelineserver
-+---+
-
-  Or users can start the Timeline server / history service as a daemon:
-
-+---+
-  $ yarn-daemon.sh start timelineserver
-+---+
-
-* Accessing generic-data via command-line
-
-  Users can access applications' generic historic data via the command line as
-  below. Note that the same commands are usable to obtain the corresponding
-  information about running applications.
-
-+---+
-  $ yarn application -status <Application ID>
-  $ yarn applicationattempt -list <Application ID>
-  $ yarn applicationattempt -status <Application Attempt ID>
-  $ yarn container -list <Application Attempt ID>
-  $ yarn container -status <Container ID>
-+---+
-
-* Publishing of per-framework data by applications
-
-  Developers can define what information they want to record for their
-  applications by composing <<<TimelineEntity>>> and <<<TimelineEvent>>>
-  objects, and put the entities and events to the Timeline server via
-  <<<TimelineClient>>>. Below is an example:
-
-+---+
-  // Create and start the Timeline client
-  TimelineClient client = TimelineClient.createTimelineClient();
-  client.init(conf);
-  client.start();
-
-  TimelineEntity entity = null;
-  // Compose the entity
-  try {
-    TimelinePutResponse response = client.putEntities(entity);
-  } catch (IOException e) {
-    // Handle the exception
-  } catch (YarnException e) {
-    // Handle the exception
-  }
-
-  // Stop the Timeline client
-  client.stop();
-+---+

http://git-wip-us.apache.org/repos/asf/hadoop/blob/aafe5713/hadoop-yarn-project/hadoop-yarn/hadoop-yarn-site/src/site/apt/WebApplicationProxy.apt.vm
----------------------------------------------------------------------
diff --git a/hadoop-yarn-project/hadoop-yarn/hadoop-yarn-site/src/site/apt/WebApplicationProxy.apt.vm b/hadoop-yarn-project/hadoop-yarn/hadoop-yarn-site/src/site/apt/WebApplicationProxy.apt.vm
deleted file mode 100644
index 4646235..0000000
--- a/hadoop-yarn-project/hadoop-yarn/hadoop-yarn-site/src/site/apt/WebApplicationProxy.apt.vm
+++ /dev/null
@@ -1,49 +0,0 @@
-~~ Licensed under the Apache License, Version 2.0 (the "License");
-~~ you may not use this file except in compliance with the License.
-~~ You may obtain a copy of the License at
-~~
-~~   http://www.apache.org/licenses/LICENSE-2.0
-~~
-~~ Unless required by applicable law or agreed to in writing, software
-~~ distributed under the License is distributed on an "AS IS" BASIS,
-~~ WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
-~~ See the License for the specific language governing permissions and
-~~ limitations under the License. See accompanying LICENSE file.
-
-  ---
-  YARN
-  ---
-  ---
-  ${maven.build.timestamp}
-
-Web Application Proxy
-
-  The Web Application Proxy is part of YARN.  By default it will run as part of
-  the Resource Manager(RM), but can be configured to run in stand alone mode.
-  The reason for the proxy is to reduce the possibility of web based attacks
-  through YARN.
-
-  In YARN the Application Master(AM) has the responsibility to provide a web UI
-  and to send that link to the RM.  This opens up a number of potential
-  issues.  The RM runs as a trusted user, and people visiting that web
-  address will treat it, and links it provides to them as trusted, when in
-  reality the AM is running as a non-trusted user, and the links it gives to
-  the RM could point to anything malicious or otherwise.  The Web Application
-  Proxy mitigates this risk by warning users that do not own the given
-  application that they are connecting to an untrusted site.
-
-  In addition to this the proxy also tries to reduce the impact that a malicious
-  AM could have on a user.  It primarily does this by stripping out cookies from
-  the user, and replacing them with a single cookie providing the user name of
-  the logged in user.  This is because most web based authentication systems will
-  identify a user based off of a cookie.  By providing this cookie to an
-  untrusted application it opens up the potential for an exploit.  If the cookie
-  is designed properly that potential should be fairly minimal, but this is just
-  to reduce that potential attack vector.  The current proxy implementation does
-  nothing to prevent the AM from providing links to malicious external sites,
-  nor does it do anything to prevent malicious javascript code from running as
-  well.  In fact javascript can be used to get the cookies, so stripping the
-  cookies from the request has minimal benefit at this time.
-
-  In the future we hope to address the attack vectors described above and make
-  attaching to an AM's web UI safer.

http://git-wip-us.apache.org/repos/asf/hadoop/blob/aafe5713/hadoop-yarn-project/hadoop-yarn/hadoop-yarn-site/src/site/apt/WebServicesIntro.apt.vm
----------------------------------------------------------------------
diff --git a/hadoop-yarn-project/hadoop-yarn/hadoop-yarn-site/src/site/apt/WebServicesIntro.apt.vm b/hadoop-yarn-project/hadoop-yarn/hadoop-yarn-site/src/site/apt/WebServicesIntro.apt.vm
deleted file mode 100644
index 5300b94..0000000
--- a/hadoop-yarn-project/hadoop-yarn/hadoop-yarn-site/src/site/apt/WebServicesIntro.apt.vm
+++ /dev/null
@@ -1,593 +0,0 @@
-~~ Licensed under the Apache License, Version 2.0 (the "License");
-~~ you may not use this file except in compliance with the License.
-~~ You may obtain a copy of the License at
-~~
-~~   http://www.apache.org/licenses/LICENSE-2.0
-~~
-~~ Unless required by applicable law or agreed to in writing, software
-~~ distributed under the License is distributed on an "AS IS" BASIS,
-~~ WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
-~~ See the License for the specific language governing permissions and
-~~ limitations under the License. See accompanying LICENSE file.
-
-  ---
-  Hadoop YARN - Introduction to the web services REST API's.
-  ---
-  ---
-  ${maven.build.timestamp}
-
-Hadoop YARN - Introduction to the web services REST API's.
-
-%{toc|section=1|fromDepth=0}
-
-* Overview
-
-  The Hadoop YARN web service REST APIs are a set of URI resources that give access to the cluster, nodes, applications, and application historical information. The URI resources are grouped into APIs based on the type of information returned. Some URI resources return collections while others return singletons. 
-  
-* URI's
-
-  The URIs for the REST-based Web services have the following syntax:
-
-------
-  http://{http address of service}/ws/{version}/{resourcepath}
-------
-
-  The elements in this syntax are as follows:
-
-------
-  {http address of service} - The http address of the service to get information about. 
-                              Currently supported are the ResourceManager, NodeManager, 
-                              MapReduce application master, and history server.
-  {version} - The version of the APIs. In this release, the version is v1.
-  {resourcepath} - A path that defines a singleton resource or a collection of resources. 
-------
-
-* HTTP Requests
-
-  To invoke a REST API, your application calls an HTTP operation on the URI associated with a resource. 
-
-** Summary of HTTP operations
- 
-  Currently only GET is supported. It retrieves information about the resource specified.
-
-** Security
-
-  The web service REST API's go through the same security as the web ui.  If your cluster adminstrators have filters enabled you must authenticate via the mechanism they specified. 
-
-** Headers Supported
-
------
-  * Accept 
-  * Accept-Encoding
------
-
-  Currently the only fields used in the header is Accept and Accept-Encoding.  Accept currently supports XML and JSON for the response type you accept. Accept-Encoding currently only supports gzip format and will return gzip compressed output if this is specified, otherwise output is uncompressed. All other header fields are ignored.
-
-* HTTP Responses
-
-  The next few sections describe some of the syntax and other details of the HTTP Responses of the web service REST APIs.
-
-** Compression 
-
-  This release supports gzip compression if you specify gzip in the Accept-Encoding header of the HTTP request (Accept-Encoding: gzip).
-
-** Response Formats
-
-  This release of the web service REST APIs supports responses in JSON and XML formats. JSON is the default. To set the response format, you can specify the format in the Accept header of the HTTP request. 
-
-  As specified in HTTP Response Codes, the response body can contain the data that represents the resource or an error message. In the case of success, the response body is in the selected format, either JSON or XML. In the case of error, the resonse body is in either JSON or XML based on the format requested. The Content-Type header of the response contains the format requested. If the application requests an unsupported format, the response status code is 500.
-Note that the order of the fields within response body is not specified and might change. Also, additional fields might be added to a response body. Therefore, your applications should use parsing routines that can extract data from a response body in any order.
-
-** Response Errors
-
-  After calling an HTTP request, an application should check the response status code to verify success or detect an error. If the response status code indicates an error, the response body contains an error message. The first field is the exception type, currently only RemoteException is returned. The following table lists the items within the RemoteException error message:
-
-*---------------*--------------*-------------------------------*
-|| Item         || Data Type   || Description                  |
-*---------------+--------------+-------------------------------+
-| exception     | String       | Exception type                |
-*---------------+--------------+-------------------------------+
-| javaClassName | String       | Java class name of exception  |
-*---------------+--------------+-------------------------------+
-| message       | String       | Detailed message of exception |
-*---------------+--------------+-------------------------------+
-
-** Response Examples
-
-*** JSON response with single resource
-
-  HTTP Request:
-  GET http://rmhost.domain:8088/ws/v1/cluster/app/application_1324057493980_0001
-
-  Response Status Line:
-  HTTP/1.1 200 OK
-
-  Response Header:
-
-+---+
-  HTTP/1.1 200 OK
-  Content-Type: application/json
-  Transfer-Encoding: chunked
-  Server: Jetty(6.1.26)
-+---+
-
-  Response Body:
-
-+---+
-{
-  app":
-  {
-    "id":"application_1324057493980_0001",
-    "user":"user1",
-    "name":"",
-    "queue":"default",
-    "state":"ACCEPTED",
-    "finalStatus":"UNDEFINED",
-    "progress":0,
-    "trackingUI":"UNASSIGNED",
-    "diagnostics":"",
-    "clusterId":1324057493980,
-    "startedTime":1324057495921,
-    "finishedTime":0,
-    "elapsedTime":2063,
-    "amContainerLogs":"http:\/\/amNM:2\/node\/containerlogs\/container_1324057493980_0001_01_000001",
-    "amHostHttpAddress":"amNM:2"
-  }
-}
-+---+
-
-*** JSON response with Error response
-
-  Here we request information about an application that doesn't exist yet.
-
-  HTTP Request:
-  GET http://rmhost.domain:8088/ws/v1/cluster/app/application_1324057493980_9999
-
-  Response Status Line:
-  HTTP/1.1 404 Not Found
-
-  Response Header:
-
-+---+
-  HTTP/1.1 404 Not Found
-  Content-Type: application/json
-  Transfer-Encoding: chunked
-  Server: Jetty(6.1.26)
-+---+
-
-  Response Body:
-
-+---+
-{
-   "RemoteException" : {
-      "javaClassName" : "org.apache.hadoop.yarn.webapp.NotFoundException",
-      "exception" : "NotFoundException",
-      "message" : "java.lang.Exception: app with id: application_1324057493980_9999 not found"
-   }
-}
-+---+
-
-* Example usage
-
-  You can use any number of ways/languages to use the web services REST API's. This example uses the curl command line interface to do the REST GET calls.
-
-  In this example, a user submits a MapReduce application to the ResourceManager using a command like: 
-  
-+---+
-  hadoop jar hadoop-mapreduce-test.jar sleep -Dmapred.job.queue.name=a1 -m 1 -r 1 -rt 1200000 -mt 20
-+---+
-
-  The client prints information about the job submitted along with the application id, similar to:
-
-+---+
-12/01/18 04:25:15 INFO mapred.ResourceMgrDelegate: Submitted application application_1326821518301_0010 to ResourceManager at host.domain.com/10.10.10.10:8032
-12/01/18 04:25:15 INFO mapreduce.Job: Running job: job_1326821518301_0010
-12/01/18 04:25:21 INFO mapred.ClientServiceDelegate: The url to track the job: host.domain.com:8088/proxy/application_1326821518301_0010/
-12/01/18 04:25:22 INFO mapreduce.Job: Job job_1326821518301_0010 running in uber mode : false
-12/01/18 04:25:22 INFO mapreduce.Job:  map 0% reduce 0%
-+---+
-
-  The user then wishes to track the application. The users starts by getting the information about the application from the ResourceManager. Use the --comopressed option to request output compressed. curl handles uncompressing on client side.
-
-+---+
-curl --compressed -H "Accept: application/json" -X GET "http://host.domain.com:8088/ws/v1/cluster/apps/application_1326821518301_0010" 
-+---+
-
-  Output:
-
-+---+
-{
-   "app" : {
-      "finishedTime" : 0,
-      "amContainerLogs" : "http://host.domain.com:8042/node/containerlogs/container_1326821518301_0010_01_000001",
-      "trackingUI" : "ApplicationMaster",
-      "state" : "RUNNING",
-      "user" : "user1",
-      "id" : "application_1326821518301_0010",
-      "clusterId" : 1326821518301,
-      "finalStatus" : "UNDEFINED",
-      "amHostHttpAddress" : "host.domain.com:8042",
-      "progress" : 82.44703,
-      "name" : "Sleep job",
-      "startedTime" : 1326860715335,
-      "elapsedTime" : 31814,
-      "diagnostics" : "",
-      "trackingUrl" : "http://host.domain.com:8088/proxy/application_1326821518301_0010/",
-      "queue" : "a1"
-   }
-}
-+---+
-
-  The user then wishes to get more details about the running application and goes directly to the MapReduce application master for this application. The ResourceManager lists the trackingUrl that can be used for this application: http://host.domain.com:8088/proxy/application_1326821518301_0010. This could either go to the web browser or use the web service REST API's. The user uses the web services REST API's to get the list of jobs this MapReduce application master is running:
-
-+---+
- curl --compressed -H "Accept: application/json" -X GET "http://host.domain.com:8088/proxy/application_1326821518301_0010/ws/v1/mapreduce/jobs"
-+---+
-
-  Output:
-
-+---+
-{
-   "jobs" : {
-      "job" : [
-         {
-            "runningReduceAttempts" : 1,
-            "reduceProgress" : 72.104515,
-            "failedReduceAttempts" : 0,
-            "newMapAttempts" : 0,
-            "mapsRunning" : 0,
-            "state" : "RUNNING",
-            "successfulReduceAttempts" : 0,
-            "reducesRunning" : 1,
-            "acls" : [
-               {
-                  "value" : " ",
-                  "name" : "mapreduce.job.acl-modify-job"
-               },
-               {
-                  "value" : " ",
-                  "name" : "mapreduce.job.acl-view-job"
-               }
-            ],
-            "reducesPending" : 0,
-            "user" : "user1",
-            "reducesTotal" : 1,
-            "mapsCompleted" : 1,
-            "startTime" : 1326860720902,
-            "id" : "job_1326821518301_10_10",
-            "successfulMapAttempts" : 1,
-            "runningMapAttempts" : 0,
-            "newReduceAttempts" : 0,
-            "name" : "Sleep job",
-            "mapsPending" : 0,
-            "elapsedTime" : 64432,
-            "reducesCompleted" : 0,
-            "mapProgress" : 100,
-            "diagnostics" : "",
-            "failedMapAttempts" : 0,
-            "killedReduceAttempts" : 0,
-            "mapsTotal" : 1,
-            "uberized" : false,
-            "killedMapAttempts" : 0,
-            "finishTime" : 0
-         }
-      ]
-   }
-}
-+---+
-
-  The user then wishes to get the task details about the job with job id job_1326821518301_10_10 that was listed above. 
-
-+---+
- curl --compressed -H "Accept: application/json" -X GET "http://host.domain.com:8088/proxy/application_1326821518301_0010/ws/v1/mapreduce/jobs/job_1326821518301_10_10/tasks" 
-+---+
-
-  Output:
-
-+---+
-{
-   "tasks" : {
-      "task" : [
-         {
-            "progress" : 100,
-            "elapsedTime" : 5059,
-            "state" : "SUCCEEDED",
-            "startTime" : 1326860725014,
-            "id" : "task_1326821518301_10_10_m_0",
-            "type" : "MAP",
-            "successfulAttempt" : "attempt_1326821518301_10_10_m_0_0",
-            "finishTime" : 1326860730073
-         },
-         {
-            "progress" : 72.104515,
-            "elapsedTime" : 0,
-            "state" : "RUNNING",
-            "startTime" : 1326860732984,
-            "id" : "task_1326821518301_10_10_r_0",
-            "type" : "REDUCE",
-            "successfulAttempt" : "",
-            "finishTime" : 0
-         }
-      ]
-   }
-}
-+---+
-
-  The map task has finished but the reduce task is still running. The users wishes to get the task attempt information for the reduce task task_1326821518301_10_10_r_0, note that the Accept header isn't really required here since JSON is the default output format:
-
-+---+
-  curl --compressed -X GET "http://host.domain.com:8088/proxy/application_1326821518301_0010/ws/v1/mapreduce/jobs/job_1326821518301_10_10/tasks/task_1326821518301_10_10_r_0/attempts"
-+---+
-
-  Output:
-
-+---+
-{
-   "taskAttempts" : {
-      "taskAttempt" : [
-         {
-            "elapsedMergeTime" : 158,
-            "shuffleFinishTime" : 1326860735378,
-            "assignedContainerId" : "container_1326821518301_0010_01_000003",
-            "progress" : 72.104515,
-            "elapsedTime" : 0,
-            "state" : "RUNNING",
-            "elapsedShuffleTime" : 2394,
-            "mergeFinishTime" : 1326860735536,
-            "rack" : "/10.10.10.0",
-            "elapsedReduceTime" : 0,
-            "nodeHttpAddress" : "host.domain.com:8042",
-            "type" : "REDUCE",
-            "startTime" : 1326860732984,
-            "id" : "attempt_1326821518301_10_10_r_0_0",
-            "finishTime" : 0
-         }
-      ]
-   }
-}
-+---+
-
-  The reduce attempt is still running and the user wishes to see the current counter values for that attempt:
-
-+---+
- curl --compressed -H "Accept: application/json"  -X GET "http://host.domain.com:8088/proxy/application_1326821518301_0010/ws/v1/mapreduce/jobs/job_1326821518301_10_10/tasks/task_1326821518301_10_10_r_0/attempts/attempt_1326821518301_10_10_r_0_0/counters" 
-+---+
-
-  Output:
-
-+---+
-{
-   "JobTaskAttemptCounters" : {
-      "taskAttemptCounterGroup" : [
-         {
-            "counterGroupName" : "org.apache.hadoop.mapreduce.FileSystemCounter",
-            "counter" : [
-               {
-                  "value" : 4216,
-                  "name" : "FILE_BYTES_READ"
-               }, 
-               {
-                  "value" : 77151,
-                  "name" : "FILE_BYTES_WRITTEN"
-               }, 
-               {
-                  "value" : 0,
-                  "name" : "FILE_READ_OPS"
-               },
-               {
-                  "value" : 0,
-                  "name" : "FILE_LARGE_READ_OPS"
-               },
-               {
-                  "value" : 0,
-                  "name" : "FILE_WRITE_OPS"
-               },
-               {
-                  "value" : 0,
-                  "name" : "HDFS_BYTES_READ"
-               },
-               {
-                  "value" : 0,
-                  "name" : "HDFS_BYTES_WRITTEN"
-               },
-               {
-                  "value" : 0,
-                  "name" : "HDFS_READ_OPS"
-               },
-               {
-                  "value" : 0,
-                  "name" : "HDFS_LARGE_READ_OPS"
-               },
-               {
-                  "value" : 0,
-                  "name" : "HDFS_WRITE_OPS"
-               }
-            ]  
-         }, 
-         {
-            "counterGroupName" : "org.apache.hadoop.mapreduce.TaskCounter",
-            "counter" : [
-               {
-                  "value" : 0,
-                  "name" : "COMBINE_INPUT_RECORDS"
-               }, 
-               {
-                  "value" : 0,
-                  "name" : "COMBINE_OUTPUT_RECORDS"
-               }, 
-               {  
-                  "value" : 1767,
-                  "name" : "REDUCE_INPUT_GROUPS"
-               },
-               {  
-                  "value" : 25104,
-                  "name" : "REDUCE_SHUFFLE_BYTES"
-               },
-               {
-                  "value" : 1767,
-                  "name" : "REDUCE_INPUT_RECORDS"
-               },
-               {
-                  "value" : 0,
-                  "name" : "REDUCE_OUTPUT_RECORDS"
-               },
-               {
-                  "value" : 0,
-                  "name" : "SPILLED_RECORDS"
-               },
-               {
-                  "value" : 1,
-                  "name" : "SHUFFLED_MAPS"
-               },
-               {
-                  "value" : 0,
-                  "name" : "FAILED_SHUFFLE"
-               },
-               {
-                  "value" : 1,
-                  "name" : "MERGED_MAP_OUTPUTS"
-               },
-               {
-                  "value" : 50,
-                  "name" : "GC_TIME_MILLIS"
-               },
-               {
-                  "value" : 1580,
-                  "name" : "CPU_MILLISECONDS"
-               },
-               {
-                  "value" : 141320192,
-                  "name" : "PHYSICAL_MEMORY_BYTES"
-               },
-              {
-                  "value" : 1118552064,
-                  "name" : "VIRTUAL_MEMORY_BYTES"
-               }, 
-               {  
-                  "value" : 73728000,
-                  "name" : "COMMITTED_HEAP_BYTES"
-               }
-            ]
-         },
-         {  
-            "counterGroupName" : "Shuffle Errors",
-            "counter" : [
-               {  
-                  "value" : 0,
-                  "name" : "BAD_ID"
-               },
-               {  
-                  "value" : 0,
-                  "name" : "CONNECTION"
-               },
-               {  
-                  "value" : 0,
-                  "name" : "IO_ERROR"
-               },
-               {  
-                  "value" : 0,
-                  "name" : "WRONG_LENGTH"
-               },
-               {  
-                  "value" : 0,
-                  "name" : "WRONG_MAP"
-               },
-               {  
-                  "value" : 0,
-                  "name" : "WRONG_REDUCE"
-               }
-            ]
-         },
-         {  
-            "counterGroupName" : "org.apache.hadoop.mapreduce.lib.output.FileOutputFormatCounter",
-            "counter" : [
-              {  
-                  "value" : 0,
-                  "name" : "BYTES_WRITTEN"
-               }
-            ]
-         }
-      ],
-      "id" : "attempt_1326821518301_10_10_r_0_0"
-   }
-}
-+---+
-
-  The job finishes and the user wishes to get the final job information from the history server for this job.  
-
-+---+
-  curl --compressed -X GET "http://host.domain.com:19888/ws/v1/history/mapreduce/jobs/job_1326821518301_10_10" 
-+---+
-
-  Output:
-
-+---+
-{
-   "job" : {
-      "avgReduceTime" : 1250784,
-      "failedReduceAttempts" : 0,
-      "state" : "SUCCEEDED",
-      "successfulReduceAttempts" : 1,
-      "acls" : [
-         {
-            "value" : " ",
-            "name" : "mapreduce.job.acl-modify-job"
-         },
-         {
-            "value" : " ",
-            "name" : "mapreduce.job.acl-view-job"
-         }
-      ],
-      "user" : "user1",
-      "reducesTotal" : 1,
-      "mapsCompleted" : 1,
-      "startTime" : 1326860720902,
-      "id" : "job_1326821518301_10_10",
-      "avgMapTime" : 5059,
-      "successfulMapAttempts" : 1,
-      "name" : "Sleep job",
-      "avgShuffleTime" : 2394,
-      "reducesCompleted" : 1,
-      "diagnostics" : "",
-      "failedMapAttempts" : 0,
-      "avgMergeTime" : 2552,
-      "killedReduceAttempts" : 0,
-      "mapsTotal" : 1,
-      "queue" : "a1",
-      "uberized" : false,
-      "killedMapAttempts" : 0,
-      "finishTime" : 1326861986164
-   }
-}
-+---+
-
-  The user also gets the final applications information from the ResourceManager.
-
-+---+
-  curl --compressed -H "Accept: application/json" -X GET "http://host.domain.com:8088/ws/v1/cluster/apps/application_1326821518301_0010" 
-+---+
-
-  Output:
-
-+---+
-{
-   "app" : {
-      "finishedTime" : 1326861991282,
-      "amContainerLogs" : "http://host.domain.com:8042/node/containerlogs/container_1326821518301_0010_01_000001",
-      "trackingUI" : "History",
-      "state" : "FINISHED",
-      "user" : "user1",
-      "id" : "application_1326821518301_0010",
-      "clusterId" : 1326821518301,
-      "finalStatus" : "SUCCEEDED",
-      "amHostHttpAddress" : "host.domain.com:8042",
-      "progress" : 100,
-      "name" : "Sleep job",
-      "startedTime" : 1326860715335,
-      "elapsedTime" : 1275947,
-      "diagnostics" : "",
-      "trackingUrl" : "http://host.domain.com:8088/proxy/application_1326821518301_0010/jobhistory/job/job_1326821518301_10_10",
-      "queue" : "a1"
-   }
-}
-+---+


Mime
View raw message