beam-commits mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
Subject [1/2] beam git commit: Remove apex This information is already in the website in quickstart and apex runner pages. moves small bits of missing content.
Date Mon, 08 May 2017 21:28:48 GMT
Repository: beam
Updated Branches:
  refs/heads/master d1914ed0c -> 962e8796d

Remove apex This information is already in the website in
quickstart and apex runner pages. moves small bits of missing


Branch: refs/heads/master
Commit: b7635aadc2532b5c3bdc38f070865091ebf901c1
Parents: d1914ed
Author: Ahmet Altay <>
Authored: Mon May 8 13:42:02 2017 -0700
Committer: Ahmet Altay <>
Committed: Mon May 8 14:28:31 2017 -0700

 runners/apex/ | 76 ---------------------------------------------
 1 file changed, 76 deletions(-)
diff --git a/runners/apex/ b/runners/apex/
deleted file mode 100644
index b9bc74f..0000000
--- a/runners/apex/
+++ /dev/null
@@ -1,76 +0,0 @@
-    Licensed to the Apache Software Foundation (ASF) under one
-    or more contributor license agreements.  See the NOTICE file
-    distributed with this work for additional information
-    regarding copyright ownership.  The ASF licenses this file
-    to you under the Apache License, Version 2.0 (the
-    "License"); you may not use this file except in compliance
-    with the License.  You may obtain a copy of the License at
-    Unless required by applicable law or agreed to in writing,
-    software distributed under the License is distributed on an
-    KIND, either express or implied.  See the License for the
-    specific language governing permissions and limitations
-    under the License.
-Apex Beam Runner ﴾Apex‐Runner﴿
-Apex‐Runner is a Runner for Apache Beam which executes Beam pipelines with Apache Apex
as underlying engine. The runner has broad support for the Beam model and supports streaming
and batch pipelines. 
-[Apache Apex]( is a stream processing platform and framework for
low-latency, high-throughput and fault-tolerant analytics applications on Apache Hadoop. Apex
is Java based and also provides its own API for application development (native compositional
and declarative Java API, SQL) with a comprehensive [operator library](
Apex has a unified streaming architecture and can be used for real-time and batch processing.
With its stateful stream processing architecture Apex can support all of the concepts in the
Beam model (event time, triggers, watermarks etc.).
-Apex-Runner is relatively new. It is fully functional and can currently be used to run pipelines
in embedded mode. It does not take advantage of all the performance and scalability that Apex
can deliver. This is expected to be addressed with upcoming work, leveraging features like
incremental checkpointing, partitioning and operator affinity from Apex. Please see [JIRA](
and we welcome contributions!
-##Getting Started
-The following shows how to run the WordCount example that is provided with the source code
on Apex (the example is identical with the one provided as part of the Beam examples). 
-###Installing Beam
-To get the latest version of Beam with Apex-Runner, first clone the Beam repository:
-git clone
-Then switch to the newly created directory and run Maven to build the Apache Beam:
-cd beam
-mvn clean install -DskipTests
-Now Apache Beam and the Apex Runner are installed in your local Maven repository.
-###Running an Example
-Download something to count:
-curl > /tmp/kinglear.txt
-Run the pipeline, using the Apex runner:
-cd examples/java
-mvn exec:java -Dexec.mainClass=org.apache.beam.examples.WordCount -Dexec.args="--inputFile=/tmp/kinglear.txt
--output=/tmp/wordcounts.txt --runner=ApexRunner" -Papex-runner
-Once completed, there will be multiple output files with the base name given above:
-$ ls /tmp/out-*
-/tmp/out-00000-of-00003  /tmp/out-00001-of-00003  /tmp/out-00002-of-00003
-##Running pipelines on an Apex YARN cluster
-Coming soon.

View raw message