Subject: svn commit: r1353693 [1/2] - in /incubator/bigtop/branches/branch-0.4/bigtop-tests/test-artifacts/package/src/main/resources: apt/package_data.xml package_data.xml yum/package_data.xml zypper/package_data.xml
Date: Mon, 25 Jun 2012 18:23:02 -0000
To: bigtop-commits@incubator.apache.org
From: rvs@apache.org
Message-Id: <20120625182303.5793423889FD@eris.apache.org>

Author: rvs
Date: Mon Jun 25 18:23:02 2012
New Revision: 1353693

URL: http://svn.apache.org/viewvc?rev=1353693&view=rev
Log:
BIGTOP-638. move block from common package_data.xml to each platform's (yum, apt, zypper) package_data.xml (Johnny Zhang via rvs)
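[Editor's note: the "block" being moved is per-service init metadata. A minimal sketch of one such entry follows, assuming hypothetical element names (<service>, <runlevel>, <oninstall>, <configured>, <config>) because the archived diff lost its XML markup; the values -- runlevels 2-5, a start-on-install action, and the hbase-site.xml settings -- are taken from the surviving diff text:]

    <!-- Sketch only: element names are assumptions; values come from the diff. -->
    <service name="hbase-master">
      <runlevel>2</runlevel>
      <runlevel>3</runlevel>
      <runlevel>4</runlevel>
      <runlevel>5</runlevel>
      <oninstall>start</oninstall>
      <configured>true</configured>
      <config file="/etc/hbase/conf/hbase-site.xml">
        <property>
          <name>hbase.rootdir</name>
          <value>hdfs://localhost/hbase</value>
        </property>
        <property>
          <name>hbase.cluster.distributed</name>
          <value>true</value>
        </property>
      </config>
    </service>

[Keeping these entries per platform lets apt, yum, and zypper carry different runlevels and on-install actions for the same package.]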
Modified:
    incubator/bigtop/branches/branch-0.4/bigtop-tests/test-artifacts/package/src/main/resources/apt/package_data.xml
    incubator/bigtop/branches/branch-0.4/bigtop-tests/test-artifacts/package/src/main/resources/package_data.xml
    incubator/bigtop/branches/branch-0.4/bigtop-tests/test-artifacts/package/src/main/resources/yum/package_data.xml
    incubator/bigtop/branches/branch-0.4/bigtop-tests/test-artifacts/package/src/main/resources/zypper/package_data.xml

Modified: incubator/bigtop/branches/branch-0.4/bigtop-tests/test-artifacts/package/src/main/resources/apt/package_data.xml
URL: http://svn.apache.org/viewvc/incubator/bigtop/branches/branch-0.4/bigtop-tests/test-artifacts/package/src/main/resources/apt/package_data.xml?rev=1353693&r1=1353692&r2=1353693&view=diff
==============================================================================
--- incubator/bigtop/branches/branch-0.4/bigtop-tests/test-artifacts/package/src/main/resources/apt/package_data.xml (original)
+++ incubator/bigtop/branches/branch-0.4/bigtop-tests/test-artifacts/package/src/main/resources/apt/package_data.xml Mon Jun 25 18:23:02 2012
[diff body unrecoverable: the XML markup was stripped when this mail was archived, leaving only element text. The surviving text shows the existing per-package user definitions (e.g. /var/run/flume / Flume User / /bin/false, with equivalents for sqoop, zookeeper, oozie, hive, hbase, hadoop-hdfs, hadoop-yarn, hadoop-mapreduce, and hadoop-httpfs) and, on the added side, the relocated service init metadata: runlevels 2-5, start- or stop-on-install actions, configured flags, and config entries such as hbase.rootdir=hdfs://localhost/hbase and hbase.cluster.distributed=true in /etc/hbase/conf/hbase-site.xml, and yarn.web-proxy.address=0.0.0.0:8032 in /etc/hadoop/conf/yarn-site.xml.]

Modified: incubator/bigtop/branches/branch-0.4/bigtop-tests/test-artifacts/package/src/main/resources/package_data.xml
URL: http://svn.apache.org/viewvc/incubator/bigtop/branches/branch-0.4/bigtop-tests/test-artifacts/package/src/main/resources/package_data.xml?rev=1353693&r1=1353692&r2=1353693&view=diff
==============================================================================
--- incubator/bigtop/branches/branch-0.4/bigtop-tests/test-artifacts/package/src/main/resources/package_data.xml (original)
+++ incubator/bigtop/branches/branch-0.4/bigtop-tests/test-artifacts/package/src/main/resources/package_data.xml Mon Jun 25 18:23:02 2012
[diff body unrecoverable: XML markup stripped in archiving. The surviving text is the shared, platform-neutral package metadata -- summaries, long descriptions, and project URLs for bigtop-utils, bigtop-jsvc, bigtop-tomcat, mahout, giraph, whirr, flume and flume-node, sqoop and sqoop-metastore, oozie and oozie-client, zookeeper and zookeeper-server, pig, hive with its metastore and server packages, hbase with its doc/master/regionserver/thrift/rest packages, hadoop with its hdfs/yarn/mapreduce/httpfs, namenode, zkfc, secondarynamenode, datanode, resourcemanager, nodemanager, proxyserver, and historyserver packages, hadoop-conf-pseudo, hadoop-doc, hadoop-client, the mountable-HDFS (fuse) package, and hue with its server, beeswax, and metapackage -- all of which stays in this file, while the removed lines are the per-service runlevel/on-install blocks and configured config entries that the commit relocates to the platform-specific files.]
Modified: incubator/bigtop/branches/branch-0.4/bigtop-tests/test-artifacts/package/src/main/resources/yum/package_data.xml
URL: http://svn.apache.org/viewvc/incubator/bigtop/branches/branch-0.4/bigtop-tests/test-artifacts/package/src/main/resources/yum/package_data.xml?rev=1353693&r1=1353692&r2=1353693&view=diff
==============================================================================
--- incubator/bigtop/branches/branch-0.4/bigtop-tests/test-artifacts/package/src/main/resources/yum/package_data.xml (original)
+++ incubator/bigtop/branches/branch-0.4/bigtop-tests/test-artifacts/package/src/main/resources/yum/package_data.xml Mon Jun 25 18:23:02 2012
[diff body unrecoverable: XML markup stripped in archiving. The surviving hunks (@@ -97,6 +116,16 @@ through @@ -311,4 +528,15 @@) add the same relocated service init metadata to each daemon package: runlevels (3-5 or 2-5), on-install actions, configured flags, and the hbase-site.xml and yarn-site.xml config entries noted above. Notably, where the apt file's entries say "start", the yum entries mostly say "stop", consistent with Debian packages starting services on install while RPM-based platforms do not. The zypper/package_data.xml hunk is not in this mail; per the [1/2] subject tag it presumably follows in part 2.]
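[Editor's note: to make the resulting layout concrete, a sketch of the split for one package follows. Element names are assumptions (the archived diff lost its markup), and the mapping of hunks to packages is inferred from ordering; the values themselves -- zookeeper-server's summary, its /self dependency, and the differing runlevels and on-install actions -- are taken from the surviving diff text:]

    <!-- package_data.xml (common): platform-neutral metadata stays here. -->
    <zookeeper-server>
      <summary>The Hadoop Zookeeper server</summary>
      <description>This package starts the zookeeper server on startup</description>
      <url>http://zookeeper.apache.org/</url>
      <deps>
        <zookeeper>/self</zookeeper>
      </deps>
    </zookeeper-server>

    <!-- apt/package_data.xml: Debian policy starts services on install. -->
    <service name="zookeeper-server">
      <runlevel>2</runlevel>
      <runlevel>3</runlevel>
      <runlevel>4</runlevel>
      <runlevel>5</runlevel>
      <oninstall>start</oninstall>
      <configured>true</configured>
    </service>

    <!-- yum/package_data.xml (zypper analogous): RPM platforms do not. -->
    <service name="zookeeper-server">
      <runlevel>3</runlevel>
      <runlevel>4</runlevel>
      <runlevel>5</runlevel>
      <oninstall>stop</oninstall>
      <configured>true</configured>
    </service>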