incubator-hcatalog-commits mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
Subject svn commit: r1176471 - in /incubator/hcatalog/trunk: CHANGES.txt README.txt
Date Tue, 27 Sep 2011 17:06:19 GMT
Author: khorgath
Date: Tue Sep 27 17:06:18 2011
New Revision: 1176471

HCATALOG-114 : Removing duplicate info from README.txt (ctdean via khorgath)


Modified: incubator/hcatalog/trunk/CHANGES.txt
--- incubator/hcatalog/trunk/CHANGES.txt (original)
+++ incubator/hcatalog/trunk/CHANGES.txt Tue Sep 27 17:06:18 2011
@@ -45,6 +45,8 @@ Trunk (unreleased changes)
   HCAT-78. Update notice.txt and headers and add Disclaimer.txt (hashutosh and gates via
+  HCAT-114. README.txt has duplicate info (ctdean via khorgath)
 Release 0.2.0 - Unreleased

Modified: incubator/hcatalog/trunk/README.txt
--- incubator/hcatalog/trunk/README.txt (original)
+++ incubator/hcatalog/trunk/README.txt Tue Sep 27 17:06:18 2011
@@ -34,43 +34,3 @@ For the latest information about HCatalo
 and our wiki, at:
-Apache HCatalog
-HCatalog is a table and storage management service for data created using Apache 
-The vision of HCatalog is to provide table management and storage management layers
-for Apache Hadoop. This includes:
- * Providing a shared schema and data type mechanism.
- * Providing a table abstraction so that users need not be concerned with where
-   or how their data is stored.
- * Providing interoperability across data processing tools such as Pig, Map
-   Reduce, Streaming, and Hive. 
-Data processors using Apache Hadoop have a common need for table management
-services. The goal of this table management service is to track data that exists in
-a Hadoop grid and present that data to users in a tabular format. HCatalog
-provides a single input and output format to users so that individual users need
-not be concerned with the storage formats that are chosen for particular data
-sets. Data is described by a schema and shares a datatype system.
-Users are free to choose the best tools for their use cases. The Hadoop project
-includes Map Reduce, Streaming, Pig, and Hive, and additional tools exist such
-as Cascading. Each of these tools has users who prefer it, and there are use
-cases best addressed by each of these tools. Two users on the same grid who
-share data are not constrained to use the same tool but with HCatalog are free
-to choose the best tool for their use case.  HCatalog presents data in the same
-way to all of the tools, providing interfaces to each of them.
-For the latest information about HCatalog, please visit our website at:
-and our wiki, at:

View raw message