carbondata-commits mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
Subject [1/2] incubator-carbondata git commit: update readme as per IPMC comments
Date Wed, 09 Nov 2016 05:18:32 GMT
Repository: incubator-carbondata
Updated Branches:
  refs/heads/master 6c907dcc1 -> 97377afae

update readme as per IPMC comments


Branch: refs/heads/master
Commit: a8b33f24b387b117383f162627dc956caf152b6e
Parents: 6c907dc
Author: chenliang613 <>
Authored: Wed Nov 9 10:11:51 2016 +0800
Committer: chenliang613 <>
Committed: Wed Nov 9 10:11:51 2016 +0800

---------------------------------------------------------------------- | 9 ++++-----
 1 file changed, 4 insertions(+), 5 deletions(-)
diff --git a/ b/
index 31af7ed..f3e5e8a 100644
--- a/
+++ b/
@@ -19,11 +19,13 @@
 <img src="/docs/images/format/CarbonData_logo.png" width="200" height="40">
-Apache CarbonData is a new big data file format for faster
+Apache CarbonData(incubating) is a new big data file format for faster
 interactive query using advanced columnar storage, index, compression
 and encoding techniques to improve computing efficiency, in turn it will 
 help speedup queries an order of magnitude faster over PetaBytes of data. 
+You can find the latest CarbonData document and learn more at [CarbonData cwiki](
 ### Features
 CarbonData file format is a columnar store in HDFS, it has many features that a modern columnar
format has, such as splittable, compression schema ,complex data type etc, and CarbonData
has following unique features:
 * Stores data along with index: it can significantly accelerate query performance and reduces
the I/O scans and CPU resources, where there are filters in the query.  CarbonData index consists
of multiple level of indices, a processing framework can leverage this index to reduce the
task it needs to schedule and process, and it can also do skip scan in more finer grain unit
(called blocklet) in task side scanning instead of scanning the whole file. 
@@ -31,9 +33,6 @@ CarbonData file format is a columnar store in HDFS, it has many features
that a
 * Column group: Allow multiple columns to form a column group that would be stored as row
format. This reduces the row reconstruction cost at query time.
 * Supports for various use cases with one single Data format : like interactive OLAP-style
query, Sequential Access (big scan), Random Access (narrow scan). 
-### Documentation
-Please visit [CarbonData cwiki](
 ### Building CarbonData,using development tools and cluster deployment guide
 Please refer [Building CarbonData and Configuring IDE](
@@ -73,4 +72,4 @@ To get involved in CarbonData:
 ## About
 Apache CarbonData is an open source project of The Apache Software Foundation (ASF).
-CarbonData project original contributed from the [Huawei](

View raw message