drill-commits mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From krish...@apache.org
Subject [06/11] drill git commit: 1.4 updates
Date Mon, 14 Dec 2015 23:48:57 GMT
1.4 updates


Project: http://git-wip-us.apache.org/repos/asf/drill/repo
Commit: http://git-wip-us.apache.org/repos/asf/drill/commit/af189bfe
Tree: http://git-wip-us.apache.org/repos/asf/drill/tree/af189bfe
Diff: http://git-wip-us.apache.org/repos/asf/drill/diff/af189bfe

Branch: refs/heads/gh-pages
Commit: af189bfea5e6a7e30ced3aa83722f41d6617b626
Parents: a694d58
Author: Kris Hahn <krishahn@apache.org>
Authored: Mon Dec 14 11:04:40 2015 -0800
Committer: Kris Hahn <krishahn@apache.org>
Committed: Mon Dec 14 15:46:38 2015 -0800

----------------------------------------------------------------------
 .../partition-pruning/010-partition-pruning-introduction.md  | 2 +-
 .../partition-pruning/030-using-partition-pruning.md         | 8 ++------
 2 files changed, 3 insertions(+), 7 deletions(-)
----------------------------------------------------------------------


http://git-wip-us.apache.org/repos/asf/drill/blob/af189bfe/_docs/performance-tuning/partition-pruning/010-partition-pruning-introduction.md
----------------------------------------------------------------------
diff --git a/_docs/performance-tuning/partition-pruning/010-partition-pruning-introduction.md
b/_docs/performance-tuning/partition-pruning/010-partition-pruning-introduction.md
index 2a94e3d..0271bec 100755
--- a/_docs/performance-tuning/partition-pruning/010-partition-pruning-introduction.md
+++ b/_docs/performance-tuning/partition-pruning/010-partition-pruning-introduction.md
@@ -13,7 +13,7 @@ Before using partitioned Drill 1.1-1.2 data in Drill 1.3, you need to migrate
th
 {% include startimportant.html %}Migrate only Parquet files that Drill generated.{% include
endimportant.html %}
 
 ## Partitioning Data
-Prior to the release of Drill 1.1, partition pruning involved time-consuming manual setup
tasks. Using the PARTITION BY clause in the CTAS command simplifies the process. ["How to
Partition Data"]{{site.baseurl}}(/docs/using-partition-pruning/#how-to-partition-data) describes
this process.
+Prior to the release of Drill 1.1, partition pruning involved time-consuming manual setup
tasks. Using the PARTITION BY clause in the CTAS command simplifies the process.
 
 
 

http://git-wip-us.apache.org/repos/asf/drill/blob/af189bfe/_docs/performance-tuning/partition-pruning/030-using-partition-pruning.md
----------------------------------------------------------------------
diff --git a/_docs/performance-tuning/partition-pruning/030-using-partition-pruning.md b/_docs/performance-tuning/partition-pruning/030-using-partition-pruning.md
index e6620cb..ed20bcf 100755
--- a/_docs/performance-tuning/partition-pruning/030-using-partition-pruning.md
+++ b/_docs/performance-tuning/partition-pruning/030-using-partition-pruning.md
@@ -1,13 +1,9 @@
 ---
-title: "Using Partition Pruning"
+title: "How to Partition Data"
 parent: "Partition Pruning"
 --- 
 
-In Drill 1.1.0 and later, if the data source is Parquet, no data organization tasks are required
to take advantage of partition pruning. To partition and query Parquet files generated from
other tools, use Drill to read and rewrite the files and metadata using the CTAS command with
the PARTITION BY clause, as described in the following section "How to Partition Data".
-
-## How to Partition Data
-
-In Drill 1.1.0 and later, write Parquet data using the [PARTITION BY]({{site.baseurl}}/docs/partition-by-clause/)
clause in the CTAS statement. 
+In Drill 1.1.0 and later, if the data source is Parquet, no data organization tasks are required
to take advantage of partition pruning. To partition and query Parquet files generated from
other tools, use Drill to read and rewrite the files and metadata using the CTAS command with
the [PARTITION BY]({{site.baseurl}}/docs/partition-by-clause/) clause in the CTAS statement.

 
 The Parquet writer first sorts data by the partition keys, and then creates a new file when
it encounters a new value for the partition columns. During partitioning, Drill creates separate
files, but not separate directories, for different partitions. Each file contains exactly
one partition value, but there can be multiple files for the same partition value. 
 


Mime
View raw message