drill-commits mailing list archives

From tshi...@apache.org
Subject [06/31] drill git commit: stale avro references
Date Mon, 18 May 2015 23:36:29 GMT
stale avro references


Project: http://git-wip-us.apache.org/repos/asf/drill/repo
Commit: http://git-wip-us.apache.org/repos/asf/drill/commit/99af7ad3
Tree: http://git-wip-us.apache.org/repos/asf/drill/tree/99af7ad3
Diff: http://git-wip-us.apache.org/repos/asf/drill/diff/99af7ad3

Branch: refs/heads/gh-pages
Commit: 99af7ad3e5924c32ad6620bd778b39ebb3d2d480
Parents: 313dbd6
Author: Kristine Hahn <khahn@maprtech.com>
Authored: Sun May 17 12:57:06 2015 -0700
Committer: Kristine Hahn <khahn@maprtech.com>
Committed: Sun May 17 12:57:06 2015 -0700

----------------------------------------------------------------------
 .../010-architecture-introduction.md            | 45 +-------------------
 .../architectural-highlights/010-flexibility.md |  2 +-
 .../120-configuring-the-drill-shell.md          |  2 +-
 .../040-parquet-format.md                       |  2 +-
 .../020-apache-drill-contribution-ideas.md      |  1 -
 _docs/getting-started/010-drill-introduction.md |  4 +-
 6 files changed, 7 insertions(+), 49 deletions(-)
----------------------------------------------------------------------


http://git-wip-us.apache.org/repos/asf/drill/blob/99af7ad3/_docs/architecture/010-architecture-introduction.md
----------------------------------------------------------------------
diff --git a/_docs/architecture/010-architecture-introduction.md b/_docs/architecture/010-architecture-introduction.md
index 67fa416..45fad19 100644
--- a/_docs/architecture/010-architecture-introduction.md
+++ b/_docs/architecture/010-architecture-introduction.md
@@ -48,51 +48,8 @@ The flow of a Drill query typically involves the following steps:
 
 You can access Drill through the following interfaces:
 
-  * [Drill shell (SQLLine)]({{ site.baseurl }}/docs/install-drill)
+  * [Drill shell]({{ site.baseurl }}/docs/install-drill)
   * [Drill Web UI]({{ site.baseurl }}/docs/monitoring-and-canceling-queries-in-the-drill-web-ui)
   * [ODBC/JDBC]({{ site.baseurl }}/docs/odbc-jdbc-interfaces/#using-odbc-to-access-apache-drill-from-bi-tools)

   * C++ API
 
-### **_Dynamic schema discovery_**
-
-Drill does not require schema or type specification for data in order to start
-the query execution process. Drill starts data processing in record-batches
-and discovers the schema during processing. Self-describing data formats such
-as Parquet, JSON, AVRO, and NoSQL databases have schema specified as part of
-the data itself, which Drill leverages dynamically at query time. Because
-schema can change over the course of a Drill query, all Drill operators are
-designed to reconfigure themselves when schemas change.
-
-### **_Flexible data model_**
-
-Drill allows access to nested data attributes, just like SQL columns, and
-provides intuitive extensions to easily operate on them. From an architectural
-point of view, Drill provides a flexible hierarchical columnar data model that
-can represent complex, highly dynamic and evolving data models. Drill allows
-for efficient processing of these models without the need to flatten or
-materialize them at design time or at execution time. Relational data in Drill
-is treated as a special or simplified case of complex/multi-structured data.
-
-### **_De-centralized metadata_**
-
-Drill does not have a centralized metadata requirement. You do not need to
-create and manage tables and views in a metadata repository, or rely on a
-database administrator group for such a function. Drill metadata is derived
-from the storage plugins that correspond to data sources. Storage plugins
-provide a spectrum of metadata ranging from full metadata (Hive), partial
-metadata (HBase), or no central metadata (files). De-centralized metadata
-means that Drill is NOT tied to a single Hive repository. You can query
-multiple Hive repositories at once and then combine the data with information
-from HBase tables or with a file in a distributed file system. You can also
-use SQL DDL syntax to create metadata within Drill, which gets organized just
-like a traditional database. Drill metadata is accessible through the ANSI
-standard INFORMATION_SCHEMA database.
-
-### **_Extensibility_**
-
-Drill provides an extensible architecture at all layers, including the storage
-plugin, query, query optimization/execution, and client API layers. You can
-customize any layer for the specific needs of an organization or you can
-extend the layer to a broader array of use cases. Drill provides a built in
-classpath scanning and plugin concept to add additional storage plugins,
-functions, and operators with minimal configuration.
\ No newline at end of file

http://git-wip-us.apache.org/repos/asf/drill/blob/99af7ad3/_docs/architecture/architectural-highlights/010-flexibility.md
----------------------------------------------------------------------
diff --git a/_docs/architecture/architectural-highlights/010-flexibility.md b/_docs/architecture/architectural-highlights/010-flexibility.md
index 58e6107..a04d4ea 100644
--- a/_docs/architecture/architectural-highlights/010-flexibility.md
+++ b/_docs/architecture/architectural-highlights/010-flexibility.md
@@ -15,7 +15,7 @@ The following features contribute to Drill's flexible architecture:
 Drill does not require schema or type specification for the data in order to
 start the query execution process. Instead, Drill starts processing the data
 in units called record-batches and discovers the schema on the fly during
-processing. Self-describing data formats such as Parquet, JSON, AVRO, and
+processing. Self-describing data formats such as Parquet, JSON, Avro, and
 NoSQL databases have schema specified as part of the data itself, which Drill
 leverages dynamically at query time. Schema can change over the course of a
 Drill query, so all of the Drill operators are designed to reconfigure

http://git-wip-us.apache.org/repos/asf/drill/blob/99af7ad3/_docs/configure-drill/120-configuring-the-drill-shell.md
----------------------------------------------------------------------
diff --git a/_docs/configure-drill/120-configuring-the-drill-shell.md b/_docs/configure-drill/120-configuring-the-drill-shell.md
index b9ff103..d7c474a 100644
--- a/_docs/configure-drill/120-configuring-the-drill-shell.md
+++ b/_docs/configure-drill/120-configuring-the-drill-shell.md
@@ -2,7 +2,7 @@
 title: "Configuring the Drill Shell"
 parent: "Configure Drill"
 ---
-At the Drill shell command prompt, typing "help" lists the configuration and other options you can set to manage shell functionality. Apache Drill 1.0 formats the resultset output tables for readability if possible. In this release, columns having 70 characters or more cannot be formatted. This document formats all output for readability and example purposes.
+After [starting the Drill shell]({{site.baseurl}}/docs/starting-drill-on-linux-and-mac-os-x/), you can type queries on the shell command line. At the Drill shell command prompt, typing "help" lists the configuration and other options you can set to manage shell functionality. Apache Drill 1.0 formats the resultset output tables for readability if possible. In this release, columns having 70 characters or more cannot be formatted. This document formats all output for readability and example purposes.
 
 Formatting tables takes time, which you might notice if running a huge query using the default `outputFormat` setting, which is `table` of the Drill shell. You can set another, more performant table formatting such as `csv`, as shown in the [examples]({{site.baseurl}}/docs/configuring-the-drill-shell/#examples-of-configuring-the-drill-shell).

 

http://git-wip-us.apache.org/repos/asf/drill/blob/99af7ad3/_docs/data-sources-and-file-formats/040-parquet-format.md
----------------------------------------------------------------------
diff --git a/_docs/data-sources-and-file-formats/040-parquet-format.md b/_docs/data-sources-and-file-formats/040-parquet-format.md
index cd14359..ca8b164 100644
--- a/_docs/data-sources-and-file-formats/040-parquet-format.md
+++ b/_docs/data-sources-and-file-formats/040-parquet-format.md
@@ -158,4 +158,4 @@ Parquet supports the following data description languages:
 * Apache Thrift
 * Google Protocol Buffers 
 
-Implement custom storage plugins, such as an Avro plugin, to create Parquet readers/writers for these formats.
\ No newline at end of file
+Implement custom storage plugins to create Parquet readers/writers for formats such as Thrift.

\ No newline at end of file

http://git-wip-us.apache.org/repos/asf/drill/blob/99af7ad3/_docs/developer-information/contribute-to-drill/020-apache-drill-contribution-ideas.md
----------------------------------------------------------------------
diff --git a/_docs/developer-information/contribute-to-drill/020-apache-drill-contribution-ideas.md b/_docs/developer-information/contribute-to-drill/020-apache-drill-contribution-ideas.md
index c3f5a87..a4ef0b8 100644
--- a/_docs/developer-information/contribute-to-drill/020-apache-drill-contribution-ideas.md
+++ b/_docs/developer-information/contribute-to-drill/020-apache-drill-contribution-ideas.md
@@ -54,7 +54,6 @@ Currently Drill supports text, JSON and Parquet file formats natively when
 interacting with file system. More readers/writers can be introduced by
 implementing custom storage plugins. Example formats are.
 
-  * AVRO
   * Sequence
   * RC
   * ORC

http://git-wip-us.apache.org/repos/asf/drill/blob/99af7ad3/_docs/getting-started/010-drill-introduction.md
----------------------------------------------------------------------
diff --git a/_docs/getting-started/010-drill-introduction.md b/_docs/getting-started/010-drill-introduction.md
index 100d9da..3645cfb 100644
--- a/_docs/getting-started/010-drill-introduction.md
+++ b/_docs/getting-started/010-drill-introduction.md
@@ -16,7 +16,9 @@ Apache Drill 1.0 offers the following new features:
 * [Query audit logging]({{site.baseurl}}/docs/getting-query-information/) for getting the query history on a Drillbit.
 * Improved connection handling.
 * New Errors tab in the Query Profiles UI that facilitates troubleshooting and distributed storing of profiles.
-* Support for new storage plugin format: [Avro](http://avro.apache.org/docs/current/spec.html)
+* Support for a new storage plugin input format: [Avro](http://avro.apache.org/docs/current/spec.html)
+
+In this release, Drill disables the DECIMAL data type, including casting to DECIMAL and reading DECIMAL types from Parquet and Hive. To enable the DECIMAL type, set the `planner.enable_decimal_data_type` system option to `true`.
 
 Key features of Apache Drill are:
 

