drill-commits mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From bridg...@apache.org
Subject [16/31] drill git commit: 1.3 release blog post
Date Wed, 25 Nov 2015 22:03:04 GMT
1.3 release blog post


Project: http://git-wip-us.apache.org/repos/asf/drill/repo
Commit: http://git-wip-us.apache.org/repos/asf/drill/commit/64d160b8
Tree: http://git-wip-us.apache.org/repos/asf/drill/tree/64d160b8
Diff: http://git-wip-us.apache.org/repos/asf/drill/diff/64d160b8

Branch: refs/heads/gh-pages
Commit: 64d160b86b7406064f11cce3fdaf85381fa58a0f
Parents: c7bfffa
Author: Tomer Shiran <tshiran@gmail.com>
Authored: Mon Nov 23 10:01:04 2015 -0800
Committer: Kristine Hahn <khahn@maprtech.com>
Committed: Wed Nov 25 10:13:43 2015 -0800

----------------------------------------------------------------------
 blog/_posts/2015-11-23-drill-1.3-released.md | 68 +++++++++++++++++++++++
 1 file changed, 68 insertions(+)
----------------------------------------------------------------------


http://git-wip-us.apache.org/repos/asf/drill/blob/64d160b8/blog/_posts/2015-11-23-drill-1.3-released.md
----------------------------------------------------------------------
diff --git a/blog/_posts/2015-11-23-drill-1.3-released.md b/blog/_posts/2015-11-23-drill-1.3-released.md
new file mode 100644
index 0000000..f299320
--- /dev/null
+++ b/blog/_posts/2015-11-23-drill-1.3-released.md
@@ -0,0 +1,68 @@
+---
+layout: post
+title: "Drill 1.3 Released"
+code: drill-1.3-released
+excerpt: Drill 1.3 has been released. Users can now query Hadoop sequence files and text
delimited files with headers. In addition, this release provides significant performance and
usability improvements for working with Amazon S3. Drill 1.3 also adds support for heterogeneous
types, enabling queries on datasets with columns that have more than one data type (commonly
seen in JSON files, MongoDB collections, etc.).
+authors: ["jnadeau"]
+---
+Today I'm happy to announce the availability of the Drill 1.3 release. This release addresses
[58 JIRAs](https://issues.apache.org/jira/secure/ReleaseNote.jspa?projectId=12313820&version=12332946)
on top of the 1.2 release. Highlights include:
+
+## Enhanced Amazon S3 Support
+
+Drill 1.3 utilizes a new library, called s3a, for reading data from S3. The s3a library includes
improvements over the previous s3n library, such as higher performance and the ability to
read large files (over 5GB).
+
+In addition to the new s3a library, Drill 1.3 makes it easier to set up your AWS credentials.
Simply edit the file `conf/core-site.xml` in your Drill install directory. Check out the [step-by-step
instructions in the documentation](/docs/s3-storage-plugin/).
+
+## Heterogeneous Types
+
+
+
+## Text File Headers
+
+Drill is now able to parse the header row in text files (CSV, TSV, etc.). Prior to Drill
1.3, data had to be accessed through the `columns` array:
+
+    SELECT columns[0], columns[1] FROM dfs.`/path/to/users.csv`
+    
+With Drill 1.3, you can use the actual column names in the CSV file:
+
+    SELECT name, address FROM dfs.`/path/to/users.csv`
+
+Follow the [steps in the documentation](/docs/text-files-csv-tsv-psv/) to enable header parsing.
You'll need to set the `extractHeader` parameter in the storage plugin configuration for the
desired file extensions.
+
+## Sequence Files
+
+Drill now [supports sequence files](/docs/querying-sequence-files/), a format commonly used
in the Hadoop ecosystem. A sequence file contains a series of keys and values, and querying
it with Drill is as easy as querying any other self-describing format:
+
+
+    SELECT *
+    FROM dfs.tmp.`simple.seq`
+    LIMIT 1;
+    +--------------+---------------+
+    |  binary_key  | binary_value  |
+    +--------------+---------------+
+    | [B@70828f46  | [B@b8c765f    |
+    +--------------+---------------+
+
+
+Drill's `CONVERT_FROM` function makes it easy to decode the binary values:
+
+
+    SELECT CONVERT_FROM(binary_key, 'UTF8'), CONVERT_FROM(binary_value, 'UTF8')
+    FROM dfs.tmp.`simple.seq`
+    LIMIT 1
+    ;
+    +-----------+-------------+
+    |  EXPR$0   |   EXPR$1    |
+    +-----------+-------------+
+    | key0      |   value0    |
+    +-----------+-------------+
+
+
+## Many More Fixes
+
+Drill 1.3 includes many other improvements, including enhancements related to querying Hive
tables, MongoDB collections and Avro files. Check out the complete list of [fixes and enhancements](https://issues.apache.org/jira/secure/ReleaseNote.jspa?projectId=12313820&version=12332946)
for more information.
+
+Download the [Drill 1.3 release](https://drill.apache.org/download/) now and let us know
your thoughts.
+
+Drill On!
+Jacques Nadeau


Mime
View raw message