drill-commits mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From tshi...@apache.org
Subject [17/30] drill git commit: 1.3 release blog post
Date Mon, 23 Nov 2015 21:54:00 GMT
1.3 release blog post


Project: http://git-wip-us.apache.org/repos/asf/drill/repo
Commit: http://git-wip-us.apache.org/repos/asf/drill/commit/a5a76349
Tree: http://git-wip-us.apache.org/repos/asf/drill/tree/a5a76349
Diff: http://git-wip-us.apache.org/repos/asf/drill/diff/a5a76349

Branch: refs/heads/gh-pages
Commit: a5a763494c50e43c7329455c54b589d5f8b62e83
Parents: 7cb973a
Author: Tomer Shiran <tshiran@gmail.com>
Authored: Mon Nov 23 10:01:04 2015 -0800
Committer: Tomer Shiran <tshiran@gmail.com>
Committed: Mon Nov 23 10:01:04 2015 -0800

----------------------------------------------------------------------
 blog/_posts/2015-11-23-drill-1.3-released.md | 68 +++++++++++++++++++++++
 1 file changed, 68 insertions(+)
----------------------------------------------------------------------


http://git-wip-us.apache.org/repos/asf/drill/blob/a5a76349/blog/_posts/2015-11-23-drill-1.3-released.md
----------------------------------------------------------------------
diff --git a/blog/_posts/2015-11-23-drill-1.3-released.md b/blog/_posts/2015-11-23-drill-1.3-released.md
new file mode 100644
index 0000000..f299320
--- /dev/null
+++ b/blog/_posts/2015-11-23-drill-1.3-released.md
@@ -0,0 +1,68 @@
+---
+layout: post
+title: "Drill 1.3 Released"
+code: drill-1.3-released
+excerpt: Drill 1.3 has been released. Users can now query Hadoop sequence files and text
delimited files with headers. In addition, this release provides significant performance and
usability improvements for working with Amazon S3. Drill 1.3 also adds support for heterogeneous
types, enabling queries on datasets with columns that have more than one data type (commonly
seen in JSON files, MongoDB collections, etc.).
+authors: ["jnadeau"]
+---
+Today I'm happy to announce the availability of the Drill 1.3 release. This release addresses
[58 JIRAs](https://issues.apache.org/jira/secure/ReleaseNote.jspa?projectId=12313820&version=12332946)
on top of the 1.2 release. Highlights include:
+
+## Enhanced Amazon S3 Support
+
+Drill 1.3 utilizes a new library, called s3a, for reading data from S3. The s3a library includes
improvements over the previous s3n library, such as higher performance and the ability to
read large files (over 5GB).
+
+In addition to the new s3a library, Drill 1.3 makes it easier to set up your AWS credentials.
Simply edit the file `conf/core-site.xml` in your Drill install directory. Check out the [step-by-step
instructions in the documentation](/docs/s3-storage-plugin/).
+
+## Heterogeneous Types
+
+
+
+## Text File Headers
+
+Drill is now able to parse the header row in text files (CSV, TSV, etc.). Prior to Drill
1.3, data had to be accessed through the `columns` array:
+
+    SELECT columns[0], columns[1] FROM dfs.`/path/to/users.csv`
+    
+With Drill 1.3, you can use the actual column names in the CSV file:
+
+    SELECT name, address FROM dfs.`/path/to/users.csv`
+
+Follow the [steps in the documentation](/docs/text-files-csv-tsv-psv/) to enable header parsing.
You'll need to set the `extractHeader` parameter in the storage plugin configuration for the
desired file extensions.
+
+## Sequence Files
+
+Drill now [supports sequence files](/docs/querying-sequence-files/), a format commonly used
in the Hadoop ecosystem. A sequence file contains a series of keys and values, and querying
it with Drill is as easy as querying any other self-describing format:
+
+
+    SELECT *
+    FROM dfs.tmp.`simple.seq`
+    LIMIT 1;
+    +--------------+---------------+
+    |  binary_key  | binary_value  |
+    +--------------+---------------+
+    | [B@70828f46  | [B@b8c765f    |
+    +--------------+---------------+
+
+
+Drill's `CONVERT_FROM` function makes it easy to decode the binary values:
+
+
+    SELECT CONVERT_FROM(binary_key, 'UTF8'), CONVERT_FROM(binary_value, 'UTF8')
+    FROM dfs.tmp.`simple.seq`
+    LIMIT 1
+    ;
+    +-----------+-------------+
+    |  EXPR$0   |   EXPR$1    |
+    +-----------+-------------+
+    | key0      |   value0    |
+    +-----------+-------------+
+
+
+## Many More Fixes
+
+Drill 1.3 includes many other improvements, including enhancements related to querying Hive
tables, MongoDB collections and Avro files. Check out the complete list of [fixes and enhancements](https://issues.apache.org/jira/secure/ReleaseNote.jspa?projectId=12313820&version=12332946)
for more information.
+
+Download the [Drill 1.3 release](https://drill.apache.org/download/) now and let us know
your thoughts.
+
+Drill On!
+Jacques Nadeau


Mime
View raw message