drill-commits mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From tshi...@apache.org
Subject [1/3] drill-site git commit: Website update
Date Tue, 19 May 2015 12:01:27 GMT
Repository: drill-site
Updated Branches:
  refs/heads/asf-site 763d5b9d1 -> 1e0ae3431

diff --git a/feed.xml b/feed.xml
index f6b13ba..60b7d81 100644
--- a/feed.xml
+++ b/feed.xml
@@ -1,16 +1,109 @@
 <?xml version="1.0" encoding="UTF-8"?>
 <rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom">
-    <title>Apache Drill - Self-Service Data Exploration</title>
+    <title>Apache Drill - Schema-free SQL Query Engine for Hadoop, NoSQL and Cloud
     <description>Write an awesome description for your new site here. You can edit
this line in _config.yml. It will appear in your document head meta (for Google search results)
and in your feed.xml site description.
     <atom:link href="/feed.xml" rel="self" type="application/rss+xml"/>
-    <pubDate>Mon, 18 May 2015 16:35:27 -0700</pubDate>
-    <lastBuildDate>Mon, 18 May 2015 16:35:27 -0700</lastBuildDate>
+    <pubDate>Mon, 18 May 2015 23:51:33 -0700</pubDate>
+    <lastBuildDate>Mon, 18 May 2015 23:51:33 -0700</lastBuildDate>
     <generator>Jekyll v2.5.1</generator>
+        <title>The Apache Software Foundation Announces Apache Drill 1.0</title>
+        <description>&lt;p&gt;&lt;strong&gt;Thousands of users adopt
Open Source, enterprise-grade, schema-free SQL query engine for Apache Hadoop®, NoSQL and
Cloud storage&lt;/strong&gt;&lt;/p&gt;
+&lt;p&gt;Forest Hill, MD --19 May 2015-- The Apache Software Foundation (ASF), the
all-volunteer developers, stewards, and incubators of more than 350 Open Source projects and
initiatives, announced today the availability of Apache™ Drill™ 1.0, the schema-free SQL
query engine for Apache Hadoop®, NoSQL and Cloud storage.&lt;/p&gt;
+&lt;p&gt;&amp;quot;The production-ready 1.0 release represents a significant
milestone for the Drill project,&amp;quot; said Tomer Shiran, member of the Apache Drill
Project Management Committee. &amp;quot;It is the outcome of almost three years of development
involving dozens of engineers from numerous companies. Apache Drill&amp;#39;s flexibility
and ease-of-use have attracted thousands of users, and the enterprise-grade reliability, security
and performance in the 1.0 release will further accelerate adoption.&amp;quot;&lt;/p&gt;
+&lt;p&gt;With the exponential growth of data in recent years, and the shift towards
rapid application development, new data is increasingly being stored in non-relational, schema-free
datastores including Hadoop, NoSQL and Cloud storage. Apache Drill revolutionizes data exploration
and analytics by enabling analysts, business users, data scientists and developers to explore
and analyze this data without sacrificing the flexibility and agility offered by these datastores.
Drill processes the data in-situ without requiring users to define schemas or transform data.&lt;/p&gt;
+&lt;p&gt;&amp;quot;Drill introduces the JSON document model to the world of SQL-based
analytics and BI&amp;quot; said Jacques Nadeau, Vice President of Apache Drill. &amp;quot;This
enables users to query fixed-schema, evolving-schema and schema-free data stored in a variety
of formats and datastores. The architecture of relational query engines and databases is built
on the assumption that all data has a simple and static structure that’s known in advance,
and this 40-year-old assumption is simply no longer valid. We designed Drill from the ground
up to address the new reality.”&lt;/p&gt;
+&lt;p&gt;Apache Drill&amp;#39;s architecture is unique in many ways. It is the
only columnar execution engine that supports complex and schema-free data, and the only execution
engine that performs data-driven query compilation (and re-compilation, also known as schema
discovery) during query execution. These unique capabilities enable Drill to achieve record-breaking
performance with the flexibility offered by the JSON document model.&lt;/p&gt;
+&lt;p&gt;The business intelligence (BI) partner ecosystem is embracing the power
of Apache Drill. Organizations such as Information Builders, JReport (Jinfonet Software),
MicroStrategy, Qlik®, Simba, Tableau, and TIBCO, are working closely with the Drill community
to interoperate BI tools with Drill through standard ODBC and JDBC connectivity. This collaboration
enables end users to explore data by leveraging sophisticated visualization tools and advanced
+&lt;p&gt;&amp;quot;We&amp;#39;ve been using Apache Drill for the past six
months,&amp;quot; said Andrew Hamilton, CTO of Cardlytics. &amp;quot;Its ease of deployment
and use along with its ability to quickly process trillions of records has made it an invaluable
tool inside Cardlytics. Queries that were previously insurmountable are now common occurrence.
Congratulations to the Drill community on this momentous occasion.&amp;quot;&lt;/p&gt;
+&lt;p&gt;&amp;quot;Drill&amp;#39;s columnar execution engine and optimizer
take full advantage of Apache Parquet&amp;#39;s columnar storage to achieve maximum performance,&amp;quot;
said Julien Le Dem, Technical Lead of Data Processing at Twitter and Vice President of Apache
Parquet. &amp;quot;The Drill team has been a key contributor to the Parquet project, including
recent enhancements to Parquet types and vectorization. The Drill team’s involvement in
the Parquet community is instrumental in driving the standard.&amp;quot;&lt;/p&gt;
+&lt;p&gt;&amp;quot;Apache Drill 1.0 raises the bar for secure, reliable and scalable
SQL-on-Hadoop,&amp;quot; said Piyush Bhargava, distinguished engineer, IT, Cisco Systems.
&amp;quot;Because Drill integrates with existing data virtualization and visualization
tools, we expect it will improve adoption of self-service data exploration and large-scale
BI queries on our advanced Hadoop platform at Cisco.&amp;quot;&lt;/p&gt;
+&lt;p&gt;&amp;quot;MicroStrategy recognized early on the value of Apache Drill
and is one of the first analytic platforms to certify Drill,&amp;quot; said Tim Lang,
senior executive vice president and chief technology officer at MicroStrategy Incorporated.
 &amp;quot;Because Drill is designed to be used with a minimal learning curve, it opens
up more complex data sets to the end user who can immediately visualize and analyze new information
using MicroStrategy’s advanced capabilities.&amp;quot;&lt;/p&gt;
+&lt;p&gt;&amp;quot;Apache Drill closes a gap around self-service SQL queries
in Hadoop, especially on complex, dynamic NoSQL data types,&amp;quot; said Mike Foster,
Strategic Alliances Technology Officer at Qlik.  &amp;quot;Drill&amp;#39;s performance
advantages for Hadoop data access, combined with the Qlik associative experience, enables
our customers to continue discovering business value from a wide range of data. Congratulations
to the Apache Drill community.&amp;quot;&lt;/p&gt;
+&lt;p&gt;&amp;quot;Apache Drill empowers people to access data that is traditionally
difficult to work with,&amp;quot; said Jeff Feng, product manager, Tableau.  &amp;quot;Direct
access within a centralized data repository and without pre-generating metadata definitions
encourages data democracy which is essential for data-driven organizations. Additionally,
Drill&amp;#39;s instant and secure access to complex data formats, such as JSON, opens
up extended analytical opportunities.&amp;quot;&lt;/p&gt;
+&lt;p&gt;&amp;quot;Congratulations to the Apache Drill community on the availability
of 1.0,&amp;quot; said Karl Van den Bergh, Vice President, Products and Cloud at TIBCO.
&amp;quot;Drill promises to bring low-latency access to data stored in Hadoop and HBase
via standard SQL semantics. This innovation is in line with the value of Fast Data analysis,
which TIBCO customers welcome and appreciate.&amp;quot;&lt;/p&gt;
+&lt;p&gt;&amp;quot;The community&amp;#39;s accomplishment is a testament
to The Apache Software Foundation&amp;#39;s ability to bring together diverse companies
to work towards a common goal. None of this would have been possible without the contribution
of engineers with advanced degrees and experience in relational databases, data warehousing,
MPP, query optimization, Hadoop and NoSQL,&amp;quot; added Nadeau. &amp;quot;Our community&amp;#39;s
strength is what will solidify Apache Drill as a key data technology for the next decade.
We welcome interested individuals to learn more about Drill by joining the community&amp;#39;s
mailing lists, attending upcoming talks by Drill code committers at various conferences including
Hadoop Summit, NoSQL Now, Hadoop World, or at a local Apache Drill MeetUp.&amp;quot;&lt;/p&gt;
+&lt;p&gt;Availability and Oversight
+Apache Drill 1.0 is available immediately as a free download from &lt;a href=&quot;http://drill.apache.org/download/&quot;&gt;http://drill.apache.org/download/&lt;/a&gt;.
Documentation is available at &lt;a href=&quot;http://drill.apache.org/docs/&quot;&gt;http://drill.apache.org/docs/&lt;/a&gt;.
As with all Apache products, Apache Drill software is released under the Apache License v2.0,
and is overseen by a self-selected team of active contributors to the project. A Project Management
Committee (PMC) guides the project&amp;#39;s day-to-day operations, including community
development and product releases. For ways to become involved with Apache Drill, visit &lt;a
and @ApacheDrill on Twitter.&lt;/p&gt;
+&lt;p&gt;About The Apache Software Foundation (ASF)
+Established in 1999, the all-volunteer Foundation oversees more than 350 leading Open Source
projects, including Apache HTTP Server --the world&amp;#39;s most popular Web server software.
Through the ASF&amp;#39;s meritocratic process known as &amp;quot;The Apache Way,&amp;quot;
more than 500 individual Members and 4,500 Committers successfully collaborate to develop
freely available enterprise-grade software, benefiting millions of users worldwide: thousands
of software solutions are distributed under the Apache License; and the community actively
participates in ASF mailing lists, mentoring initiatives, and ApacheCon, the Foundation&amp;#39;s
official user conference, trainings, and expo. The ASF is a US 501(c)(3) charitable organization,
funded by individual donations and corporate sponsors including Bloomberg, Budget Direct,
Cerner, Citrix, Cloudera, Comcast, Facebook, Google, Hortonworks, HP, IBM, InMotion Hosting,
iSigma, Matt Mullenweg, Microsoft, Pivotal, Produban, WANdisco,
  and Yahoo. For more information, visit &lt;a href=&quot;http://www.apache.org/&quot;&gt;http://www.apache.org/&lt;/a&gt;
or follow @TheASF on Twitter.&lt;/p&gt;
+&lt;p&gt;© The Apache Software Foundation. &amp;quot;Apache&amp;quot;, &amp;quot;Apache
Drill&amp;quot;, &amp;quot;Drill&amp;quot;, &amp;quot;Apache Hadoop&amp;quot;,
&amp;quot;Hadoop&amp;quot;, &amp;quot;Apache Parquet&amp;quot;, &amp;quot;Parquet&amp;quot;,
and &amp;quot;ApacheCon&amp;quot;, are registered trademarks or trademarks of The
Apache Software Foundation. All other brands and trademarks are the property of their respective
+&lt;p&gt;# # #&lt;/p&gt;
+        <pubDate>Tue, 19 May 2015 00:00:00 -0700</pubDate>
+        <link>/blog/2015/05/19/the-apache-software-foundation-announces-apache-drill-1.0/</link>
+        <guid isPermaLink="true">/blog/2015/05/19/the-apache-software-foundation-announces-apache-drill-1.0/</guid>
+        <category>blog</category>
+      </item>
+      <item>
+        <title>Drill 1.0 Released</title>
+        <description>&lt;p&gt;We embarked on the Drill project in late 2012
with two primary objectives:&lt;/p&gt;
+&lt;li&gt;Enable agility by getting rid of all the traditional overhead - namely,
the need to load data, create and maintain schemas, transform data, etc. We wanted to develop
a system that would support the speed and agility at which modern organizations want (or need)
to operate in this era.&lt;/li&gt;
+&lt;li&gt;Unlock the data housed in non-relational datastores like NoSQL, Hadoop
and cloud storage, making it available not only to developers, but also business users, analysts,
data scientists and anyone else who can write a SQL query or use a BI tool. Non-relational
datastores are capturing an increasing share of the world&amp;#39;s data, and it&amp;#39;s
incredibly hard to explore and analyze this data.&lt;/li&gt;
+&lt;p&gt;Today we&amp;#39;re happy to announce the availability of the production-ready
Drill 1.0 release. This release addresses &lt;a href=&quot;https://issues.apache.org/jira/secure/ReleaseNote.jspa?projectId=12313820&amp;amp;version=12325568&quot;&gt;228
JIRAs&lt;/a&gt; on top of the 0.9 release earlier this month. Highlights include:&lt;/p&gt;
+&lt;li&gt;Substantial improvements in stability, memory handling and performance&lt;/li&gt;
+&lt;li&gt;Improvements in Drill CLI experience with addition of convenience shortcuts
and improved colors/alignment&lt;/li&gt;
+&lt;li&gt;Substantial additions to documentation including coverage of troubleshooting,
performance tuning and many additions to the SQL reference&lt;/li&gt;
+&lt;li&gt;Enhancements in join planning to facilitate high speed planning of large
and complicated joins&lt;/li&gt;
+&lt;li&gt;Add support for new context functions including &lt;code&gt;CURRENT_USER&lt;/code&gt;
and &lt;code&gt;CURRENT_SCHEMA&lt;/code&gt;&lt;/li&gt;
+&lt;li&gt;Ability to treat all numbers as approximate decimals when reading JSON&lt;/li&gt;
+&lt;li&gt;Enhancements in Drill&amp;#39;s text and CSV handling to support first
row skipping, configurable field/line delimiters and configurable quoting&lt;/li&gt;
+&lt;li&gt;Improved JDBC compatibility (and tracing proxy for easy debugging).&lt;/li&gt;
+&lt;li&gt;Ability to do JDBC connections with direct urls (avoiding ZooKeeper)&lt;/li&gt;
+&lt;li&gt;Automatic selection of spooling or back-pressure exchange semantics to
avoid distributed deadlocks in complex sort-heavy queries&lt;/li&gt;
+&lt;li&gt;Improvements in query profile reporting&lt;/li&gt;
+&lt;li&gt;Addition of &lt;code&gt;ILIKE(VARCHAR, PATTERN)&lt;/code&gt;
and &lt;code&gt;SUBSTR(VARCHAR, REGEX)&lt;/code&gt; functions&lt;/li&gt;
+&lt;p&gt;We would not have been able to reach this milestone without the tremendous
effort by all the &lt;a href=&quot;/team/&quot;&gt;committers&lt;/a&gt;
and contributors, and we would like to congratulate the entire community on achieving this
milestone. While 1.0 is an exciting milestone, it&amp;#39;s really just the beginning
of the journey. We&amp;#39;ll release 1.1 next month, and continue with our 4-6 week release
cycle, so you can count on many additional enhancements over the coming months.&lt;/p&gt;
+&lt;p&gt;Also be sure to check out the &lt;a href=&quot;/blog/2015/05/19/the-apache-software-foundation-announces-apache-drill-1.0/&quot;&gt;Apache
Software Foundation&amp;#39;s press release&lt;/a&gt;.&lt;/p&gt;
+&lt;p&gt;Happy Drilling!&lt;br&gt;
+Tomer Shiran and Jacques Nadeau&lt;/p&gt;
+        <pubDate>Tue, 19 May 2015 00:00:00 -0700</pubDate>
+        <link>/blog/2015/05/19/drill-1.0-released/</link>
+        <guid isPermaLink="true">/blog/2015/05/19/drill-1.0-released/</guid>
+        <category>blog</category>
+      </item>
+      <item>
         <title>Drill 0.9 Released</title>
         <description>&lt;p&gt;It has been about a month since the release of
Drill 0.8, which included &lt;a href=&quot;/blog/drill-0.8-released/&quot;&gt;more
than 240 improvements&lt;/a&gt;. Today we&amp;#39;re happy to announce the availability
of Drill 0.9, providing additional enhancements and bug fixes. In fact, this release includes
&lt;a href=&quot;https://issues.apache.org/jira/secure/ReleaseNote.jspa?projectId=12313820&amp;amp;version=12328813&quot;&gt;200
resolved JIRAs&lt;/a&gt;. Some of the noteworthy features in Drill 0.9 are:&lt;/p&gt;
@@ -406,141 +499,5 @@ Tomer Shiran&lt;/p&gt;
-      <item>
-        <title>Running SQL Queries on Amazon S3</title>
-        <description>&lt;p&gt;The functionality and sheer usefulness of Drill
is growing fast.  If you&amp;#39;re a user of some of the popular BI tools out there like
Tableau or SAP Lumira, now is a good time to take a look at how Drill can make your life easier,
especially if  you&amp;#39;re faced with the task of quickly getting a handle on large
sets of unstructured data.  With schema generated on the fly, you can save a lot of time and
headaches by running SQL queries on the data where it rests without knowing much about columns
or formats.  There&amp;#39;s even more good news:  Drill also works with data stored in
the cloud.  With a few simple steps, you can configure the S3 storage plugin for Drill and
be off to the races running queries.  In this post we&amp;#39;ll look at how to configure
Drill to access data stored in an S3 bucket.&lt;/p&gt;
-&lt;p&gt;If you&amp;#39;re more of a visual person, you can skip this article
entirely and &lt;a href=&quot;https://www.youtube.com/watch?v=w8gZ2nn_ZUQ&quot;&gt;go
straight to a video&lt;/a&gt; I put together that walks through an end-to-end example
with Tableau.  This example is easily extended to other BI tools, as the steps are identical
on the Drill side.&lt;/p&gt;
-&lt;p&gt;At a high level, configuring Drill to access S3 bucket data is accomplished
with the following steps on each node running a drillbit.&lt;/p&gt;
-&lt;li&gt;Download and install the &lt;a href=&quot;http://www.jets3t.org/&quot;&gt;JetS3t&lt;/a&gt;
JAR files and enable them.&lt;/li&gt;
-&lt;li&gt;Add your S3 credentials in the relevant XML configuration file.&lt;/li&gt;
-&lt;li&gt;Configure and enable the S3 storage plugin through the Drill web interface.&lt;/li&gt;
-&lt;li&gt;Connect your BI tool of choice and query away.&lt;/li&gt;
-&lt;p&gt;Consult the &lt;a href=&quot;https://cwiki.apache.org/confluence/display/DRILL/Architectural+Overview&quot;&gt;Architectural
Overview&lt;/a&gt; for a refresher on the architecture of Drill.&lt;/p&gt;
-&lt;h3 id=&quot;prerequisites&quot;&gt;Prerequisites&lt;/h3&gt;
-&lt;p&gt;These steps assume you have a &lt;a href=&quot;https://cwiki.apache.org/confluence/display/DRILL/Apache+Drill+in+10+Minutes&quot;&gt;typical
Drill cluster and ZooKeeper quorum&lt;/a&gt; configured and running.  To access data
in S3, you will need an S3 bucket configured and have the required Amazon security credentials
in your possession.  An &lt;a href=&quot;http://blogs.aws.amazon.com/security/post/Tx1R9KDN9ISZ0HF/Where-s-my-secret-access-key&quot;&gt;Amazon
blog post&lt;/a&gt; has more information on how to get these from your account.&lt;/p&gt;
-&lt;h3 id=&quot;configuration-steps&quot;&gt;Configuration Steps&lt;/h3&gt;
-&lt;p&gt;To connect Drill to S3, all of the drillbit nodes will need to access code
in the JetS3t library developed by Amazon.  As of this writing, 0.9.2 is the latest version
but you might want to check &lt;a href=&quot;https://jets3t.s3.amazonaws.com/toolkit/toolkit.html&quot;&gt;the
main page&lt;/a&gt; to see if anything has been updated.  Be sure to get version 0.9.2
or later as earlier versions have a bug relating to reading Parquet data.&lt;/p&gt;
-&lt;div class=&quot;highlight&quot;&gt;&lt;pre&gt;&lt;code class=&quot;language-bash&quot;
data-lang=&quot;bash&quot;&gt;wget http://bitbucket.org/jmurty/jets3t/downloads/jets3t-0.9.2.zip
-cp jets3t-0.9.2/jars/jets3t-0.9.2.jar &lt;span class=&quot;nv&quot;&gt;$DRILL_HOME&lt;/span&gt;/jars/3rdparty
-&lt;p&gt;Next, enable the plugin by editing the file:&lt;/p&gt;
-&lt;div class=&quot;highlight&quot;&gt;&lt;pre&gt;&lt;code class=&quot;language-bash&quot;
data-lang=&quot;bash&quot;&gt;&lt;span class=&quot;nv&quot;&gt;$DRILL_HOME&lt;/span&gt;/bin/hadoop_excludes.txt
-&lt;p&gt;and removing the line &lt;code&gt;jets3t&lt;/code&gt;.&lt;/p&gt;
-&lt;p&gt;Drill will need to know your S3 credentials in order to access data there.
These credentials will need to be placed in the core-site.xml file for your installation.
 If you already have a core-site.xml file configured for your environment, add the following
parameters to it, otherwise create the file from scratch.  If you do end up creating it from
scratch you will need to wrap these parameters with &lt;code&gt;&amp;lt;configuration&amp;gt;&lt;/code&gt;
and &lt;code&gt;&amp;lt;/configuration&amp;gt;&lt;/code&gt;.&lt;/p&gt;
-&lt;div class=&quot;highlight&quot;&gt;&lt;pre&gt;&lt;code class=&quot;language-xml&quot;
data-lang=&quot;xml&quot;&gt;&lt;span class=&quot;nt&quot;&gt;&amp;lt;property&amp;gt;&lt;/span&gt;
-  &lt;span class=&quot;nt&quot;&gt;&amp;lt;name&amp;gt;&lt;/span&gt;fs.s3.awsAccessKeyId&lt;span
-  &lt;span class=&quot;nt&quot;&gt;&amp;lt;value&amp;gt;&lt;/span&gt;ID&lt;span
-&lt;span class=&quot;nt&quot;&gt;&amp;lt;/property&amp;gt;&lt;/span&gt;
-&lt;span class=&quot;nt&quot;&gt;&amp;lt;property&amp;gt;&lt;/span&gt;
-  &lt;span class=&quot;nt&quot;&gt;&amp;lt;name&amp;gt;&lt;/span&gt;fs.s3.awsSecretAccessKey&lt;span
-  &lt;span class=&quot;nt&quot;&gt;&amp;lt;value&amp;gt;&lt;/span&gt;SECRET&lt;span
-&lt;span class=&quot;nt&quot;&gt;&amp;lt;/property&amp;gt;&lt;/span&gt;
-&lt;span class=&quot;nt&quot;&gt;&amp;lt;property&amp;gt;&lt;/span&gt;
-  &lt;span class=&quot;nt&quot;&gt;&amp;lt;name&amp;gt;&lt;/span&gt;fs.s3n.awsAccessKeyId&lt;span
-  &lt;span class=&quot;nt&quot;&gt;&amp;lt;value&amp;gt;&lt;/span&gt;ID&lt;span
-&lt;span class=&quot;nt&quot;&gt;&amp;lt;/property&amp;gt;&lt;/span&gt;
-&lt;span class=&quot;nt&quot;&gt;&amp;lt;property&amp;gt;&lt;/span&gt;
-  &lt;span class=&quot;nt&quot;&gt;&amp;lt;name&amp;gt;&lt;/span&gt;fs.s3n.awsSecretAccessKey&lt;span
-  &lt;span class=&quot;nt&quot;&gt;&amp;lt;value&amp;gt;&lt;/span&gt;SECRET&lt;span
-&lt;span class=&quot;nt&quot;&gt;&amp;lt;/property&amp;gt;&lt;/span&gt;
-&lt;p&gt;The steps so far give Drill enough information to connect to the S3 service.
 Remember, you have to do this on all the nodes running drillbit.&lt;/p&gt;
-&lt;p&gt;Next, let&amp;#39;s go into the Drill web interface and enable the S3
storage plugin.  In this case you only need to connect to &lt;strong&gt;one&lt;/strong&gt;
of the nodes because Drill&amp;#39;s configuration is synchronized across the cluster.
 Complete the following steps:&lt;/p&gt;
-&lt;li&gt;Point your browser to &lt;code&gt;http://&amp;lt;host&amp;gt;:8047&lt;/code&gt;&lt;/li&gt;
-&lt;li&gt;Select the &amp;#39;Storage&amp;#39; tab.&lt;/li&gt;
-&lt;li&gt;A good starting configuration for S3 can be entirely the same as the &lt;code&gt;dfs&lt;/code&gt;
plugin, except the connection parameter is changed to &lt;code&gt;s3://bucket&lt;/code&gt;.
 So first select the &lt;code&gt;Update&lt;/code&gt; button for &lt;code&gt;dfs&lt;/code&gt;,
then select the text area and copy it into the clipboard (on Windows, ctrl-A, ctrl-C works).&lt;/li&gt;
-&lt;li&gt;Press &lt;code&gt;Back&lt;/code&gt;, then create a new
plugin by typing the name into the &lt;code&gt;New Storage Plugin&lt;/code&gt;,
then press &lt;code&gt;Create&lt;/code&gt;.  You can choose any name, but
a good convention is to use &lt;code&gt;s3-&amp;lt;bucketname&amp;gt;&lt;/code&gt;
so you can easily identify it later.&lt;/li&gt;
-&lt;li&gt;In the configuration area, paste the configuration you just grabbed from
&amp;#39;dfs&amp;#39;.  Change the line &lt;code&gt;connection: &amp;quot;file:///&amp;quot;&lt;/code&gt;
to &lt;code&gt;connection: &amp;quot;s3://&amp;lt;bucket&amp;gt;&amp;quot;&lt;/code&gt;.&lt;/li&gt;
-&lt;li&gt;Click &lt;code&gt;Update&lt;/code&gt;.  You should see
a message that indicates success.&lt;/li&gt;
-&lt;p&gt;At this point you can run queries on the data directly and you have a couple
of options on how you want to access it.  You can use Drill Explorer and create a custom view
(based on an SQL query) that you can then access in Tableau or other BI tools, or just use
Drill directly from within the tool.&lt;/p&gt;
-&lt;p&gt;You may want to check out the &lt;a href=&quot;http://www.youtube.com/watch?v=jNUsprJNQUg&quot;&gt;Tableau
-&lt;p&gt;With just a few lines of configuration, you&amp;#39;ve just opened the
vast world of data available in the Amazon cloud and reduced the amount of work you have to
do in advance to access data stored there with SQL.  There are even some &lt;a href=&quot;https://aws.amazon.com/datasets&quot;&gt;public
datasets&lt;/a&gt; available directly on S3 that are great for experimentation.&lt;/p&gt;
-&lt;p&gt;Happy Drilling!&lt;/p&gt;
-        <pubDate>Tue, 09 Dec 2014 10:50:01 -0800</pubDate>
-        <link>/blog/2014/12/09/running-sql-queries-on-amazon-s3/</link>
-        <guid isPermaLink="true">/blog/2014/12/09/running-sql-queries-on-amazon-s3/</guid>
-        <category>blog</category>
-      </item>
-      <item>
-        <title>Apache Drill Graduates to a Top-Level Project</title>
-        <description>&lt;p&gt;The Apache Software Foundation has just announced
that it has promoted Drill to a top-level project at Apache, similar to other well-known projects
like Apache Hadoop and httpd (the world&amp;#39;s most popular Web server). This marks
a significant accomplishment for the Drill community, and I wanted to personally thank everyone
who has contributed to the project. It takes many people, and countless hours, to develop
something as complex and innovative as Drill.&lt;/p&gt;
-&lt;p&gt;In this post I wanted to reflect on the past and future of Drill.&lt;/p&gt;
-&lt;h2 id=&quot;why-we-started-drill&quot;&gt;Why We Started Drill&lt;/h2&gt;
-&lt;h3 id=&quot;the-evolution-of-application-development-and-data&quot;&gt;The
Evolution of Application Development and Data&lt;/h3&gt;
-&lt;p&gt;Over the last decade, organizations have been striving to become more agile
and data-driven, seeking to gain competitive advantage in their markets. This trend has led
to dramatic changes in the way applications are built and delivered, and in the type and volume
of data that is being leveraged.&lt;/p&gt;
-&lt;p&gt;&lt;strong&gt;Applications&lt;/strong&gt;: In previous decades,
software development was a carefully orchestrated and planned process. The release cycles
were often measured in years, and upgrades were infrequent. Today, Web and mobile applications
are developed in a much more iterative fashion. The release cycles are measured in days or
weeks, and upgrades are a non-issue. (What version of Salesforce.com or Google Maps are you
-&lt;p&gt;&lt;strong&gt;Data&lt;/strong&gt;: In previous decades,
data was measured in MBs or GBs, and it was highly structured and denormalized. Today&amp;#39;s
data is often measured in TBs or PBs, and it tends to be multi-structured — a combination
of unstructured, semi-structured and structured. The data comes from many different sources,
including a variety of applications, devices and services, and its structure changes much
more frequently.&lt;/p&gt;
-&lt;h3 id=&quot;a-new-generation-of-datastores&quot;&gt;A New Generation
of Datastores&lt;/h3&gt;
-&lt;p&gt;The relational database, which was invented in 1970, was not designed for
these new processes and data volumes and structures. As a result, a new generation of datastores
has emerged, including HDFS, NoSQL (HBase, MongoDB, etc.) and search (Elasticsearch, Solr).
 These systems are schema-free (also known as &amp;quot;dynamic schema&amp;quot;).
Applications, as opposed to DBAs, control the data structure, enabling more agility and flexibility.
For example, an application developer can independently evolve the data structure with each
application release (which could be daily or weekly) without filing a ticket with IT and waiting
for the schema of the databae to be modified.&lt;/p&gt;
-&lt;h2 id=&quot;the-need-for-a-new-query-engine&quot;&gt;The Need for a New
Query Engine&lt;/h2&gt;
-&lt;p&gt;With data increasingly being stored in schema-free datastores (HDFS, HBase,
MongoDB, etc.) and a variety of cloud services, users need a way to explore and analyze this
data, and a way to visualize it with BI tools (reports, dashboards, etc.). In 2012 we decided
to embark on a journey to create the world&amp;#39;s next-generation SQL engine. We had
several high-level requirements in mind:&lt;/p&gt;
-&lt;li&gt;&lt;strong&gt;A schema-free data model.&lt;/strong&gt;
Schema-free datastores (HDFS, NoSQL, search) need a schema-free SQL engine. These datastores
became popular for a reason, and we shouldn&amp;#39;t expect organizations to sacrifice
those advantages in order to enjoy SQL-based analytics and BI. Today&amp;#39;s organizations
need agility and flexibility to cope with the volume, variety and velocity associated with
modern applications and data.&lt;br&gt;&lt;/li&gt;
-&lt;li&gt;&lt;strong&gt;A standalone query engine that supports multiple
data sources.&lt;/strong&gt; Most companies now use a variety of best-of-breed datastores
and services to store data. This is true not just for large Global 2000 companies, but also
for small startups. For example, it is not uncommon for a startup to have data in MySQL, MongoDB,
HBase and HDFS, as well as a variety of online services. ETL was hard even 10 years ago when
data was static and 100x smaller than it is today, and in today&amp;#39;s era of Big Data
it&amp;#39;s often impractical or impossible to ingest all the data into a single system.&lt;/li&gt;
-&lt;li&gt;&lt;strong&gt;Ease of use.&lt;/strong&gt; The SQL engine
can&amp;#39;t be hard to setup and use. Analysts and developers should be able to download
and use it without deploying any complex infrastructure such as Hadoop.&lt;/li&gt;
-&lt;li&gt;&lt;strong&gt;Scalability and performance.&lt;/strong&gt;
The SQL engine must support interactive queries. It can&amp;#39;t be batch-oriented like
Hive. In addition, it must be able to scale linearly from a small laptop or virtual machine
to a large cluster with hundreds or thousands of powerful servers.&lt;/li&gt;
-&lt;p&gt;With these requirements in mind, we decided to incubate a new project in
2012 in the Apache Software Foundation so that a community of vendors and developers could
come together and develop the technology. (One little known fact is that the name &amp;quot;Drill&amp;quot;
was actually suggested by Google engineers due to its inspiration from Google&amp;#39;s
Dremel execution engine.)&lt;/p&gt;
-&lt;p&gt;After almost two years of research and development, we released Drill 0.4
in August, and continued with monthly releases since then.&lt;/p&gt;
-&lt;h2 id=&quot;what&amp;#39;s-next&quot;&gt;What&amp;#39;s Next&lt;/h2&gt;
-&lt;p&gt;Graduating to a top-level project is a significant milestone, but it&amp;#39;s
really just the beginning of the journey. In fact, we&amp;#39;re currently wrapping up
Drill 0.7, which includes hundreds of fixes and enhancements, and we expect to release that
in the next couple weeks.&lt;/p&gt;
-&lt;p&gt;Drill is currently being used by dozens of organizations, ranging from small
startups to some of the largest Fortune 100s. These organizations are already gaining tremendous
business value with Drill. As we march towards a 1.0 release early next year, these organizations
are helping us shape the project and ensure that it meets the needs of a broad range of organizations
as well as users (business analysts, technical analysts, data scientists and application developers).
I would like to encourage you to join the ride today by &lt;a href=&quot;http://drill.apache.org/download/&quot;&gt;downloading
Drill&lt;/a&gt; and &lt;a href=&quot;mailto:user@drill.apache.org&quot;&gt;letting
us know&lt;/a&gt; what you think.&lt;/p&gt;
-&lt;p&gt;Happy Drilling!&lt;br&gt;
-Tomer Shiran&lt;/p&gt;
-        <pubDate>Tue, 02 Dec 2014 08:00:00 -0800</pubDate>
-        <link>/blog/2014/12/02/drill-top-level-project/</link>
-        <guid isPermaLink="true">/blog/2014/12/02/drill-top-level-project/</guid>
-        <category>blog</category>
-      </item>

diff --git a/index.html b/index.html
index a1e2bdc..40d0b98 100644
--- a/index.html
+++ b/index.html
@@ -7,7 +7,7 @@
 <meta name=viewport content="width=device-width, initial-scale=1">
-<title>Apache Drill - Self-Service Data Exploration</title>
+<title>Apache Drill - Schema-free SQL Query Engine for Hadoop, NoSQL and Cloud Storage</title>
 <link href="//maxcdn.bootstrapcdn.com/font-awesome/4.3.0/css/font-awesome.min.css" rel="stylesheet"
 <link href="/css/site.css" rel="stylesheet" type="text/css"/>
@@ -135,7 +135,7 @@ $(document).ready(function() {
       this.href = url
     autoplay: true,
     autoplaySpeed: 5000,
@@ -163,7 +163,7 @@ $(document).ready(function() {
         <div class="slide"><a class="various fancybox.iframe" href="//www.youtube.com/watch?v=6pGeQOXDdD8"><img
src="/images/thumbnail-6pGeQOXDdD8.jpg" class="thumbnail" /><img src="/images/play-mq.png"
class="play" /></a><div class="title">High Performance with a JSON Data Model</div></div>
       <h1 class="main-headline">Apache Drill</h1>
-      <h2 id="sub-headline">Schema-free SQL Query Engine <br class="mobile-break"
/> for Hadoop, NoSQL and Cloud Storage</h2>
+      <h2 id="sub-headline">Schema-free SQL Query Engine <br class="mobile-break"
/>for Hadoop, NoSQL and <br class="mobile-break" />Cloud Storage</h2>
       <a href="/download/" class="download-headline btn btn-1 btn-1c"><span>DOWNLOAD
@@ -171,7 +171,7 @@ $(document).ready(function() {
 </div><!-- header -->
 <div class="alertbar">
-  <div class="news">News:</div><div><a href="/blog/2015/05/04/drill-0.9-released/">Drill
0.9 Released</a><br/><span>(Tomer Shiran, Jacques Nadeau)</span></div><div><a
href="/blog/2015/04/30/apache-parquet-graudates-to-a-top-level-project/">Apache Parquet
Graduates to a Top-Level Project</a><br/><span>(Tomer Shiran)</span></div>
+  <div class="news">News:</div><div><a href="/blog/2015/05/19/the-apache-software-foundation-announces-apache-drill-1.0/">The
ASF Announces Drill 1.0</a><br/><span>(Tomer Shiran)</span></div><div><a
href="/blog/2015/05/19/drill-1.0-released/">Drill 1.0 Released</a><br/><span>(Tomer
Shiran, Jacques Nadeau)</span></div>
 <div class="mw introWrapper">

View raw message