drill-commits mailing list archives

From tshi...@apache.org
Subject svn commit: r1651949 [2/13] - in /drill/site/trunk/content/drill: ./ blog/2014/11/19/sql-on-mongodb/ blog/2014/12/02/drill-top-level-project/ blog/2014/12/09/running-sql-queries-on-amazon-s3/ blog/2014/12/11/apache-drill-qa-panelist-spotlight/ blog/201...
Date Thu, 15 Jan 2015 05:11:48 GMT
Added: drill/site/trunk/content/drill/docs/analyzing-yelp-json-data-with-apache-drill/index.html
URL: http://svn.apache.org/viewvc/drill/site/trunk/content/drill/docs/analyzing-yelp-json-data-with-apache-drill/index.html?rev=1651949&view=auto
==============================================================================
--- drill/site/trunk/content/drill/docs/analyzing-yelp-json-data-with-apache-drill/index.html (added)
+++ drill/site/trunk/content/drill/docs/analyzing-yelp-json-data-with-apache-drill/index.html Thu Jan 15 05:11:44 2015
@@ -0,0 +1,468 @@
+<!DOCTYPE html>
+<html>
+
+<head>
+
+<meta charset="UTF-8">
+
+
+<title>Analyzing Yelp JSON Data with Apache Drill - Apache Drill</title>
+
+<link href="/css/syntax.css" rel="stylesheet" type="text/css">
+<link href="/css/style.css" rel="stylesheet" type="text/css">
+<link href="/css/arrows.css" rel="stylesheet" type="text/css">
+<link href="/css/button.css" rel="stylesheet" type="text/css">
+
+<link rel="shortcut icon" href="/favicon.ico" type="image/x-icon">
+<link rel="icon" href="/favicon.ico" type="image/x-icon">
+
+<script language="javascript" type="text/javascript" src="/js/lib/jquery-1.11.1.min.js"></script>
+<script language="javascript" type="text/javascript" src="/js/lib/jquery.easing.1.3.js"></script>
+<script language="javascript" type="text/javascript" src="/js/modernizr.custom.js"></script>
+<script language="javascript" type="text/javascript" src="/js/script.js"></script>
+
+</head>
+
+<body onResize="resized();">
+
+<div class="bui"></div>
+
+<div id="search">
+<input type="text" placeholder="Enter search term here">
+</div>
+
+<div id="menu" class="mw">
+<ul>
+  <li class="logo"><a href="/"></a></li>
+  <li>
+    <a href="/overview/">Documentation</a>
+    <ul>
+      <li><a href="/overview/">Overview&nbsp;&nbsp;&nbsp;&nbsp;</a></li>
+      <li><a href="https://cwiki.apache.org/confluence/display/DRILL/Apache+Drill+in+10+Minutes" target="_blank">Drill in 10 Minutes</a></li>
+      <li><a href="/why/">Why Drill? &nbsp;&nbsp;&nbsp;&nbsp;</a></li>
+      <li><a href="/architecture/">Architecture</a></li>
+    </ul>
+  </li>
+  <li>
+    <a href="/community/">Community</a>
+    <ul>
+      <li><a href="/team/">Team</a></li>
+      <li><a href="/community/#events">Events and Meetups</a></li>
+      <li><a href="/community/#mailinglists">Mailing Lists</a></li>
+      <li><a href="/community/#getinvolved">Get Involved</a></li>
+      <li><a href="https://issues.apache.org/jira/browse/DRILL/" target="_blank">Issue Tracker</a></li>
+      <li><a href="https://github.com/apache/drill" target="_blank">GitHub</a></li>
+    </ul>
+  </li>
+  <li><a href="/faq/">FAQ</a></li>
+  <li><a href="/blog/">Blog</a></li>
+  <li style="width:30px; padding-left: 2px; padding-right:10px"><a href="https://twitter.com/apachedrill" target="_blank"><img src="/images/twitterbw.png" alt="" align="center" width="22" style="padding: 0px 10px 1px 0px;"></a> </li>
+  <li class="l"><span>&nbsp;</span></li>
+  <li class="d"><a href="/download/">Download</a></li>
+</ul>
+</div>
+
+<div class="int_title">
+<h1>Analyzing Yelp JSON Data with Apache Drill</h1>
+
+</div>
+
+<div class="int_text" align="left"><p><a href="https://www.mapr.com/products/apache-drill">Apache Drill</a> is one of the
+fastest growing open source projects, with the community making rapid progress
+through monthly releases. What sets Drill apart is its agility and flexibility.
+Along with meeting the table stakes for SQL-on-Hadoop, namely low-latency
+performance at scale, Drill allows users to analyze data without any ETL or
+up-front schema definitions. The data can be in a file format such as text,
+JSON, or Parquet. It can contain simple types such as strings, integers, and
+dates, or more complex multi-structured data such as nested maps and arrays.
+Data can reside in any file system, local or distributed, such as HDFS,
+<a href="https://www.mapr.com/blog/comparing-mapr-fs-and-hdfs-nfs-and-snapshots">MapR FS</a>, or S3. Drill has a “no schema” approach, which enables you to get
+value from your data in just a few minutes.</p>
+
+<p>Let’s quickly walk through the steps required to install Drill and run it
+against the Yelp data set. The publicly available data set used for this
+example is downloadable from <a href="http://www.yelp.com/dataset_challenge">Yelp</a>
+(business reviews) and is in JSON format.</p>
+
+<h2 id="installing-and-starting-drill">Installing and Starting Drill</h2>
+
+<h3 id="step-1:-download-apache-drill-onto-your-local-machine">Step 1: Download Apache Drill onto your local machine</h3>
+
+<p><a href="http://incubator.apache.org/drill/download/">http://incubator.apache.org/drill/download/</a></p>
+
+<p>You can also <a href="https://cwiki.apache.org/confluence/display/DRILL/Deploying+Apache+Drill+in+a+Clustered+Environment">deploy Drill in clustered mode</a> if you
+want to scale your environment.</p>
+
+<h3 id="step-2-:-open-the-drill-tar-file">Step 2: Open the Drill tar file</h3>
+
+<p><code>tar -xvf apache-drill-0.6.0-incubating.tar</code></p>
+
+<h3 id="step-3:-launch-sqlline,-a-jdbc-application-that-ships-with-drill">Step 3: Launch sqlline, a JDBC application that ships with Drill</h3>
+
+<p><code>bin/sqlline -u jdbc:drill:zk=local</code></p>
+
+<p>That’s it! You are now ready to explore the data.</p>
+
+<p>Let’s try out some SQL examples to see how Drill makes raw data analysis
+easy.</p>
+
+<p><strong>Note</strong>: You need to substitute your local path to the Yelp data set in the FROM clause of each query you run.</p>
+
+<h2 id="querying-data-with-drill">Querying Data with Drill</h2>
+
+<h3 id="1.-view-the-contents-of-the-yelp-business-data"><strong>1. View the contents of the Yelp business data</strong></h3>
+
+<p><code>0: jdbc:drill:zk=local&gt; !set maxwidth 10000</code></p>
+
+<p><code>0: jdbc:drill:zk=local&gt; select * from
+dfs.`/users/nrentachintala/Downloads/yelp/yelp_academic_dataset_business.json`
+limit 1;</code></p>
+<div class="highlight"><pre><code class="language-text" data-lang="text">+-------------+--------------+------------+------------+------------+------------+--------------+------------+------------+------------+------------+------------+------------+------------+---------------+
+| business_id | full_address |   hours    |     open    | categories |            city    | review_count |        name   | longitude  |   state  |   stars          |  latitude  | attributes |          type    | neighborhoods |
++-------------+--------------+------------+------------+------------+------------+--------------+------------+------------+------------+------------+------------+------------+------------+---------------+
+| vcNAWiLM4dR7D2nwwJ7nCA | 4840 E Indian School Rd
+Ste 101
+Phoenix, AZ 85018 | {&quot;Tuesday&quot;:{&quot;close&quot;:&quot;17:00&quot;,&quot;open&quot;:&quot;08:00&quot;},&quot;Friday&quot;:{&quot;close&quot;:&quot;17:00&quot;,&quot;open&quot;:&quot;08:00&quot;},&quot;Monday&quot;:{&quot;close&quot;:&quot;17:00&quot;,&quot;open&quot;:&quot;08:00&quot;},&quot;Wednesday&quot;:{&quot;close&quot;:&quot;17:00&quot;,&quot;open&quot;:&quot;08:00&quot;},&quot;Thursday&quot;:{&quot;close&quot;:&quot;17:00&quot;,&quot;open&quot;:&quot;08:00&quot;},&quot;Sunday&quot;:{},&quot;Saturday&quot;:{}} | true              | [&quot;Doctors&quot;,&quot;Health &amp; Medical&quot;] | Phoenix  | 7                   | Eric Goldberg, MD | -111.983758 | AZ       | 3.5                | 33.499313  | {&quot;By Appointment Only&quot;:true,&quot;Good For&quot;:{},&quot;Ambience&quot;:{},&quot;Parking&quot;:{},&quot;Music&quot;:{},&quot;Hair Types Specialized In&quot;:{},&quot;Payment Types&quot;:{},&quot;Dietary Restrictions&quot;:{}} | business   | []                  |
++-------------+--------------+------------+------------+------------+------------+--------------+------------+------------+------------+------------+------------+------------+------------+---------------+
+</code></pre></div>
+<p><strong>Note</strong>: You can directly query self-describing files such as JSON, Parquet, and text. There is no need to create metadata definitions in the Hive metastore.</p>
+
+<h3 id="2.-explore-the-business-data-set-further"><strong>2. Explore the business data set further</strong></h3>
+
+<h4 id="total-reviews-in-the-data-set">Total reviews in the data set</h4>
+
+<p><code>0: jdbc:drill:zk=local&gt; select sum(review_count) as totalreviews from
+dfs.`/users/nrentachintala/Downloads/yelp/yelp_academic_dataset_business.json`
+;</code></p>
+<div class="highlight"><pre><code class="language-text" data-lang="text">+--------------+
+| totalreviews |
++--------------+
+| 1236445      |
++--------------+
+</code></pre></div>
+<h4 id="top-states-and-cities-in-total-number-of-reviews">Top states and cities in total number of reviews</h4>
+
+<p><code>0: jdbc:drill:zk=local&gt; select state, city, count(*) totalreviews from
+dfs.`/users/nrentachintala/Downloads/yelp/yelp_academic_dataset_business.json`
+group by state, city order by count(*) desc limit 10;</code></p>
+<div class="highlight"><pre><code class="language-text" data-lang="text">+------------+------------+--------------+
+|   state    |    city    | totalreviews |
++------------+------------+--------------+
+| NV         | Las Vegas  | 12021        |
+| AZ         | Phoenix    | 7499         |
+| AZ         | Scottsdale | 3605         |
+| EDH        | Edinburgh  | 2804         |
+| AZ         | Mesa       | 2041         |
+| AZ         | Tempe      | 2025         |
+| NV         | Henderson  | 1914         |
+| AZ         | Chandler   | 1637         |
+| WI         | Madison    | 1630         |
+| AZ         | Glendale   | 1196         |
++------------+------------+--------------+
+</code></pre></div>
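+<p>To make the aggregation above concrete, here is a minimal Python sketch of the same grouping. The records are invented for illustration (only the field names mirror the Yelp business file), and the helper names are hypothetical. Note that <code>count(*)</code> counts the business records in each group, so despite the <code>totalreviews</code> alias the figure is a number of businesses; summing <code>review_count</code> would total the reviews themselves. The sketch computes both side by side.</p>

```python
from collections import defaultdict

# Hypothetical sample records; only the field names mirror the Yelp business file.
BUSINESSES = [
    {"state": "NV", "city": "Las Vegas", "review_count": 40},
    {"state": "NV", "city": "Las Vegas", "review_count": 10},
    {"state": "AZ", "city": "Phoenix", "review_count": 7},
]


def group_by_state_city(records):
    """Return {(state, city): (business_count, review_total)}."""
    groups = defaultdict(lambda: [0, 0])
    for rec in records:
        key = (rec["state"], rec["city"])
        groups[key][0] += 1                    # what count(*) computes
        groups[key][1] += rec["review_count"]  # what sum(review_count) would compute
    return {key: tuple(vals) for key, vals in groups.items()}


def top_groups(records, limit=10):
    """Order groups by business count, descending, then keep the first `limit`."""
    grouped = group_by_state_city(records)
    return sorted(grouped.items(), key=lambda kv: kv[1][0], reverse=True)[:limit]


if __name__ == "__main__":
    for (state, city), (n_businesses, n_reviews) in top_groups(BUSINESSES):
        print(state, city, n_businesses, n_reviews)
```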
+<h4 id="average-number-of-reviews-per-business-star-rating"><strong>Average number of reviews per business star rating</strong></h4>
+
+<p><code>0: jdbc:drill:zk=local&gt; select stars,trunc(avg(review_count)) reviewsavg from
+dfs.`/users/nrentachintala/Downloads/yelp/yelp_academic_dataset_business.json`
+group by stars order by stars desc;</code></p>
+<div class="highlight"><pre><code class="language-text" data-lang="text">+------------+------------+
+|   stars    | reviewsavg |
++------------+------------+
+| 5.0        | 8.0        |
+| 4.5        | 28.0       |
+| 4.0        | 48.0       |
+| 3.5        | 35.0       |
+| 3.0        | 26.0       |
+| 2.5        | 16.0       |
+| 2.0        | 11.0       |
+| 1.5        | 9.0        |
+| 1.0        | 4.0        |
++------------+------------+
+</code></pre></div>
+<h4 id="top-businesses-with-high-review-counts-(&gt;-1000)"><strong>Top businesses with high review counts (&gt; 1000)</strong></h4>
+
+<p><code>0: jdbc:drill:zk=local&gt; select name, state, city, `review_count` from
+dfs.`/users/nrentachintala/Downloads/yelp/yelp_academic_dataset_business.json`
+where review_count &gt; 1000 order by `review_count` desc limit 10;</code></p>
+<div class="highlight"><pre><code class="language-text" data-lang="text">+------------+------------+------------+----------------------------+
+|    name                |   state     |    city     | review_count |
++------------+------------+------------+----------------------------+
+| Mon Ami Gabi           | NV          | Las Vegas  | 4084          |
+| Earl of Sandwich       | NV          | Las Vegas  | 3655          |
+| Wicked Spoon           | NV          | Las Vegas  | 3408          |
+| The Buffet             | NV          | Las Vegas  | 2791          |
+| Serendipity 3          | NV          | Las Vegas  | 2682          |
+| Bouchon                | NV          | Las Vegas  | 2419          |
+| The Buffet at Bellagio | NV          | Las Vegas  | 2404          |
+| Bacchanal Buffet       | NV          | Las Vegas  | 2369          |
+| The Cosmopolitan of Las Vegas | NV   | Las Vegas  | 2253          |
+| Aria Hotel &amp; Casino    | NV          | Las Vegas  | 2224          |
++------------+------------+------------+----------------------------+
+</code></pre></div>
+<h4 id="saturday-open-and-close-times-for-a-few-businesses"><strong>Saturday open and close times for a few businesses</strong></h4>
+
+<p><code>0: jdbc:drill:zk=local&gt; select b.name, b.hours.Saturday.`open`,
+b.hours.Saturday.`close`  
+from
+dfs.`/users/nrentachintala/Downloads/yelp/yelp_academic_dataset_business.json`
+b limit 10;</code></p>
+<div class="highlight"><pre><code class="language-text" data-lang="text">+------------+------------+----------------------------+
+|    name                    |   EXPR$1   |   EXPR$2   |
++------------+------------+----------------------------+
+| Eric Goldberg, MD          | 08:00      | 17:00      |
+| Pine Cone Restaurant       | null       | null       |
+| Deforest Family Restaurant | 06:00      | 22:00      |
+| Culver&#39;s                   | 10:30      | 22:00      |
+| Chang Jiang Chinese Kitchen| 11:00      | 22:00      |
+| Charter Communications     | null       | null       |
+| Air Quality Systems        | null       | null       |
+| McFarland Public Library   | 09:00      | 20:00      |
+| Green Lantern Restaurant   | 06:00      | 02:00      |
+| Spartan Animal Hospital    | 07:30      | 18:00      |
++------------+------------+----------------------------+
+</code></pre></div>
+<p>Note how Drill can traverse and reference multiple levels of nesting.</p>
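+<p>Conceptually, a dotted path such as <code>b.hours.Saturday.`open`</code> descends one map level per dot and produces null where a level is absent. The Python sketch below is an analogy only, not Drill's implementation: the records are invented and <code>get_path</code> is a hypothetical helper.</p>

```python
def get_path(record, *keys):
    """Walk nested dicts one key at a time; return None if any level is missing."""
    current = record
    for key in keys:
        if not isinstance(current, dict) or key not in current:
            return None
        current = current[key]
    return current


# Hypothetical records shaped like the hours column shown earlier.
BUSINESSES = [
    {"name": "Open Saturdays",
     "hours": {"Saturday": {"open": "06:00", "close": "22:00"}}},
    {"name": "Closed Saturdays", "hours": {"Saturday": {}}},
]

if __name__ == "__main__":
    for biz in BUSINESSES:
        print(biz["name"],
              get_path(biz, "hours", "Saturday", "open"),
              get_path(biz, "hours", "Saturday", "close"))
```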
+
+<h3 id="3.-get-the-amenities-of-each-business-in-the-data-set"><strong>3. Get the amenities of each business in the data set</strong></h3>
+
+<p>Note that the attributes column in the Yelp business data set holds a
+different set of elements in each row, because each business can offer
+different amenities. Drill makes it easy to quickly access data sets with
+changing schemas.</p>
+
+<p>First, change Drill to work in all text mode (so we can take a look at all of
+the data).</p>
+<div class="highlight"><pre><code class="language-text" data-lang="text">0: jdbc:drill:zk=local&gt; alter system set `store.json.all_text_mode` = true;
++------------+-----------------------------------+
+|     ok     |  summary                          |
++------------+-----------------------------------+
+| true       | store.json.all_text_mode updated. |
++------------+-----------------------------------+
+</code></pre></div>
+<p>Then, query the attributes data.</p>
+<div class="highlight"><pre><code class="language-text" data-lang="text">0: jdbc:drill:zk=local&gt; select attributes from dfs.`/users/nrentachintala/Downloads/yelp/yelp_academic_dataset_business.json` limit 10;
++----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------+
+| attributes                                                                                                                                                                       |
++----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------+
+| {&quot;By Appointment Only&quot;:&quot;true&quot;,&quot;Good For&quot;:{},&quot;Ambience&quot;:{},&quot;Parking&quot;:{},&quot;Music&quot;:{},&quot;Hair Types Specialized In&quot;:{},&quot;Payment Types&quot;:{},&quot;Dietary Restrictions&quot;:{}} |
+| {&quot;Take-out&quot;:&quot;true&quot;,&quot;Good For&quot;:{&quot;dessert&quot;:&quot;false&quot;,&quot;latenight&quot;:&quot;false&quot;,&quot;lunch&quot;:&quot;true&quot;,&quot;dinner&quot;:&quot;false&quot;,&quot;breakfast&quot;:&quot;false&quot;,&quot;brunch&quot;:&quot;false&quot;},&quot;Caters&quot;:&quot;false&quot;,&quot;Noise Level&quot;:&quot;averag |
+| {&quot;Take-out&quot;:&quot;true&quot;,&quot;Good For&quot;:{&quot;dessert&quot;:&quot;false&quot;,&quot;latenight&quot;:&quot;false&quot;,&quot;lunch&quot;:&quot;false&quot;,&quot;dinner&quot;:&quot;false&quot;,&quot;breakfast&quot;:&quot;false&quot;,&quot;brunch&quot;:&quot;true&quot;},&quot;Caters&quot;:&quot;false&quot;,&quot;Noise Level&quot;:&quot;quiet&quot; |
+| {&quot;Take-out&quot;:&quot;true&quot;,&quot;Good For&quot;:{},&quot;Takes Reservations&quot;:&quot;false&quot;,&quot;Delivery&quot;:&quot;false&quot;,&quot;Ambience&quot;:{},&quot;Parking&quot;:{&quot;garage&quot;:&quot;false&quot;,&quot;street&quot;:&quot;false&quot;,&quot;validated&quot;:&quot;false&quot;,&quot;lot&quot;:&quot;true&quot;,&quot;val |
+| {&quot;Take-out&quot;:&quot;true&quot;,&quot;Good For&quot;:{},&quot;Ambience&quot;:{},&quot;Parking&quot;:{},&quot;Has TV&quot;:&quot;false&quot;,&quot;Outdoor Seating&quot;:&quot;false&quot;,&quot;Attire&quot;:&quot;casual&quot;,&quot;Music&quot;:{},&quot;Hair Types Specialized In&quot;:{},&quot;Payment Types |
+| {&quot;Good For&quot;:{},&quot;Ambience&quot;:{},&quot;Parking&quot;:{},&quot;Music&quot;:{},&quot;Hair Types Specialized In&quot;:{},&quot;Payment Types&quot;:{},&quot;Dietary Restrictions&quot;:{}} |
+| {&quot;Good For&quot;:{},&quot;Ambience&quot;:{},&quot;Parking&quot;:{},&quot;Music&quot;:{},&quot;Hair Types Specialized In&quot;:{},&quot;Payment Types&quot;:{},&quot;Dietary Restrictions&quot;:{}} |
+| {&quot;Good For&quot;:{},&quot;Ambience&quot;:{},&quot;Parking&quot;:{},&quot;Wi-Fi&quot;:&quot;free&quot;,&quot;Music&quot;:{},&quot;Hair Types Specialized In&quot;:{},&quot;Payment Types&quot;:{},&quot;Dietary Restrictions&quot;:{}} |
+| {&quot;Take-out&quot;:&quot;true&quot;,&quot;Good For&quot;:{&quot;dessert&quot;:&quot;false&quot;,&quot;latenight&quot;:&quot;false&quot;,&quot;lunch&quot;:&quot;false&quot;,&quot;dinner&quot;:&quot;true&quot;,&quot;breakfast&quot;:&quot;false&quot;,&quot;brunch&quot;:&quot;false&quot;},&quot;Noise Level&quot;:&quot;average&quot;,&quot;Takes Reserva |
+| {&quot;Good For&quot;:{},&quot;Ambience&quot;:{},&quot;Parking&quot;:{},&quot;Music&quot;:{},&quot;Hair Types Specialized In&quot;:{},&quot;Payment Types&quot;:{},&quot;Dietary Restrictions&quot;:{}} |
++------------+
+</code></pre></div>
+<p>Turn off the all text mode so we can continue to perform arithmetic operations
+on data.</p>
+<div class="highlight"><pre><code class="language-text" data-lang="text">0: jdbc:drill:zk=local&gt; alter system set `store.json.all_text_mode` = false;
++------------+------------+
+|     ok             |  summary   |
++------------+------------+
+| true              | store.json.all_text_mode updated. |
++------------+------------+
+</code></pre></div>
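+<p>The effect of all text mode can be pictured as reading every scalar as a string while leaving maps and arrays in place, which is why <code>true</code> appears as <code>"true"</code> in the output above. The following Python sketch illustrates that idea under this assumption; <code>to_all_text</code> is a hypothetical helper, not part of Drill.</p>

```python
import json


def to_all_text(value):
    """Recursively turn every scalar into a string, keeping maps and lists."""
    if isinstance(value, dict):
        return {key: to_all_text(item) for key, item in value.items()}
    if isinstance(value, list):
        return [to_all_text(item) for item in value]
    if isinstance(value, str):
        return value
    # json.dumps renders True as "true" and 3.5 as "3.5"
    return json.dumps(value)


if __name__ == "__main__":
    attrs = json.loads('{"By Appointment Only": true, "Good For": {}}')
    print(json.dumps(to_all_text(attrs)))
```

+<p>Once values are text, arithmetic such as <code>sum(review_count)</code> no longer applies directly, which is why the setting is switched back off before continuing.</p>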
+<p><strong>4. Explore the restaurant businesses in the data set</strong></p>
+
+<h4 id="number-of-restaurants-in-the-data-set**-**"><strong>Number of restaurants in the data set</strong></h4>
+<div class="highlight"><pre><code class="language-text" data-lang="text">0: jdbc:drill:zk=local&gt; select count(*) as TotalRestaurants from dfs.`/users/nrentachintala/Downloads/yelp/yelp_academic_dataset_business.json` where true=repeated_contains(categories,&#39;Restaurants&#39;);
++------------------+
+| TotalRestaurants |
++------------------+
+| 14303            |
++------------------+
+</code></pre></div>
+<h4 id="top-restaurants-in-number-of-reviews"><strong>Top restaurants in number of reviews</strong></h4>
+<div class="highlight"><pre><code class="language-text" data-lang="text">0: jdbc:drill:zk=local&gt; select name,state,city,`review_count` from dfs.`/users/nrentachintala/Downloads/yelp/yelp_academic_dataset_business.json` where true=repeated_contains(categories,&#39;Restaurants&#39;) order by `review_count` desc limit 10
+. . . . . . . . . . . &gt; ;
++------------+------------+------------+--------------+
+|    name         |   state    |    city     | review_count |
++------------+------------+------------+--------------+
+| Mon Ami Gabi | NV               | Las Vegas  | 4084         |
+| Earl of Sandwich | NV         | Las Vegas  | 3655         |
+| Wicked Spoon | NV             | Las Vegas  | 3408         |
+| The Buffet | NV       | Las Vegas  | 2791         |
+| Serendipity 3 | NV              | Las Vegas  | 2682         |
+| Bouchon       | NV         | Las Vegas  | 2419           |
+| The Buffet at Bellagio | NV             | Las Vegas  | 2404         |
+| Bacchanal Buffet | NV        | Las Vegas  | 2369         |
+| Hash House A Go Go | NV                | Las Vegas  | 2201         |
+| Mesa Grill | NV         | Las Vegas  | 2004         |
++------------+------------+------------+--------------+
+</code></pre></div>
+<p><strong>Top restaurants in number of listed categories</strong></p>
+<div class="highlight"><pre><code class="language-text" data-lang="text">0: jdbc:drill:zk=local&gt; select name,repeated_count(categories) as categorycount, categories from dfs.`/users/nrentachintala/Downloads/yelp/yelp_academic_dataset_business.json` where true=repeated_contains(categories,&#39;Restaurants&#39;) order by repeated_count(categories) desc limit 10;
++------------+---------------+------------+
+|    name         | categorycount | categories |
++------------+---------------+------------+
+| Binion&#39;s Hotel &amp; Casino | 10           | [&quot;Arts &amp; Entertainment&quot;,&quot;Restaurants&quot;,&quot;Bars&quot;,&quot;Casinos&quot;,&quot;Event Planning &amp; Services&quot;,&quot;Lounges&quot;,&quot;Nightlife&quot;,&quot;Hotels &amp; Travel&quot;,&quot;American (N |
+| Stage Deli | 10        | [&quot;Arts &amp; Entertainment&quot;,&quot;Food&quot;,&quot;Hotels&quot;,&quot;Desserts&quot;,&quot;Delis&quot;,&quot;Casinos&quot;,&quot;Sandwiches&quot;,&quot;Hotels &amp; Travel&quot;,&quot;Restaurants&quot;,&quot;Event Planning &amp; Services&quot;] |
+| Jillian&#39;s  | 9               | [&quot;Arts &amp; Entertainment&quot;,&quot;American (Traditional)&quot;,&quot;Music Venues&quot;,&quot;Bars&quot;,&quot;Dance Clubs&quot;,&quot;Nightlife&quot;,&quot;Bowling&quot;,&quot;Active Life&quot;,&quot;Restaurants&quot;] |
+| Hotel Chocolat | 9               | [&quot;Coffee &amp; Tea&quot;,&quot;Food&quot;,&quot;Cafes&quot;,&quot;Chocolatiers &amp; Shops&quot;,&quot;Specialty Food&quot;,&quot;Event Planning &amp; Services&quot;,&quot;Hotels &amp; Travel&quot;,&quot;Hotels&quot;,&quot;Restaurants&quot;] |
+| Hotel du Vin &amp; Bistro Edinburgh | 9           | [&quot;Modern European&quot;,&quot;Bars&quot;,&quot;French&quot;,&quot;Wine Bars&quot;,&quot;Event Planning &amp; Services&quot;,&quot;Nightlife&quot;,&quot;Hotels &amp; Travel&quot;,&quot;Hotels&quot;,&quot;Restaurants&quot; |
+| Elixir             | 9             | [&quot;Arts &amp; Entertainment&quot;,&quot;American (Traditional)&quot;,&quot;Music Venues&quot;,&quot;Bars&quot;,&quot;Cocktail Bars&quot;,&quot;Nightlife&quot;,&quot;American (New)&quot;,&quot;Local Flavor&quot;,&quot;Restaurants&quot;] |
+| Tocasierra Spa and Fitness | 8                  | [&quot;Beauty &amp; Spas&quot;,&quot;Gyms&quot;,&quot;Medical Spas&quot;,&quot;Health &amp; Medical&quot;,&quot;Fitness &amp; Instruction&quot;,&quot;Active Life&quot;,&quot;Day Spas&quot;,&quot;Restaurants&quot;] |
+| Costa Del Sol At Sunset Station | 8            | [&quot;Steakhouses&quot;,&quot;Mexican&quot;,&quot;Seafood&quot;,&quot;Event Planning &amp; Services&quot;,&quot;Hotels &amp; Travel&quot;,&quot;Italian&quot;,&quot;Restaurants&quot;,&quot;Hotels&quot;] |
+| Scottsdale Silverado Golf Club | 8              | [&quot;Fashion&quot;,&quot;Shopping&quot;,&quot;Sporting Goods&quot;,&quot;Active Life&quot;,&quot;Golf&quot;,&quot;American (New)&quot;,&quot;Sports Wear&quot;,&quot;Restaurants&quot;] |
+| House of Blues | 8               | [&quot;Arts &amp; Entertainment&quot;,&quot;Music Venues&quot;,&quot;Restaurants&quot;,&quot;Hotels&quot;,&quot;Event Planning &amp; Services&quot;,&quot;Hotels &amp; Travel&quot;,&quot;American (New)&quot;,&quot;Nightlife&quot;] |
++------------+---------------+------------+
+</code></pre></div>
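+<p>The restaurant queries above rely on two array functions: <code>repeated_contains</code> to test whether an array holds a value, and <code>repeated_count</code> to get the number of elements. A minimal Python sketch of how they are used here, with invented records and plain exact matching assumed for the containment test:</p>

```python
def repeated_contains(values, wanted):
    """True when the list holds the wanted element (exact match assumed)."""
    return wanted in values


def repeated_count(values):
    """Number of elements in the list."""
    return len(values)


# Hypothetical records; only the field names mirror the Yelp business file.
BUSINESSES = [
    {"name": "Diner", "categories": ["Restaurants", "Food"]},
    {"name": "Casino Grill", "categories": ["Casinos", "Restaurants", "Bars"]},
    {"name": "Library", "categories": ["Libraries"]},
]


def restaurants_by_category_count(records, limit=10):
    """Keep restaurants, then order them by how many categories they list."""
    matches = [r for r in records
               if repeated_contains(r["categories"], "Restaurants")]
    matches.sort(key=lambda r: repeated_count(r["categories"]), reverse=True)
    return [(r["name"], repeated_count(r["categories"])) for r in matches[:limit]]


if __name__ == "__main__":
    for name, category_count in restaurants_by_category_count(BUSINESSES):
        print(name, category_count)
```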
+<h4 id="top-first-categories-in-number-of-review-counts"><strong>Top first categories in number of review counts</strong></h4>
+<div class="highlight"><pre><code class="language-text" data-lang="text">0: jdbc:drill:zk=local&gt; select categories[0], count(categories[0]) as categorycount from dfs.`/users/nrentachintala/Downloads/yelp_dataset_challenge_academic_dataset/yelp_academic_dataset_business.json` group by categories[0] 
+order by count(categories[0]) desc limit 10;
++------------+---------------+
+|   EXPR$0   | categorycount |
++------------+---------------+
+| Food       | 4294          |
+| Shopping   | 1885          |
+| Active Life | 1676          |
+| Bars       | 1366          |
+| Local Services | 1351          |
+| Mexican    | 1284          |
+| Hotels &amp; Travel | 1283          |
+| Fast Food  | 963           |
+| Arts &amp; Entertainment | 906           |
+| Hair Salons | 901           |
++------------+---------------+
+</code></pre></div>
+<p><strong>5. Explore the Yelp reviews dataset and combine it with the business data.</strong></p>
+
+<h4 id="take-a-look-at-the-contents-of-the-yelp-reviews-dataset.**-**"><strong>Take a look at the contents of the Yelp reviews dataset.</strong></h4>
+<div class="highlight"><pre><code class="language-text" data-lang="text">0: jdbc:drill:zk=local&gt; select * from dfs.`/users/nrentachintala/Downloads/yelp/yelp_academic_dataset_review.json` limit 1;
++------------+------------+------------+------------+------------+------------+------------+-------------+
+|   votes          |  user_id   | review_id  |   stars    |            date    |    text           |          type    | business_id |
++------------+------------+------------+------------+------------+------------+------------+-------------+
+| {&quot;funny&quot;:0,&quot;useful&quot;:2,&quot;cool&quot;:1} | Xqd0DzHaiyRqVH3WRG7hzg | 15SdjuK7DmYqUAj6rjGowg | 5            | 2007-05-17 | dr. goldberg offers everything i look for in a general practitioner.  he&#39;s nice and easy to talk to without being patronizing; he&#39;s always on time in seeing his patients; he&#39;s affiliated with a top-notch hospital (nyu) which my parents have explained to me is very important in case something happens and you need surgery; and you can get referrals to see specialists without having to see him first.  really, what more do you need?  i&#39;m sitting here trying to think of any complaints i have about him, but i&#39;m really drawing a blank. | review | vcNAWiLM4dR7D2nwwJ7nCA |
++------------+------------+------------+------------+------------+------------+------------+-------------+
+</code></pre></div>
+<h4 id="top-businesses-with-cool-rated-reviews"><strong>Top businesses with cool rated reviews</strong></h4>
+
+<p>Note that we are combining the Yelp business data set, which has the overall
+review_count, with the Yelp review data, which holds additional details on each
+of the reviews themselves.</p>
+<div class="highlight"><pre><code class="language-text" data-lang="text">0: jdbc:drill:zk=local&gt; Select b.name from dfs.`/users/nrentachintala/Downloads/yelp/yelp_academic_dataset_business.json` b where b.business_id in (SELECT r.business_id FROM dfs.`/users/nrentachintala/Downloads/yelp/yelp_academic_dataset_review.json` r
+GROUP BY r.business_id having sum(r.votes.cool) &gt; 2000 order by sum(r.votes.cool)  desc);
++------------+
+|    name         |
++------------+
+| Earl of Sandwich |
+| XS Nightclub |
+| The Cosmopolitan of Las Vegas |
+| Wicked Spoon |
++------------+
+</code></pre></div>
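+<p>The query above works in two stages: the subquery totals the <code>cool</code> votes per business and keeps the business IDs whose total exceeds the threshold, and the outer query returns the names of those businesses. A rough Python sketch of that logic, using invented records and a hypothetical function name:</p>

```python
from collections import defaultdict

# Hypothetical sample rows; field names follow the two Yelp files.
REVIEWS = [
    {"business_id": "b1", "votes": {"cool": 3}},
    {"business_id": "b1", "votes": {"cool": 4}},
    {"business_id": "b2", "votes": {"cool": 1}},
]
BUSINESSES = [
    {"business_id": "b1", "name": "Popular Place"},
    {"business_id": "b2", "name": "Quiet Place"},
]


def businesses_with_cool_votes_over(businesses, reviews, threshold):
    """Names of businesses whose summed cool votes exceed the threshold."""
    # Stage 1: GROUP BY business_id with sum(votes.cool)
    cool_totals = defaultdict(int)
    for review in reviews:
        cool_totals[review["business_id"]] += review["votes"]["cool"]
    # HAVING sum(...) > threshold
    qualifying = {bid for bid, total in cool_totals.items() if total > threshold}
    # Stage 2: outer query, WHERE business_id IN (qualifying ids)
    return [b["name"] for b in businesses if b["business_id"] in qualifying]


if __name__ == "__main__":
    print(businesses_with_cool_votes_over(BUSINESSES, REVIEWS, 5))
```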
+<p><strong>Create a view with the combined business and reviews data sets</strong></p>
+
+<p>Note that Drill views are lightweight, and can just be created in the local
+file system. Drill in standalone mode comes with a dfs.tmp workspace, which we
+can use to create views (or you can define your own workspaces on a local
+or distributed file system). If you want to persist the data physically
+instead of in a logical view, you can use CREATE TABLE AS SELECT syntax.</p>
+<div class="highlight"><pre><code class="language-text" data-lang="text">0: jdbc:drill:zk=local&gt; create or replace view dfs.tmp.businessreviews as Select b.name,b.stars,b.state,b.city,r.votes.funny,r.votes.useful,r.votes.cool, r.`date` from dfs.`/users/nrentachintala/Downloads/yelp/yelp_academic_dataset_business.json` b , dfs.`/users/nrentachintala/Downloads/yelp/yelp_academic_dataset_review.json` r where r.business_id=b.business_id;
++------------+------------+
+|     ok             |  summary   |
++------------+------------+
+| true              | View &#39;businessreviews&#39; created successfully in &#39;dfs.tmp&#39; schema |
++------------+------------+
+</code></pre></div>
+<p>Let’s get the total number of records from the view.</p>
+<div class="highlight"><pre><code class="language-text" data-lang="text">0: jdbc:drill:zk=local&gt; select count(*) as Total from dfs.tmp.businessreviews;
++------------+
+|   Total   |
++------------+
+| 1125458       |
++------------+
+</code></pre></div>
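+<p>The view is an inner join of the two files on <code>business_id</code>, so each review with a matching business contributes one row. The Python sketch below mimics that join on invented records; it assumes <code>business_id</code> is unique among the businesses, and the function name is hypothetical.</p>

```python
# Hypothetical sample rows; field names follow the two Yelp files.
BUSINESSES = [
    {"business_id": "b1", "name": "Popular Place", "stars": 4.5,
     "state": "NV", "city": "Las Vegas"},
]
REVIEWS = [
    {"business_id": "b1", "date": "2007-05-17",
     "votes": {"funny": 0, "useful": 2, "cool": 1}},
    {"business_id": "missing", "date": "2008-01-01",
     "votes": {"funny": 1, "useful": 0, "cool": 0}},
]


def business_reviews(businesses, reviews):
    """Inner join on business_id, keeping the columns the view selects."""
    # Assumes business_id is unique within the business records.
    by_id = {b["business_id"]: b for b in businesses}
    rows = []
    for review in reviews:
        business = by_id.get(review["business_id"])
        if business is None:
            continue  # no matching business, so the join drops this review
        rows.append({
            "name": business["name"],
            "stars": business["stars"],
            "state": business["state"],
            "city": business["city"],
            "funny": review["votes"]["funny"],
            "useful": review["votes"]["useful"],
            "cool": review["votes"]["cool"],
            "date": review["date"],
        })
    return rows


if __name__ == "__main__":
    print(len(business_reviews(BUSINESSES, REVIEWS)))
```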
+<p>In addition to these queries, you can get many deeper insights using
+Drill’s <a href="https://cwiki.apache.org/confluence/display/DRILL/SQL+Reference">SQL functionality</a>. If you are not comfortable with writing queries manually, you
+can use BI/analytics tools such as Tableau or MicroStrategy to query raw
+files, Hive and HBase data, or Drill-created views directly through the Drill
+ODBC/JDBC drivers.</p>
+
+<p>The goal of Apache Drill is to provide the freedom and flexibility in
+exploring data in ways we have never seen before with SQL technologies. The
+community is working on more exciting features around nested data and
+supporting data with changing schemas in upcoming releases.</p>
+
+<p>As an example, a new FLATTEN function is in development (an upcoming feature
+in 0.7). This function can be used to dynamically rationalize semi-structured
+data so you can apply even deeper SQL functionality. Here is a sample query:</p>
+
+<h4 id="get-a-flattened-list-of-categories-for-each-business"><strong>Get a flattened list of categories for each business</strong></h4>
+<div class="highlight"><pre><code class="language-text" data-lang="text">0: jdbc:drill:zk=local&gt; select name, flatten(categories) as category from dfs.`/users/nrentachintala/Downloads/yelp/yelp_academic_dataset_business.json`  limit 20;
++------------+------------+
+|    name         |   category   |
++------------+------------+
+| Eric Goldberg, MD | Doctors          |
+| Eric Goldberg, MD | Health &amp; Medical |
+| Pine Cone Restaurant | Restaurants |
+| Deforest Family Restaurant | American (Traditional) |
+| Deforest Family Restaurant | Restaurants |
+| Culver&#39;s   | Food       |
+| Culver&#39;s   | Ice Cream &amp; Frozen Yogurt |
+| Culver&#39;s   | Fast Food  |
+| Culver&#39;s   | Restaurants |
+| Chang Jiang Chinese Kitchen | Chinese    |
+| Chang Jiang Chinese Kitchen | Restaurants |
+| Charter Communications | Television Stations |
+| Charter Communications | Mass Media |
+| Air Quality Systems | Home Services |
+| Air Quality Systems | Heating &amp; Air Conditioning/HVAC |
+| McFarland Public Library | Libraries  |
+| McFarland Public Library | Public Services &amp; Government |
+| Green Lantern Restaurant | American (Traditional) |
+| Green Lantern Restaurant | Restaurants |
+| Spartan Animal Hospital | Veterinarians |
++------------+------------+
+</code></pre></div>
+<p><strong>Top categories used in business reviews</strong></p>
+<div class="highlight"><pre><code class="language-text" data-lang="text">0: jdbc:drill:zk=local&gt; select celltbl.catl, count(celltbl.catl) categorycnt from (select flatten(categories) catl from dfs.`/users/nrentachintala/Downloads/yelp_dataset_challenge_academic_dataset/yelp_academic_dataset_business.json` )  celltbl group by celltbl.catl order by count(celltbl.catl) desc limit 10 ;
++------------+-------------+
+|    catl    | categorycnt |
++------------+-------------+
+| Restaurants | 14303       |
+| Shopping   | 6428        |
+| Food       | 5209        |
+| Beauty &amp; Spas | 3421        |
+| Nightlife  | 2870        |
+| Bars       | 2378        |
+| Health &amp; Medical | 2351        |
+| Automotive | 2241        |
+| Home Services | 1957        |
+| Fashion    | 1897        |
++------------+-------------+
+</code></pre></div>
+<p>Stay tuned for more features and upcoming activities in the Drill community.</p>
+
+<p>To learn more about Drill, please refer to the following resources:</p>
+
+<ul>
+<li>Download Drill here: <a href="http://incubator.apache.org/drill/download/">http://incubator.apache.org/drill/download/</a></li>
+<li>10 reasons we think Drill is cool: <a href="http://incubator.apache.org/drill/why-drill/">http://incubator.apache.org/drill/why-drill/</a></li>
+<li>A simple 10-minute tutorial: <a href="https://cwiki.apache.org/confluence/display/DRILL/Apache+Drill+in+10+Minutes">https://cwiki.apache.org/confluence/display/DRILL/Apache+Drill+in+10+Minutes</a></li>
+<li>A more comprehensive tutorial: <a href="https://cwiki.apache.org/confluence/display/DRILL/Apache+Drill+Tutorial">https://cwiki.apache.org/confluence/display/DRILL/Apache+Drill+Tutorial</a></li>
+</ul>
+</div>
+
+
+<div id="footer" class="mw">
+<div class="wrapper">
+Copyright © 2012-2014 The Apache Software Foundation, licensed under the Apache License, Version 2.0.<br>
+Apache and the Apache feather logo are trademarks of The Apache Software Foundation. Other names appearing on the site may be trademarks of their respective owners.<br/><br/>
+</div>
+</div>
+
+<script>
+(function(i,s,o,g,r,a,m){i['GoogleAnalyticsObject']=r;i[r]=i[r]||function(){
+(i[r].q=i[r].q||[]).push(arguments)},i[r].l=1*new Date();a=s.createElement(o),
+m=s.getElementsByTagName(o)[0];a.async=1;a.src=g;m.parentNode.insertBefore(a,m)
+})(window,document,'script','//www.google-analytics.com/analytics.js','ga');
+
+ga('create', 'UA-53379651-1', 'auto');
+ga('send', 'pageview');
+</script>
+
+</body>
+</html>

Added: drill/site/trunk/content/drill/docs/aol-search/index.html
URL: http://svn.apache.org/viewvc/drill/site/trunk/content/drill/docs/aol-search/index.html?rev=1651949&view=auto
==============================================================================
--- drill/site/trunk/content/drill/docs/aol-search/index.html (added)
+++ drill/site/trunk/content/drill/docs/aol-search/index.html Thu Jan 15 05:11:44 2015
@@ -0,0 +1,138 @@
+<!DOCTYPE html>
+<html>
+
+<head>
+
+<meta charset="UTF-8">
+
+
+<title>AOL Search - Apache Drill</title>
+
+<link href="/css/syntax.css" rel="stylesheet" type="text/css">
+<link href="/css/style.css" rel="stylesheet" type="text/css">
+<link href="/css/arrows.css" rel="stylesheet" type="text/css">
+<link href="/css/button.css" rel="stylesheet" type="text/css">
+
+<link rel="shortcut icon" href="/favicon.ico" type="image/x-icon">
+<link rel="icon" href="/favicon.ico" type="image/x-icon">
+
+<script language="javascript" type="text/javascript" src="/js/lib/jquery-1.11.1.min.js"></script>
+<script language="javascript" type="text/javascript" src="/js/lib/jquery.easing.1.3.js"></script>
+<script language="javascript" type="text/javascript" src="/js/modernizr.custom.js"></script>
+<script language="javascript" type="text/javascript" src="/js/script.js"></script>
+
+</head>
+
+<body onResize="resized();">
+
+<div class="bui"></div>
+
+<div id="search">
+<input type="text" placeholder="Enter search term here">
+</div>
+
+<div id="menu" class="mw">
+<ul>
+  <li class="logo"><a href="/"></a></li>
+  <li>
+    <a href="/overview/">Documentation</a>
+    <ul>
+      <li><a href="/overview/">Overview&nbsp;&nbsp;&nbsp;&nbsp;</a></li>
+      <li><a href="https://cwiki.apache.org/confluence/display/DRILL/Apache+Drill+in+10+Minutes" target="_blank">Drill in 10 Minutes</a></li>
+      <li><a href="/why/">Why Drill? &nbsp;&nbsp;&nbsp;&nbsp;</a></li>
+      <li><a href="/architecture/">Architecture</a></li>
+    </ul>
+  </li>
+  <li>
+    <a href="/community/">Community</a>
+    <ul>
+      <li><a href="/team/">Team</a></li>
+      <li><a href="/community/#events">Events and Meetups</a></li>
+      <li><a href="/community/#mailinglists">Mailing Lists</a></li>
+      <li><a href="/community/#getinvolved">Get Involved</a></li>
+      <li><a href="https://issues.apache.org/jira/browse/DRILL/" target="_blank">Issue Tracker</a></li>
+      <li><a href="https://github.com/apache/drill" target="_blank">GitHub</a></li>
+    </ul>
+  </li>
+  <li><a href="/faq/">FAQ</a></li>
+  <li><a href="/blog/">Blog</a></li>
+  <li style="width:30px; padding-left: 2px; padding-right:10px"><a href="https://twitter.com/apachedrill" target="_blank"><img src="/images/twitterbw.png" alt="" align="center" width="22" style="padding: 0px 10px 1px 0px;"></a> </li>
+  <li class="l"><span>&nbsp;</span></li>
+  <li class="d"><a href="/download/">Download</a></li>
+</ul>
+</div>
+
+<div class="int_title">
+<h1>AOL Search</h1>
+
+</div>
+
+<div class="int_text" align="left"><h2 id="quick-stats">Quick Stats</h2>
+
+<p>The <a href="http://en.wikipedia.org/wiki/AOL_search_data_leak">AOL Search dataset</a> is
+a collection of real query logs generated by real users.</p>
+
+<h2 id="the-data-source">The Data Source</h2>
+
+<p>The dataset consists of 20M web queries from 650k users, collected over a period of
+three months. It is 440MB in total and is available <a href="http://zola.di.unipi.it/smalltext/datasets.html">for
+download</a>. The format used in
+the dataset is:</p>
+<div class="highlight"><pre><code class="language-text" data-lang="text">AnonID, Query, QueryTime, ItemRank, ClickURL
+</code></pre></div>
+<p>The fields are:</p>
+
+<ul>
+<li>AnonID, an anonymous user ID number.</li>
+<li>Query, the query issued by the user, case shifted with most punctuation removed.</li>
+<li>QueryTime, the time at which the query was submitted for search.</li>
+<li>ItemRank, if the user clicked on a search result, the rank of the item on which they clicked is listed.</li>
+<li>ClickURL, if the user clicked on a search result, the domain portion of the URL in the clicked result is listed.</li>
+</ul>
+
+<p>Each line in the data represents one of two types of events:</p>
+
+<ul>
+<li>A query that was NOT followed by the user clicking on a result item.</li>
+<li>A click through on an item in the result list returned from a query.</li>
+</ul>
+
+<p>In the first case (query only), there is data in only the first three columns;
+in the second case (click through), there is data in all five columns. For
+click through events, the query that preceded the click through is included.
+Note that if a user clicked on more than one result in the list returned from
+a single query, there will be TWO lines in the data to represent the two
+events.</p>
+
+<h2 id="the-queries">The Queries</h2>
+
+<p>Interesting queries include, for example:</p>
+
+<ul>
+<li>Users querying for topic X</li>
+<li>Users that click on the first (second, third) ranked item</li>
+<li>Top 10 domains searched</li>
+<li>Top 10 domains clicked on</li>
+</ul>
+</div>
+
+
+<div id="footer" class="mw">
+<div class="wrapper">
+Copyright © 2012-2014 The Apache Software Foundation, licensed under the Apache License, Version 2.0.<br>
+Apache and the Apache feather logo are trademarks of The Apache Software Foundation. Other names appearing on the site may be trademarks of their respective owners.<br/><br/>
+</div>
+</div>
+
+<script>
+(function(i,s,o,g,r,a,m){i['GoogleAnalyticsObject']=r;i[r]=i[r]||function(){
+(i[r].q=i[r].q||[]).push(arguments)},i[r].l=1*new Date();a=s.createElement(o),
+m=s.getElementsByTagName(o)[0];a.async=1;a.src=g;m.parentNode.insertBefore(a,m)
+})(window,document,'script','//www.google-analytics.com/analytics.js','ga');
+
+ga('create', 'UA-53379651-1', 'auto');
+ga('send', 'pageview');
+</script>
+
+</body>
+</html>

Added: drill/site/trunk/content/drill/docs/apache-drill-0-4-0-release-notes/index.html
URL: http://svn.apache.org/viewvc/drill/site/trunk/content/drill/docs/apache-drill-0-4-0-release-notes/index.html?rev=1651949&view=auto
==============================================================================
--- drill/site/trunk/content/drill/docs/apache-drill-0-4-0-release-notes/index.html (added)
+++ drill/site/trunk/content/drill/docs/apache-drill-0-4-0-release-notes/index.html Thu Jan 15 05:11:44 2015
@@ -0,0 +1,127 @@
+<!DOCTYPE html>
+<html>
+
+<head>
+
+<meta charset="UTF-8">
+
+
+<title>Apache Drill 0.4.0 Release Notes - Apache Drill</title>
+
+<link href="/css/syntax.css" rel="stylesheet" type="text/css">
+<link href="/css/style.css" rel="stylesheet" type="text/css">
+<link href="/css/arrows.css" rel="stylesheet" type="text/css">
+<link href="/css/button.css" rel="stylesheet" type="text/css">
+
+<link rel="shortcut icon" href="/favicon.ico" type="image/x-icon">
+<link rel="icon" href="/favicon.ico" type="image/x-icon">
+
+<script language="javascript" type="text/javascript" src="/js/lib/jquery-1.11.1.min.js"></script>
+<script language="javascript" type="text/javascript" src="/js/lib/jquery.easing.1.3.js"></script>
+<script language="javascript" type="text/javascript" src="/js/modernizr.custom.js"></script>
+<script language="javascript" type="text/javascript" src="/js/script.js"></script>
+
+</head>
+
+<body onResize="resized();">
+
+<div class="bui"></div>
+
+<div id="search">
+<input type="text" placeholder="Enter search term here">
+</div>
+
+<div id="menu" class="mw">
+<ul>
+  <li class="logo"><a href="/"></a></li>
+  <li>
+    <a href="/overview/">Documentation</a>
+    <ul>
+      <li><a href="/overview/">Overview&nbsp;&nbsp;&nbsp;&nbsp;</a></li>
+      <li><a href="https://cwiki.apache.org/confluence/display/DRILL/Apache+Drill+in+10+Minutes" target="_blank">Drill in 10 Minutes</a></li>
+      <li><a href="/why/">Why Drill? &nbsp;&nbsp;&nbsp;&nbsp;</a></li>
+      <li><a href="/architecture/">Architecture</a></li>
+    </ul>
+  </li>
+  <li>
+    <a href="/community/">Community</a>
+    <ul>
+      <li><a href="/team/">Team</a></li>
+      <li><a href="/community/#events">Events and Meetups</a></li>
+      <li><a href="/community/#mailinglists">Mailing Lists</a></li>
+      <li><a href="/community/#getinvolved">Get Involved</a></li>
+      <li><a href="https://issues.apache.org/jira/browse/DRILL/" target="_blank">Issue Tracker</a></li>
+      <li><a href="https://github.com/apache/drill" target="_blank">GitHub</a></li>
+    </ul>
+  </li>
+  <li><a href="/faq/">FAQ</a></li>
+  <li><a href="/blog/">Blog</a></li>
+  <li style="width:30px; padding-left: 2px; padding-right:10px"><a href="https://twitter.com/apachedrill" target="_blank"><img src="/images/twitterbw.png" alt="" align="center" width="22" style="padding: 0px 10px 1px 0px;"></a> </li>
+  <li class="l"><span>&nbsp;</span></li>
+  <li class="d"><a href="/download/">Download</a></li>
+</ul>
+</div>
+
+<div class="int_title">
+<h1>Apache Drill 0.4.0 Release Notes</h1>
+
+</div>
+
+<div class="int_text" align="left"><p>The 0.4.0 release is a developer preview release, designed to help enthusiasts
+start to work with and experiment with Drill. It is the first Drill release
+that provides distributed query execution.</p>
+
+<p>This release is built upon <a href="https://issues.apache.org/jira/browse/DRILL/fixforversion/12324963/">more than 800
+JIRAs</a>.
+It is a pre-beta release on the way towards Drill. As a developer snapshot,
+the release contains a large number of outstanding bugs that will make some
+use cases challenging. Feel free to consult outstanding issues <a href="https://issues.apache.org/jira/browse/DRILL/fixforversion/12324880/">targeted for
+the 0.5.0
+release</a>
+to see whether your use case is affected.</p>
+
+<p>To read more about this release and new features introduced, please view the
+<a href="https://blogs.apache.org/drill/entry/announcing_apache_drill_0_4">0.4.0 announcement blog
+entry</a>.</p>
+
+<p>The release is available as both <a href="http://www.apache.org/dyn/closer.cgi/incubator/drill/drill-0.4.0-incubating/apache-drill-0.4.0-incubating.tar.gz">binary</a>
+and <a href="http://www.apache.org/dyn/closer.cgi/incubator/drill/drill-0.4.0-incubating/apache-drill-0.4.0-incubating-src.tar.gz">source</a> tarballs. In both cases,
+these are compiled against Apache Hadoop. Drill has also been tested against
+MapR, Cloudera and Hortonworks Hadoop distributions and there are associated
+build profiles or JIRAs that can help you run against your preferred
+distribution.</p>
+
+<p>Some Key Notes &amp; Limitations</p>
+
+<ul>
+<li>The current release supports in-memory and beyond-memory execution. However, users must disable memory-intensive hash aggregate and hash join operations to leverage this functionality.</li>
+<li>In many cases, merge join operations return incorrect results.</li>
+<li>Use of a local filter in a join “on” clause when using left, right, or full outer joins may produce incorrect results.</li>
+<li>Because of known memory leaks and memory overrun issues, you may need more memory, and you may need to restart the system in some cases.</li>
+<li>Some types of complex expressions, especially those involving empty arrays, may fail or return incorrect results.</li>
+<li>While the Drill execution engine supports dynamic schema changes during the course of a query, some operators have yet to implement support for this behavior (such as Sort). Other operations (such as streaming aggregate) may have partial support that leads to unexpected results.</li>
+<li>Protobuf, UDF, and query plan interfaces, along with all other interfaces, are subject to change in incompatible ways.</li>
+<li>Multiplication of some types of DECIMAL(28+,*) will return an incorrect result.</li>
+</ul>
+</div>
+
+
+<div id="footer" class="mw">
+<div class="wrapper">
+Copyright © 2012-2014 The Apache Software Foundation, licensed under the Apache License, Version 2.0.<br>
+Apache and the Apache feather logo are trademarks of The Apache Software Foundation. Other names appearing on the site may be trademarks of their respective owners.<br/><br/>
+</div>
+</div>
+
+<script>
+(function(i,s,o,g,r,a,m){i['GoogleAnalyticsObject']=r;i[r]=i[r]||function(){
+(i[r].q=i[r].q||[]).push(arguments)},i[r].l=1*new Date();a=s.createElement(o),
+m=s.getElementsByTagName(o)[0];a.async=1;a.src=g;m.parentNode.insertBefore(a,m)
+})(window,document,'script','//www.google-analytics.com/analytics.js','ga');
+
+ga('create', 'UA-53379651-1', 'auto');
+ga('send', 'pageview');
+</script>
+
+</body>
+</html>

Added: drill/site/trunk/content/drill/docs/apache-drill-0-5-0-release-notes/index.html
URL: http://svn.apache.org/viewvc/drill/site/trunk/content/drill/docs/apache-drill-0-5-0-release-notes/index.html?rev=1651949&view=auto
==============================================================================
--- drill/site/trunk/content/drill/docs/apache-drill-0-5-0-release-notes/index.html (added)
+++ drill/site/trunk/content/drill/docs/apache-drill-0-5-0-release-notes/index.html Thu Jan 15 05:11:44 2015
@@ -0,0 +1,112 @@
+<!DOCTYPE html>
+<html>
+
+<head>
+
+<meta charset="UTF-8">
+
+
+<title>Apache Drill 0.5.0 Release Notes - Apache Drill</title>
+
+<link href="/css/syntax.css" rel="stylesheet" type="text/css">
+<link href="/css/style.css" rel="stylesheet" type="text/css">
+<link href="/css/arrows.css" rel="stylesheet" type="text/css">
+<link href="/css/button.css" rel="stylesheet" type="text/css">
+
+<link rel="shortcut icon" href="/favicon.ico" type="image/x-icon">
+<link rel="icon" href="/favicon.ico" type="image/x-icon">
+
+<script language="javascript" type="text/javascript" src="/js/lib/jquery-1.11.1.min.js"></script>
+<script language="javascript" type="text/javascript" src="/js/lib/jquery.easing.1.3.js"></script>
+<script language="javascript" type="text/javascript" src="/js/modernizr.custom.js"></script>
+<script language="javascript" type="text/javascript" src="/js/script.js"></script>
+
+</head>
+
+<body onResize="resized();">
+
+<div class="bui"></div>
+
+<div id="search">
+<input type="text" placeholder="Enter search term here">
+</div>
+
+<div id="menu" class="mw">
+<ul>
+  <li class="logo"><a href="/"></a></li>
+  <li>
+    <a href="/overview/">Documentation</a>
+    <ul>
+      <li><a href="/overview/">Overview&nbsp;&nbsp;&nbsp;&nbsp;</a></li>
+      <li><a href="https://cwiki.apache.org/confluence/display/DRILL/Apache+Drill+in+10+Minutes" target="_blank">Drill in 10 Minutes</a></li>
+      <li><a href="/why/">Why Drill? &nbsp;&nbsp;&nbsp;&nbsp;</a></li>
+      <li><a href="/architecture/">Architecture</a></li>
+    </ul>
+  </li>
+  <li>
+    <a href="/community/">Community</a>
+    <ul>
+      <li><a href="/team/">Team</a></li>
+      <li><a href="/community/#events">Events and Meetups</a></li>
+      <li><a href="/community/#mailinglists">Mailing Lists</a></li>
+      <li><a href="/community/#getinvolved">Get Involved</a></li>
+      <li><a href="https://issues.apache.org/jira/browse/DRILL/" target="_blank">Issue Tracker</a></li>
+      <li><a href="https://github.com/apache/drill" target="_blank">GitHub</a></li>
+    </ul>
+  </li>
+  <li><a href="/faq/">FAQ</a></li>
+  <li><a href="/blog/">Blog</a></li>
+  <li style="width:30px; padding-left: 2px; padding-right:10px"><a href="https://twitter.com/apachedrill" target="_blank"><img src="/images/twitterbw.png" alt="" align="center" width="22" style="padding: 0px 10px 1px 0px;"></a> </li>
+  <li class="l"><span>&nbsp;</span></li>
+  <li class="d"><a href="/download/">Download</a></li>
+</ul>
+</div>
+
+<div class="int_title">
+<h1>Apache Drill 0.5.0 Release Notes</h1>
+
+</div>
+
+<div class="int_text" align="left"><p>Apache Drill 0.5.0, the first beta release for Drill, is designed to help
+enthusiasts start working and experimenting with Drill. It also continues the
+Drill monthly release cycle as we drive towards general availability.</p>
+
+<p>The 0.5.0 release is primarily a bug fix release, with <a href="https://issues.apache.org/jira/secure/ReleaseNote.jspa?projectId=12313820&amp;version=12324880">more than 100 JIRAs</a> closed, but there are some notable features. For information
+about the features, see the <a href="https://blogs.apache.org/drill/entry/apache_drill_beta_release_see">Apache Drill Blog for the 0.5.0
+release</a>.</p>
+
+<p>This release is available as <a href="http://www.apache.org/dyn/closer.cgi/incubator/drill/drill-0.5.0-incubating/apache-drill-0.5.0-incubating.tar.gz">binary</a> and
+<a href="http://www.apache.org/dyn/closer.cgi/incubator/drill/drill-0.5.0-incubating/apache-drill-0.5.0-incubating-src.tar.gz">source</a> tarballs that are compiled
+against Apache Hadoop. Drill has been tested against MapR, Cloudera, and
+Hortonworks Hadoop distributions. There are associated build profiles and
+JIRAs that can help you run Drill against your preferred distribution.</p>
+
+<p>Apache Drill 0.5.0 Key Notes and Limitations</p>
+
+<ul>
+<li>The current release supports in-memory and beyond-memory execution. However, you must disable memory-intensive hash aggregate and hash join operations to leverage this functionality.</li>
+<li>While the Drill execution engine supports dynamic schema changes during the course of a query, some operators have yet to implement support for this behavior, such as Sort. Other operations, such as streaming aggregate, may have partial support that leads to unexpected results.</li>
+<li>There are known issues with joining text files without using an intervening view. See <a href="https://issues.apache.org/jira/browse/DRILL-1401">DRILL-1401</a> for more information.</li>
+</ul>
+</div>
+
+
+<div id="footer" class="mw">
+<div class="wrapper">
+Copyright © 2012-2014 The Apache Software Foundation, licensed under the Apache License, Version 2.0.<br>
+Apache and the Apache feather logo are trademarks of The Apache Software Foundation. Other names appearing on the site may be trademarks of their respective owners.<br/><br/>
+</div>
+</div>
+
+<script>
+(function(i,s,o,g,r,a,m){i['GoogleAnalyticsObject']=r;i[r]=i[r]||function(){
+(i[r].q=i[r].q||[]).push(arguments)},i[r].l=1*new Date();a=s.createElement(o),
+m=s.getElementsByTagName(o)[0];a.async=1;a.src=g;m.parentNode.insertBefore(a,m)
+})(window,document,'script','//www.google-analytics.com/analytics.js','ga');
+
+ga('create', 'UA-53379651-1', 'auto');
+ga('send', 'pageview');
+</script>
+
+</body>
+</html>

Added: drill/site/trunk/content/drill/docs/apache-drill-0-6-0-release-notes-apache-drill-alpha/index.html
URL: http://svn.apache.org/viewvc/drill/site/trunk/content/drill/docs/apache-drill-0-6-0-release-notes-apache-drill-alpha/index.html?rev=1651949&view=auto
==============================================================================
--- drill/site/trunk/content/drill/docs/apache-drill-0-6-0-release-notes-apache-drill-alpha/index.html (added)
+++ drill/site/trunk/content/drill/docs/apache-drill-0-6-0-release-notes-apache-drill-alpha/index.html Thu Jan 15 05:11:44 2015
@@ -0,0 +1,118 @@
+<!DOCTYPE html>
+<html>
+
+<head>
+
+<meta charset="UTF-8">
+
+
+<title>Apache Drill 0.6.0 Release Notes (Apache Drill Alpha) - Apache Drill</title>
+
+<link href="/css/syntax.css" rel="stylesheet" type="text/css">
+<link href="/css/style.css" rel="stylesheet" type="text/css">
+<link href="/css/arrows.css" rel="stylesheet" type="text/css">
+<link href="/css/button.css" rel="stylesheet" type="text/css">
+
+<link rel="shortcut icon" href="/favicon.ico" type="image/x-icon">
+<link rel="icon" href="/favicon.ico" type="image/x-icon">
+
+<script language="javascript" type="text/javascript" src="/js/lib/jquery-1.11.1.min.js"></script>
+<script language="javascript" type="text/javascript" src="/js/lib/jquery.easing.1.3.js"></script>
+<script language="javascript" type="text/javascript" src="/js/modernizr.custom.js"></script>
+<script language="javascript" type="text/javascript" src="/js/script.js"></script>
+
+</head>
+
+<body onResize="resized();">
+
+<div class="bui"></div>
+
+<div id="search">
+<input type="text" placeholder="Enter search term here">
+</div>
+
+<div id="menu" class="mw">
+<ul>
+  <li class="logo"><a href="/"></a></li>
+  <li>
+    <a href="/overview/">Documentation</a>
+    <ul>
+      <li><a href="/overview/">Overview&nbsp;&nbsp;&nbsp;&nbsp;</a></li>
+      <li><a href="https://cwiki.apache.org/confluence/display/DRILL/Apache+Drill+in+10+Minutes" target="_blank">Drill in 10 Minutes</a></li>
+      <li><a href="/why/">Why Drill? &nbsp;&nbsp;&nbsp;&nbsp;</a></li>
+      <li><a href="/architecture/">Architecture</a></li>
+    </ul>
+  </li>
+  <li>
+    <a href="/community/">Community</a>
+    <ul>
+      <li><a href="/team/">Team</a></li>
+      <li><a href="/community/#events">Events and Meetups</a></li>
+      <li><a href="/community/#mailinglists">Mailing Lists</a></li>
+      <li><a href="/community/#getinvolved">Get Involved</a></li>
+      <li><a href="https://issues.apache.org/jira/browse/DRILL/" target="_blank">Issue Tracker</a></li>
+      <li><a href="https://github.com/apache/drill" target="_blank">GitHub</a></li>
+    </ul>
+  </li>
+  <li><a href="/faq/">FAQ</a></li>
+  <li><a href="/blog/">Blog</a></li>
+  <li style="width:30px; padding-left: 2px; padding-right:10px"><a href="https://twitter.com/apachedrill" target="_blank"><img src="/images/twitterbw.png" alt="" align="center" width="22" style="padding: 0px 10px 1px 0px;"></a> </li>
+  <li class="l"><span>&nbsp;</span></li>
+  <li class="d"><a href="/download/">Download</a></li>
+</ul>
+</div>
+
+<div class="int_title">
+<h1>Apache Drill 0.6.0 Release Notes (Apache Drill Alpha)</h1>
+
+</div>
+
+<div class="int_text" align="left"><p>Apache Drill 0.6.0, the second beta release for Drill, is designed to help
+enthusiasts start working and experimenting with Drill. It also continues the
+Drill monthly release cycle as we drive towards general availability.</p>
+
+<p>This release is available as <a href="http://www.apache.org/dyn/closer.cgi/incubator/drill/drill-0.5.0-incubating/apache-drill-0.5.0-incubating.tar.gz">binary</a> and
+<a href="http://www.apache.org/dyn/closer.cgi/incubator/drill/drill-0.5.0-incubating/apache-drill-0.5.0-incubating-src.tar.gz">source</a> tarballs that are compiled
+against Apache Hadoop. Drill has been tested against MapR, Cloudera, and
+Hortonworks Hadoop distributions. There are associated build profiles and
+JIRAs that can help you run Drill against your preferred distribution.</p>
+
+<p>Apache Drill 0.6.0 Key Features</p>
+
+<p>This release is primarily a bug fix release, with <a href="https://issues.apache.org/jira/secure/ReleaseNote.jspa?projectId=12313820&amp;version=12327472">more than 30 JIRAs closed</a>, but there are some notable features:</p>
+
+<ul>
+<li>Direct ANSI SQL access to MongoDB, using the latest <a href="/confluence/display/DRILL/MongoDB+Plugin+for+Apache+Drill">MongoDB Plugin for Apache Drill</a></li>
+<li>Filesystem query performance improvements with partition pruning</li>
+<li>Ability to use the file system as a persistent store for query profiles and diagnostic information</li>
+<li>Window function support (alpha)</li>
+</ul>
+
+<p>Apache Drill 0.6.0 Key Notes and Limitations</p>
+
+<ul>
+<li>The current release supports in-memory and beyond-memory execution. However, you must disable memory-intensive hash aggregate and hash join operations to leverage this functionality.</li>
+<li>While the Drill execution engine supports dynamic schema changes during the course of a query, some operators have yet to implement support for this behavior, such as Sort. Other operations, such as streaming aggregate, may have partial support that leads to unexpected results.</li>
+</ul>
+</div>
+
+
+<div id="footer" class="mw">
+<div class="wrapper">
+Copyright © 2012-2014 The Apache Software Foundation, licensed under the Apache License, Version 2.0.<br>
+Apache and the Apache feather logo are trademarks of The Apache Software Foundation. Other names appearing on the site may be trademarks of their respective owners.<br/><br/>
+</div>
+</div>
+
+<script>
+(function(i,s,o,g,r,a,m){i['GoogleAnalyticsObject']=r;i[r]=i[r]||function(){
+(i[r].q=i[r].q||[]).push(arguments)},i[r].l=1*new Date();a=s.createElement(o),
+m=s.getElementsByTagName(o)[0];a.async=1;a.src=g;m.parentNode.insertBefore(a,m)
+})(window,document,'script','//www.google-analytics.com/analytics.js','ga');
+
+ga('create', 'UA-53379651-1', 'auto');
+ga('send', 'pageview');
+</script>
+
+</body>
+</html>

Added: drill/site/trunk/content/drill/docs/apache-drill-0-7-0-release-notes-apache-drill-alpha/index.html
URL: http://svn.apache.org/viewvc/drill/site/trunk/content/drill/docs/apache-drill-0-7-0-release-notes-apache-drill-alpha/index.html?rev=1651949&view=auto
==============================================================================
--- drill/site/trunk/content/drill/docs/apache-drill-0-7-0-release-notes-apache-drill-alpha/index.html (added)
+++ drill/site/trunk/content/drill/docs/apache-drill-0-7-0-release-notes-apache-drill-alpha/index.html Thu Jan 15 05:11:44 2015
@@ -0,0 +1,132 @@
+<!DOCTYPE html>
+<html>
+
+<head>
+
+<meta charset="UTF-8">
+
+
+<title>Apache Drill 0.7.0 Release Notes (Apache Drill Alpha) - Apache Drill</title>
+
+<link href="/css/syntax.css" rel="stylesheet" type="text/css">
+<link href="/css/style.css" rel="stylesheet" type="text/css">
+<link href="/css/arrows.css" rel="stylesheet" type="text/css">
+<link href="/css/button.css" rel="stylesheet" type="text/css">
+
+<link rel="shortcut icon" href="/favicon.ico" type="image/x-icon">
+<link rel="icon" href="/favicon.ico" type="image/x-icon">
+
+<script language="javascript" type="text/javascript" src="/js/lib/jquery-1.11.1.min.js"></script>
+<script language="javascript" type="text/javascript" src="/js/lib/jquery.easing.1.3.js"></script>
+<script language="javascript" type="text/javascript" src="/js/modernizr.custom.js"></script>
+<script language="javascript" type="text/javascript" src="/js/script.js"></script>
+
+</head>
+
+<body onResize="resized();">
+
+<div class="bui"></div>
+
+<div id="search">
+<input type="text" placeholder="Enter search term here">
+</div>
+
+<div id="menu" class="mw">
+<ul>
+  <li class="logo"><a href="/"></a></li>
+  <li>
+    <a href="/overview/">Documentation</a>
+    <ul>
+      <li><a href="/overview/">Overview&nbsp;&nbsp;&nbsp;&nbsp;</a></li>
+      <li><a href="https://cwiki.apache.org/confluence/display/DRILL/Apache+Drill+in+10+Minutes" target="_blank">Drill in 10 Minutes</a></li>
+      <li><a href="/why/">Why Drill? &nbsp;&nbsp;&nbsp;&nbsp;</a></li>
+      <li><a href="/architecture/">Architecture</a></li>
+    </ul>
+  </li>
+  <li>
+    <a href="/community/">Community</a>
+    <ul>
+      <li><a href="/team/">Team</a></li>
+      <li><a href="/community/#events">Events and Meetups</a></li>
+      <li><a href="/community/#mailinglists">Mailing Lists</a></li>
+      <li><a href="/community/#getinvolved">Get Involved</a></li>
+      <li><a href="https://issues.apache.org/jira/browse/DRILL/" target="_blank">Issue Tracker</a></li>
+      <li><a href="https://github.com/apache/drill" target="_blank">GitHub</a></li>
+    </ul>
+  </li>
+  <li><a href="/faq/">FAQ</a></li>
+  <li><a href="/blog/">Blog</a></li>
+  <li style="width:30px; padding-left: 2px; padding-right:10px"><a href="https://twitter.com/apachedrill" target="_blank"><img src="/images/twitterbw.png" alt="" align="center" width="22" style="padding: 0px 10px 1px 0px;"></a> </li>
+  <li class="l"><span>&nbsp;</span></li>
+  <li class="d"><a href="/download/">Download</a></li>
+</ul>
+</div>
+
+<div class="int_title">
+<h1>Apache Drill 0.7.0 Release Notes (Apache Drill Alpha)</h1>
+
+</div>
+
+<div class="int_text" align="left"><p>Apache Drill 0.7.0, the third beta release for Drill, is designed to help
+enthusiasts start working and experimenting with Drill. It also continues the
+Drill monthly release cycle as we drive towards general availability.</p>
+
+<p>This release is available as
+<a href="http://www.apache.org/dyn/closer.cgi/drill/drill-0.7.0/apache-drill-0.7.0.tar.gz">binary</a> and
+<a href="http://www.apache.org/dyn/closer.cgi/drill/drill-0.7.0/apache-drill-0.7.0-src.tar.gz">source</a> tarballs that are compiled against Apache Hadoop.
+Drill has been tested against MapR, Cloudera, and Hortonworks Hadoop
+distributions. There are associated build profiles and JIRAs that can help you
+run Drill against your preferred distribution.</p>
+
+<p>Apache Drill 0.7.0 Key Features</p>
+
+<ul>
+<li><p>No more dependency on UDP/multicast, making it possible for Drill to work well in the following scenarios:</p>
+
+<ul>
+<li>UDP multicast not enabled (as in EC2)</li>
+<li>Cluster spans multiple subnets</li>
+<li>Cluster has multihome configuration</li>
+</ul></li>
+<li><p>New functions to work natively with nested data: KVGen and Flatten</p></li>
+<li><p>Support for Hive 0.13 (Hive 0.12 with Drill is no longer supported)</p></li>
+<li><p>Improved performance when querying Hive tables and the file system through partition pruning</p></li>
+<li><p>Improved performance for HBase with LIKE operator pushdown</p></li>
+<li><p>Improved memory management</p></li>
+<li><p>Drill web UI monitoring and query profile improvements</p></li>
+<li><p>Ability to parse files without explicit extensions using default storage format specification</p></li>
+<li><p>Fixes for dealing with complex/nested data objects in Parquet/JSON</p></li>
+<li><p>Fast schema return - Improved experience working with BI/query tools by returning metadata quickly</p></li>
+<li><p>Several hang-related fixes</p></li>
+<li><p>Parquet writer fixes for handling large datasets</p></li>
+<li><p>Stability improvements in ODBC and JDBC drivers</p></li>
+</ul>
+
+<p>Apache Drill 0.7.0 Key Notes and Limitations</p>
+
+<ul>
+<li>The current release supports in-memory and beyond-memory execution. However, you must disable memory-intensive hash aggregate and hash join operations to leverage this functionality.</li>
+<li>While the Drill execution engine supports dynamic schema changes during the course of a query, some operators, such as Sort, have yet to implement support for this behavior. Other operations, such as streaming aggregate, may have partial support that leads to unexpected results.</li>
+</ul>
+</div>
+
+
+<div id="footer" class="mw">
+<div class="wrapper">
+Copyright © 2012-2014 The Apache Software Foundation, licensed under the Apache License, Version 2.0.<br>
+Apache and the Apache feather logo are trademarks of The Apache Software Foundation. Other names appearing on the site may be trademarks of their respective owners.<br/><br/>
+</div>
+</div>
+
+<script>
+(function(i,s,o,g,r,a,m){i['GoogleAnalyticsObject']=r;i[r]=i[r]||function(){
+(i[r].q=i[r].q||[]).push(arguments)},i[r].l=1*new Date();a=s.createElement(o),
+m=s.getElementsByTagName(o)[0];a.async=1;a.src=g;m.parentNode.insertBefore(a,m)
+})(window,document,'script','//www.google-analytics.com/analytics.js','ga');
+
+ga('create', 'UA-53379651-1', 'auto');
+ga('send', 'pageview');
+</script>
+
+</body>
+</html>

Added: drill/site/trunk/content/drill/docs/apache-drill-contribution-guidelines/index.html
URL: http://svn.apache.org/viewvc/drill/site/trunk/content/drill/docs/apache-drill-contribution-guidelines/index.html?rev=1651949&view=auto
==============================================================================
--- drill/site/trunk/content/drill/docs/apache-drill-contribution-guidelines/index.html (added)
+++ drill/site/trunk/content/drill/docs/apache-drill-contribution-guidelines/index.html Thu Jan 15 05:11:44 2015
@@ -0,0 +1,256 @@
+<!DOCTYPE html>
+<html>
+
+<head>
+
+<meta charset="UTF-8">
+
+
+<title>Apache Drill Contribution Guidelines - Apache Drill</title>
+
+<link href="/css/syntax.css" rel="stylesheet" type="text/css">
+<link href="/css/style.css" rel="stylesheet" type="text/css">
+<link href="/css/arrows.css" rel="stylesheet" type="text/css">
+<link href="/css/button.css" rel="stylesheet" type="text/css">
+
+<link rel="shortcut icon" href="/favicon.ico" type="image/x-icon">
+<link rel="icon" href="/favicon.ico" type="image/x-icon">
+
+<script language="javascript" type="text/javascript" src="/js/lib/jquery-1.11.1.min.js"></script>
+<script language="javascript" type="text/javascript" src="/js/lib/jquery.easing.1.3.js"></script>
+<script language="javascript" type="text/javascript" src="/js/modernizr.custom.js"></script>
+<script language="javascript" type="text/javascript" src="/js/script.js"></script>
+
+</head>
+
+<body onResize="resized();">
+
+<div class="bui"></div>
+
+<div id="search">
+<input type="text" placeholder="Enter search term here">
+</div>
+
+<div id="menu" class="mw">
+<ul>
+  <li class="logo"><a href="/"></a></li>
+  <li>
+    <a href="/overview/">Documentation</a>
+    <ul>
+      <li><a href="/overview/">Overview&nbsp;&nbsp;&nbsp;&nbsp;</a></li>
+      <li><a href="https://cwiki.apache.org/confluence/display/DRILL/Apache+Drill+in+10+Minutes" target="_blank">Drill in 10 Minutes</a></li>
+      <li><a href="/why/">Why Drill? &nbsp;&nbsp;&nbsp;&nbsp;</a></li>
+      <li><a href="/architecture/">Architecture</a></li>
+    </ul>
+  </li>
+  <li>
+    <a href="/community/">Community</a>
+    <ul>
+      <li><a href="/team/">Team</a></li>
+      <li><a href="/community/#events">Events and Meetups</a></li>
+      <li><a href="/community/#mailinglists">Mailing Lists</a></li>
+      <li><a href="/community/#getinvolved">Get Involved</a></li>
+      <li><a href="https://issues.apache.org/jira/browse/DRILL/" target="_blank">Issue Tracker</a></li>
+      <li><a href="https://github.com/apache/drill" target="_blank">GitHub</a></li>
+    </ul>
+  </li>
+  <li><a href="/faq/">FAQ</a></li>
+  <li><a href="/blog/">Blog</a></li>
+  <li style="width:30px; padding-left: 2px; padding-right:10px"><a href="https://twitter.com/apachedrill" target="_blank"><img src="/images/twitterbw.png" alt="" align="center" width="22" style="padding: 0px 10px 1px 0px;"></a> </li>
+  <li class="l"><span>&nbsp;</span></li>
+  <li class="d"><a href="/download/">Download</a></li>
+</ul>
+</div>
+
+<div class="int_title">
+<h1>Apache Drill Contribution Guidelines</h1>
+
+</div>
+
+<div class="int_text" align="left"><ul>
+<li>Fixing JIRAs</li>
+<li>SQL functions </li>
+<li>Support for new file format readers/writers</li>
+<li>Support for new data sources</li>
+<li>New query language parsers</li>
+<li>Application interfaces
+
+<ul>
+<li>BI Tool testing</li>
+</ul></li>
+<li>General CLI improvements </li>
+<li>Eco system integrations
+
+<ul>
+<li>MapReduce</li>
+<li>Hive views</li>
+<li>YARN</li>
+<li>Spark</li>
+<li>Hue</li>
+<li>Phoenix</li>
+</ul></li>
+</ul>
+
+<h1 id="fixing-jiras">Fixing JIRAs</h1>
+
+<p>This is a good place to begin if you are new to Drill. Feel free to pick
+issues from the Drill JIRA list. When you pick an issue, assign it to
+yourself, inform the team, and start fixing it.</p>
+
+<p>For any questions, seek help from the team by sending email to <a href="mailto:drill-dev@incubator.apache.org">drill-dev@incubator.apache.org</a>.</p>
+
+<p><a href="https://issues.apache.org/jira/browse/DRILL/?selectedTab=com.atlassian.jira.jira-projects-plugin:summary-panel">https://issues.apache.org/jira/browse/DRILL/?selectedTab=com.atlassian.jira.jira-projects-plugin:summary-panel</a></p>
+
+<h1 id="sql-functions">SQL functions</h1>
+
+<p>One of the next simple places to start is implementing a DrillFunc. DrillFuncs
+are the way that Drill expresses all scalar functions (UDF or system). First, you can
+put together a JIRA for one of the DrillFuncs we don&#39;t yet have but should
+(referencing the capabilities of something like Postgres, SQL Server, or your
+own use case). Then try to implement one.</p>
+
+<p>One example DrillFunc:<br>
+<a href="https://github.com/apache/incubator-drill/blob/103072a619741d5e228fdb181501ec2f82e111a3/sandbox/prototype/exec/java-exec/src/main/java/org/apache/drill/exec/expr/fn/impl/ComparisonFunctions.java">https://github.com/apache/incubator-drill/blob/103072a619741d5e228fdb181501ec2f82e111a3/sandbox/prototype/exec/java-exec/src/main/java/org/apache/drill/exec/expr/fn/impl/ComparisonFunctions.java</a></p>
+
+<p><strong>Additional ideas on functions that can be added to Drill SQL support</strong></p>
+
+<ul>
+<li>Madlib integration</li>
+<li>Machine learning functions</li>
+<li>Approximate aggregate functions (such as what is available in BlinkDB)</li>
+</ul>
+
+<h1 id="support-for-new-file-format-readers/writers">Support for new file format readers/writers</h1>
+
+<p>Currently, Drill natively supports text, JSON, and Parquet file formats when
+interacting with the file system. More readers/writers can be introduced by
+implementing custom storage plugins. Example formats include the following:</p>
+
+<ul>
+<li>AVRO</li>
+<li>Sequence</li>
+<li>RC</li>
+<li>ORC</li>
+<li>Protobuf</li>
+<li>XML</li>
+<li>Thrift</li>
+<li>....</li>
+</ul>
+
+<h1 id="support-for-new-data-sources">Support for new data sources</h1>
+
+<p>Implement custom storage plugins for the following non-Hadoop data sources:</p>
+
+<ul>
+<li>NoSQL databases (such as Mongo, Cassandra, Couch, etc.)</li>
+<li>Search engines (such as Solr, Lucidworks, Elasticsearch, etc.)</li>
+<li>SQL databases (MySQL, PostgreSQL, etc.)</li>
+<li>Generic JDBC/ODBC data sources</li>
+<li>HTTP URL</li>
+<li>----</li>
+</ul>
+
+<h1 id="new-query-language-parsers">New query language parsers</h1>
+
+<p>Drill exposes strongly typed JSON APIs for logical and physical plans (plan
+syntax at <a href="https://docs.google.com/a/maprtech.com/document/d/1QTL8warUYS2KjldQrGUse7zp8eA72VKtLOHwfXy6c7I/edit#heading=h.n9gdb1ek71hf">https://docs.google.com/a/maprtech.com/document/d/1QTL8warUYS2KjldQrGUse7zp8eA72VKtLOHwfXy6c7I/edit#heading=h.n9gdb1ek71hf</a>). Drill provides a
+SQL language parser today, but any language parser that can generate
+logical/physical plans can use Drill&#39;s power on the backend as a distributed,
+low-latency query execution engine, along with its support for self-describing
+data and complex/multi-structured data.</p>
+
+<ul>
+<li>Pig parser: Use Pig as the language to query data from Drill. Great for existing Pig users.</li>
+<li>Hive parser: Use HiveQL as the language to query data from Drill. Great for existing Hive users.</li>
+</ul>
+
+<h1 id="application-interfaces">Application interfaces</h1>
+
+<p>Drill currently provides JDBC/ODBC drivers for applications to interact
+with it, along with a basic version of a REST API and a C++ API. The following list
+provides a few possible application interface opportunities:</p>
+
+<ul>
+<li>Enhancements to REST APIs (<a href="https://issues.apache.org/jira/browse/DRILL-77">https://issues.apache.org/jira/browse/DRILL-77</a>)</li>
+<li>Expose Drill tables/views as REST APIs</li>
+<li>Language drivers for Drill (Python, etc.)</li>
+<li>Thrift support</li>
+<li>....</li>
+</ul>
+
+<h2 id="bi-tool-testing">BI Tool testing</h2>
+
+<p>Drill provides JDBC/ODBC drivers to connect to BI tools. We need to make sure
+Drill works with all major BI tools. Running a quick sanity test with your
+favorite BI tool is a good way to learn Drill and to uncover any issues that
+prevent the two from working together.</p>
+
+<h1 id="general-cli-improvements">General CLI improvements</h1>
+
+<p>Currently, Drill uses SQLLine as the CLI. The goal of this effort is to improve
+the CLI experience by adding functionality such as executing statements from a
+file, writing results to a file, displaying version information, and so on.</p>
+
+<h1 id="eco-system-integrations">Eco system integrations</h1>
+
+<h2 id="mapreduce">MapReduce</h2>
+
+<p>Allow result sets from Drill queries to be used as input to Hadoop/MapReduce
+jobs.</p>
+
+<h2 id="hive-views">Hive views</h2>
+
+<p>Query data from existing Hive views using Drill queries. Drill needs to parse
+the HiveQL and translate it appropriately (into Drill&#39;s SQL or
+logical/physical plans) to execute the requests.</p>
+
+<h2 id="yarn">YARN</h2>
+
+<p><a href="https://issues.apache.org/jira/browse/DRILL-1170">https://issues.apache.org/jira/browse/DRILL-1170</a></p>
+
+<h2 id="spark">Spark</h2>
+
+<p>Provide the ability to invoke Drill queries as part of Apache Spark programs. This
+lets Spark developers and users leverage the richness of Drill&#39;s
+query layer, both for data source access and as a low-latency execution engine.</p>
+
+<h2 id="hue">Hue</h2>
+
+<p>Hue is a GUI for users to interact with various Hadoop ecosystem components
+(such as Hive, Oozie, Pig, HBase, and Impala). The goal of this project is to
+expose Drill as an application inside Hue so users can explore Drill metadata
+and run SQL queries.</p>
+
+<h2 id="phoenix">Phoenix</h2>
+
+<p>Phoenix provides a low latency query layer on HBase for operational
+applications. The goal of this effort is to explore opportunities for
+integrating Phoenix with Drill.</p>
+</div>
+
+
+<div id="footer" class="mw">
+<div class="wrapper">
+Copyright © 2012-2014 The Apache Software Foundation, licensed under the Apache License, Version 2.0.<br>
+Apache and the Apache feather logo are trademarks of The Apache Software Foundation. Other names appearing on the site may be trademarks of their respective owners.<br/><br/>
+</div>
+</div>
+
+<script>
+(function(i,s,o,g,r,a,m){i['GoogleAnalyticsObject']=r;i[r]=i[r]||function(){
+(i[r].q=i[r].q||[]).push(arguments)},i[r].l=1*new Date();a=s.createElement(o),
+m=s.getElementsByTagName(o)[0];a.async=1;a.src=g;m.parentNode.insertBefore(a,m)
+})(window,document,'script','//www.google-analytics.com/analytics.js','ga');
+
+ga('create', 'UA-53379651-1', 'auto');
+ga('send', 'pageview');
+</script>
+
+</body>
+</html>

Added: drill/site/trunk/content/drill/docs/apache-drill-documentation/index.html
URL: http://svn.apache.org/viewvc/drill/site/trunk/content/drill/docs/apache-drill-documentation/index.html?rev=1651949&view=auto
==============================================================================
--- drill/site/trunk/content/drill/docs/apache-drill-documentation/index.html (added)
+++ drill/site/trunk/content/drill/docs/apache-drill-documentation/index.html Thu Jan 15 05:11:44 2015
@@ -0,0 +1,92 @@
+<!DOCTYPE html>
+<html>
+
+<head>
+
+<meta charset="UTF-8">
+
+
+<title>Apache Drill Documentation - Apache Drill</title>
+
+<link href="/css/syntax.css" rel="stylesheet" type="text/css">
+<link href="/css/style.css" rel="stylesheet" type="text/css">
+<link href="/css/arrows.css" rel="stylesheet" type="text/css">
+<link href="/css/button.css" rel="stylesheet" type="text/css">
+
+<link rel="shortcut icon" href="/favicon.ico" type="image/x-icon">
+<link rel="icon" href="/favicon.ico" type="image/x-icon">
+
+<script language="javascript" type="text/javascript" src="/js/lib/jquery-1.11.1.min.js"></script>
+<script language="javascript" type="text/javascript" src="/js/lib/jquery.easing.1.3.js"></script>
+<script language="javascript" type="text/javascript" src="/js/modernizr.custom.js"></script>
+<script language="javascript" type="text/javascript" src="/js/script.js"></script>
+
+</head>
+
+<body onResize="resized();">
+
+<div class="bui"></div>
+
+<div id="search">
+<input type="text" placeholder="Enter search term here">
+</div>
+
+<div id="menu" class="mw">
+<ul>
+  <li class="logo"><a href="/"></a></li>
+  <li>
+    <a href="/overview/">Documentation</a>
+    <ul>
+      <li><a href="/overview/">Overview&nbsp;&nbsp;&nbsp;&nbsp;</a></li>
+      <li><a href="https://cwiki.apache.org/confluence/display/DRILL/Apache+Drill+in+10+Minutes" target="_blank">Drill in 10 Minutes</a></li>
+      <li><a href="/why/">Why Drill? &nbsp;&nbsp;&nbsp;&nbsp;</a></li>
+      <li><a href="/architecture/">Architecture</a></li>
+    </ul>
+  </li>
+  <li>
+    <a href="/community/">Community</a>
+    <ul>
+      <li><a href="/team/">Team</a></li>
+      <li><a href="/community/#events">Events and Meetups</a></li>
+      <li><a href="/community/#mailinglists">Mailing Lists</a></li>
+      <li><a href="/community/#getinvolved">Get Involved</a></li>
+      <li><a href="https://issues.apache.org/jira/browse/DRILL/" target="_blank">Issue Tracker</a></li>
+      <li><a href="https://github.com/apache/drill" target="_blank">GitHub</a></li>
+    </ul>
+  </li>
+  <li><a href="/faq/">FAQ</a></li>
+  <li><a href="/blog/">Blog</a></li>
+  <li style="width:30px; padding-left: 2px; padding-right:10px"><a href="https://twitter.com/apachedrill" target="_blank"><img src="/images/twitterbw.png" alt="" align="center" width="22" style="padding: 0px 10px 1px 0px;"></a> </li>
+  <li class="l"><span>&nbsp;</span></li>
+  <li class="d"><a href="/download/">Download</a></li>
+</ul>
+</div>
+
+<div class="int_title">
+<h1>Apache Drill Documentation</h1>
+
+</div>
+
+<div class="int_text" align="left"><p>The Drill documentation covers how to install, configure, and use Apache Drill.</p>
+</div>
+
+
+<div id="footer" class="mw">
+<div class="wrapper">
+Copyright © 2012-2014 The Apache Software Foundation, licensed under the Apache License, Version 2.0.<br>
+Apache and the Apache feather logo are trademarks of The Apache Software Foundation. Other names appearing on the site may be trademarks of their respective owners.<br/><br/>
+</div>
+</div>
+
+<script>
+(function(i,s,o,g,r,a,m){i['GoogleAnalyticsObject']=r;i[r]=i[r]||function(){
+(i[r].q=i[r].q||[]).push(arguments)},i[r].l=1*new Date();a=s.createElement(o),
+m=s.getElementsByTagName(o)[0];a.async=1;a.src=g;m.parentNode.insertBefore(a,m)
+})(window,document,'script','//www.google-analytics.com/analytics.js','ga');
+
+ga('create', 'UA-53379651-1', 'auto');
+ga('send', 'pageview');
+</script>
+
+</body>
+</html>


