incubator-cvs mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Apache Wiki <>
Subject [Incubator Wiki] Update of "September2013" by TedDunning
Date Wed, 11 Sep 2013 04:57:17 GMT
Dear Wiki user,

You have subscribed to a wiki page or wiki category on "Incubator Wiki" for change notification.

The "September2013" page has been changed by TedDunning:

Added Drill report.

- Drill
+ Apache: Project Drill
- Drill is a distributed system for interactive analysis of large-scale
- datasets, inspired by Google's Dremel.
+ Description:
+ Apache Drill is a distributed system for interactive analysis of large-scale datasets that
is based on Google's Dremel. Its goal is to efficiently process nested data, scale to 10,000
servers or more and to be able to process petabyes of data and trillions of records in seconds.
  Drill has been incubating since 2012-08-11.
- Three most important issues to address in the move towards graduation:
+ Three Issues to Address in Move to Graduation:
-   1. 
-   2. 
-   3. 
+ 1. Continue to attract new developers and and early users with a variety of skills and viewpoints
+ 2. Continue to develop deeper community skills and knowledge by building additional releases
+ 3. Demonstrate community robustness by rotating project tasks among
+ multiple project members
+ The community has made significant progress on items 1 and 2.
- Any issues that the Incubator PMC (IPMC) or ASF Board wish/need to be
- aware of?
+ Issues to Call to Attention of PMC or ASF Board:
+ none
- How has the community developed since the last report?
+ How community has developed since last report:
+ The most important activity is the run up to the Milestone 1 release.
+ Additional events include:
+ * Apache Drill project website redesigned to have a new look:
+ * Interactive "How to Run Drill" demo added to the Apache Drill wiki:
+ Mailing list discussions:
+ Subscriptions to the Drill mailing lists have risen to 383. There has been active and increasing
participation in discussions on the developer mailing list, including new participants and
developers. Participation on the user list is growing although still small; mainly activity
takes place on developer mailing list.
+ Activity summary for the dev mailing list:
+ September to date 05/010/2013: 397(mainly jira; some discussion)
+ August 2013, 394  (jira, focused discussions)
+ July 2013,370  (jira; focused discussions)
+ June 2013,297 (jira, focused discussions)
+ Recent topics on the dev mailing list have included:
+ * Usability and introductory tutorials
+ * SQL semantics and extensions to type inference cases
+ * Implementation of various storage engines, including Parquet and ORC.
+ * Optimizer rewrites and operator implementations.
+ Code 
+ For details of code commits, see
+ There has been a very significant ramp up of code commits during this quarter, as shown
in this chart from GitHub:
+ Ten contributors have participated in this GitHub code activity; there have been 77 forks
of the Apache Drill project on GitHub which is a good indicator of strong interest outside
the group of core contributors.
+ Recent code commits include but not limited to:
+ * full end-to-end execution of queries
+ * reorganization of the source tree to simplify initial user experience
+ * a number of new operators for the execution engine
+ * a pro tempore query optimizer that allows a physical plans to be generated
+ * the entire code generation framework
+ * Value Vector implementation
+ Community Interactions
+ The weekly Drill hangout continues, conducted remotely through Google hangouts Tuesday mornings
9am Pacific Time to keep core developers in contact in realtime despite geographical separation.
+ The community stays in touch through @ApacheDrill Twitter ID, and by postings on various
blogs including Apache Drill User
+ Viability of community is also apparent through interest in next meet-up event for the Bay
Area Apache Drill User group in late September, which is already attracting a robust audience.
Volunteers are coming forward from audience members of presentations, such as the Drill workshop
in July (see following).
+ Presentations
+ There have been presentations and a Drill workshop from community members at conferences
and meet-ups.
+ Several Drill contributors have other talks scheduled with different meetups in the upcoming
+ Sample presentations (out of many):
+ * Drill talks by @mhausenblas at Hive London and in Paris in June
+ * Talk on Apache Drill by @mhausenblas and @ted_dunning at Berlin Buzzwords
+ * Apache Drill hands-on workshop by @ted_dunning and @intjesus at OSCON in Portland, Oregon
USA in July for ~40 participants. 
+ * Apache Drill project featured by panelist @tshiran in Aug for the "Hadoop + SQL" Hive
Data Think Tank event in California Bay Area. 
+ * Next meeting for the Bay Area Apache Drill User group is planned for September with talk
and demo by Steve Phillips 
+ Slides
+ Slides from Drill presentations posted online such as at slideshare get a large number of
+ Example:
+ OSCON Apache Drill workshop posted 1 Aug 2013 by Ted Dunning and Jacques Nadeau, 436 views.
+ Articles
+ Examples of articles on Apache Drill since last report include:
+ * Article by @mhausenblas and @intjesus "Introduction to Apache Drill: Interactive Ad-Hoc
Query for Large-scale Datasets"  Michael Hausenblas and Jacques Nadeau. Big Data. June 2013,
1(2): 100-104. doi:10.1089/big.2013.0011.
+ * A blog post by @Ellen_Friedman reports on that Drill-via-Amazon-Cloud event and includes
links to slides:
+ * Drill  blog article by S. J. Vaughan-Nichols "Drilling into Big Data with Apache Drill"
in Aug:
+ * A blog posting on Drill by T. Shiran as a prelude to the Hadoop + SQL event by Hive Data
Think Tank can be found here:
+ Social Networking
+ @ApacheDrill Twitter entity is active and has grown by ~20%, to 437 followers.
- How has the project developed since the last report?
+ How project has developed since last report:
+ 1.	Website homepage has a new design
+ 2.	Wiki has been updated 
+ 3.	Significant code drops have been checked in from a number of developers
+ 4.	Started to create release candidates for the milestone one [first] release of Drill
+ 5.	New developers are contributing.
+ 6.	Additional non-code contributors have become active and are being encouraged
+ Please check this [x] when you have filled in the report for Drill.
- Date of last release: 
- When were the last committers or PMC members elected?
@@ -241, +336 @@

  There is a lot of activity but it does not seem to include any drive to make any release.
  There is tremendous community building activity.
  Heavy on community and light on Release/IP.
+ Rebuttal to shepherd notes: (Ted Dunning/tdunning)
+ I think that the Dave is substantially incorrect in his assessment that there "does not
seem to include any drive to make any release".  This is factually and patently incorrect.
 I have asked that the shepherd reconsider this comment and retract it as it does not reflect
the substantial progress that the community is making towards a release.

To unsubscribe, e-mail:
For additional commands, e-mail:

View raw message