Return-Path: X-Original-To: apmail-incubator-general-archive@www.apache.org Delivered-To: apmail-incubator-general-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 0E27F17C53 for ; Thu, 21 May 2015 16:59:29 +0000 (UTC) Received: (qmail 63257 invoked by uid 500); 21 May 2015 16:59:28 -0000 Delivered-To: apmail-incubator-general-archive@incubator.apache.org Received: (qmail 63063 invoked by uid 500); 21 May 2015 16:59:28 -0000 Mailing-List: contact general-help@incubator.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: general@incubator.apache.org Delivered-To: mailing list general@incubator.apache.org Received: (qmail 63050 invoked by uid 99); 21 May 2015 16:59:28 -0000 Received: from mail-relay.apache.org (HELO mail-relay.apache.org) (140.211.11.15) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 21 May 2015 16:59:28 +0000 Received: from mail-wi0-f175.google.com (mail-wi0-f175.google.com [209.85.212.175]) by mail-relay.apache.org (ASF Mail Server at mail-relay.apache.org) with ESMTPSA id D0C831A0324 for ; Thu, 21 May 2015 16:59:27 +0000 (UTC) Received: by wicmx19 with SMTP id mx19so18944606wic.0 for ; Thu, 21 May 2015 09:59:26 -0700 (PDT) X-Received: by 10.194.203.138 with SMTP id kq10mr6934870wjc.124.1432227566590; Thu, 21 May 2015 09:59:26 -0700 (PDT) MIME-Version: 1.0 Received: by 10.28.89.131 with HTTP; Thu, 21 May 2015 09:58:46 -0700 (PDT) In-Reply-To: References: From: Arvind Prabhakar Date: Thu, 21 May 2015 09:58:46 -0700 Message-ID: Subject: Re: [VOTE] Accept Trafodion into Apache Incubator To: general@incubator.apache.org Content-Type: multipart/alternative; boundary=047d7b86deb258205405169a776d --047d7b86deb258205405169a776d Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: quoted-printable +1 (binding) Regards, Arvind Prabhakar On Tue, May 19, 2015 at 2:27 PM, Stack wrote: > Following the discussion earlier in the thread [1], I would like to call = a > VOTE to accept Trafodion as a new Apache Incubator project. > > The proposal is available on the wiki at [2] and is also attached to this > mail. > > The VOTE is open for at least the next 72 hours: > > [ ] +1 accept Trafodion into the Apache Incubator > [ ] =C2=B10 Abstain > [ ] -1 because... > > I am +1 (binding) > > Thank you, > St.Ack > > 1. > > http://mail-archives.apache.org/mod_mbox/incubator-general/201505.mbox/%3= CCADcMMgG4NHtmFZ519iqgZLA8Lj-E7VmaQ%3Dr8C011LuS5pR0Vkw%40mail.gmail.com%3E > 2. https://wiki.apache.org/incubator/TrafodionProposal > > > > > Trafodion Apache Incubator Proposal > > Abstract > > Trafodion is a webscale SQL-on-Hadoop solution enabling transactional or > operational workloads on Hadoop. > > Proposal > > Apache Trafodion builds on the scalability, elasticity, and flexibility o= f > Hadoop. Trafodion extends Hadoop to provide guaranteed transactional > integrity, enabling new kinds of big data applications to run on Hadoop. > Key > features of Apache Trafodion include: > > * Full-functioned ANSI SQL language support > * JDBC/ODBC connectivity for Linux/Windows clients > * Distributed ACID transaction protection across multiple statements, > tables and rows > * Performance improvements for OLTP workloads with compile-time and > run-time optimizations > * Support for large data sets using a parallel-aware query optimizer > * ANSI SQL security and data integrity constraints including referential > integrity > > Hewlett-Packard Company submits this proposal to donate its Apache Licens= e, > Version 2.0 open source project known as Trafodion, its source code, > documentation, and web site content to the Apache Software Foundation in > order to build an open source community > > Background > > Trafodion is an open source project sponsored by HP, incubated at HP Labs > and HP-IT, to develop an enterprise-class SQL-on-Hadoop solution targetin= g > big data transactional or operational workloads. HP publically announced > the open source project and uploaded the source code to GitHub in June > 2014. > > The SQL compiler, optimizer and executor components of Trafodion have a > rich heritage. Under development since 1993, they were released as > commercial closed source software in various flavors such as HP NonStop > SQL/MX and HP Neoview. NonStop SQL/MX was designed for online transaction > processing on HP=E2=80=99s NonStop (formerly Tandem) fault-tolerant serve= rs and is > known for its high availability, scalability, and performance. Hundreds o= f > companies and thousands of servers are running mission-critical > applications today on NonStop SQL/MX. In addition, much of these componen= ts > today are running internal to HP as the core of its Enterprise Data > Warehouse (EDW), managing over a PB of data. > > Starting in 2013, the software was modified to run on HBase and a new > distributed transaction manager was written to run as an HBase > co-processor. > > Unlike most NOSQL and other SQL-on-Hadoop open source projects, Trafodion > provides comprehensive ANSI SQL language support including full-functione= d > data definition (DDL), data manipulation (DML), transaction control (TCL) > and database utility support. > > Trafodion provides comprehensive and standard SQL data manipulation suppo= rt > including SELECT, INSERT, UPDATE, DELETE, and UPSERT/MERGE syntax with > language options including join variants, unions, where predicates, > aggregations (group by and having), sort ordering, sampling, correlated a= nd > nested sub-queries, cursors, and many SQL functions. > > Utilities are provided for updating table statistics used by the optimize= r > for costing (i.e. selectivity/cardinality estimates) plan alternatives, f= or > displaying the chosen SQL execution plan, plan shaping, backup and > restoring the database, data loading and unloading, and a command line > utility for interfacing with the database engine. > > Explicit control statements are provided to allow applications to define > transaction boundaries and to abort transactions when warranted, includin= g > BEGIN WORK, COMMIT WORK, ROLLBACK WORK and SET TRANSACTION. > > Trafodion supports ANSI=E2=80=99s grant/revoke semantics to define user a= nd role > privileges in terms of managing and accessing the database objects. > > Rationale > > The name =E2=80=9CTrafodion=E2=80=9D (the Welsh word for transactions, pr= onounced > =E2=80=9CTra-vod-eee-on=E2=80=9D) was chosen specifically to emphasize th= e differentiation > that Trafodion provides in closing a critical gap in the Hadoop ecosystem= . > Trafodion builds on the scalability, elasticity, and flexibility of Hadoo= p. > Trafodion extends Hadoop to provide guaranteed transactional integrity, > enabling new kinds of big data applications to run on Hadoop. > > Current Status > > HP released the Trafodion code under the Apache License, Version 2, in Ju= ne > of 2014. Since that time, we have had one major release in January 2015 a= nd > one minor release in April 2015. The focus of these releases has been in > getting our base functionality, including security, working on top of > Apache HBase, as well as improving performance, availability and > scalability, and integrating better with HBase. > > Meritocracy > > We want to build a diverse developer community, based on the Apache Way, > around Trafodion. To help developers become contributors, we have > documentation on the wiki about the architecture, the source tree > structure, and an example enhancement. We plan to publish our project > backlog to the community, specifically highlighting areas where developer= s > new to Trafodion may best start contributing, such as extending the > database functionality with User Defined Routines (UDRs) and integrating > with other Apache projects in the Hadoop ecosystem. > > Community > > We have already begun building a community but at this time the community > consists only of Trafodion developers =E2=80=93 all HP employees =E2=80= =93 and prospective > users. We have participated in and hosted HBase Meetups and intend to ram= p > up our community building efforts. > > The Trafodion project has seen interest in China, where HP has conducted > proof-of-concepts with multiple companies and expects to see some of its > first commercial deployments. To help recruit contributors and users in > China, members of the team are translating Trafodion wiki content into > Mandarin. > > Core Developers > > The core developers are very experienced in database and transaction > monitor technology, with many having spent more than 20 years working in > this space. > > Alignment > > Apache Trafodion relies on Apache HBase as its storage engine. The > development team has collaborated with and gained valuable advice from > working with the Apache HBase core developers. Apache Trafodion has > federation capabilities as well, and can query Trafodion tables stored in > HBase, native HBase tables, and Apache Hive tables. > > Known Risks > > Orphaned Products > > HP Labs and HP-IT have been incubating Trafodion development for almost t= wo > years. This is part of HP=E2=80=99s strategy to leverage its investment i= n database > software and bring software to market as open source and is similar to HP= =E2=80=99s > efforts with OpenStack. Trafodion builds on HP=E2=80=99s equity investmen= t in the > Hadoop ecosystem and its efforts to monetize Hadoop through hardware, > software, and services. HP wants Trafodion to be successful, as HP will > offer a commercially supported distribution of Trafodion. > > Inexperience with Open Source > > We have been working with open source software in building closed source > software for well over two decades. To help transition to doing open sour= ce > development, the development team received guidance and best practices fr= om > HP developers working on OpenStack open source projects, many of whom hav= e > experience working on Apache and other open source projects as well. Sinc= e > releasing Trafodion as an open source project in June of 2014, the > committers and contributors have moved forward using open source > development processes and tools for bug tracking and design blueprints an= d > Jenkins for continuous integration. As part of the incubation process, we > recognize we may need to change some of our development processes/tools a= nd > conduct our discussions using Apache email dlists. > > Homogenous Developers > > Since the initial development of Trafodion has been supported by HP, all = of > the current developers are HP employees. Through the support of the Apach= e > incubation project, we aim to expand the list of developers and gain > contributors from related SQL-on-Hadoop projects and the Apache HBase > project. Trafodion developers are experienced with distributed developmen= t > processes, being primarily based in Palo Alto, CA; Austin, TX; and > Shanghai, China. Trafodion is written in C++ and Java. > > Reliance on Salaried Developers > > Currently all of the developers working on the project are paid by their > employer to work on the project. These developers will work on the open > source project as well as work on the commercially supported distribution > of Trafodion that HP will offer. > > Relationship with Other Apache Products > > Trafodion is built upon Apache HBase and extends it to support ACID > transactions with HBase co-processors for distributed transaction > management and recovery. Trafodion envisions future collaborations with t= he > Apache HBase project on performance optimizations, such as in the areas o= f > mixed workload support, High Availability, etc. It also provides > transactional support and querying from native HBase tables as well. > > Trafodion uses Apache Zookeeper to coordinate and manage the distribution > of connection services across the cluster for load-balancing and high > availability reconnection purposes in the event a Trafodion process shoul= d > fail. > > Trafodion also envisions working with the Apache Ambari project on enabli= ng > better Trafodion manageability. While Ambari focuses on system and > component level performance metrics, Trafodion manageability will focus i= n > a complimentary way on database workload monitoring and performance > analytics with capabilities more geared towards database administrators. > > There are alternative open source projects that are providing SQL-on-Hado= op > capabilities, such as Apache Hive, Apache Drill, and Apache Phoenix. Thes= e > are more focused on reporting and analytics across data structures > supported on HDFS. In comparison to all of these technologies Trafodion > provides a very complete implementation of ANSI SQL, one of the most > sophisticated optimizers for such workloads, a completely parallel data > flow architecture that does not materialize intermediate results unless > necessary, full ACID transactional support, ANSI GRANT/REVOKE security, a= nd > other capabilities that would take decades to build in these products. On > the other hand currently Trafodion is just focused on HBase and querying > Hive, whereas Hive and Drill provide access to other data formats in HDFS= . > > An Excessive Fascination with the Apache Brand > > We understand the reputation and value of the Apache brand, and no doubt > believe that it will help us attract contributors and users. Our primary > goal is to follow a proven, open source development and community buildin= g > model that will make Trafodion successful and enable better collaboration > with other Apache projects in the Hadoop ecosystem. We also understand th= e > rules and guidelines about the use of the Apache brand and intend to foll= ow > them. > > Documentation > > Documentation and technical details on Trafodion can be found at: > http://www.trafodion.org/ > > Initial Source > > The source is available today in a public github repository: > https://github.com/trafodion/trafodion. > > Source and Intellectual Property Submission Plan > > The source code has already been released under the Apache License, Versi= on > 2. The manuals have been released in Adobe PDF format. As part of the > submission process, the source for the manuals will be converted from a > proprietary DocBook XML format to AsciiDoc. > > External Dependencies > > Two dependencies do not have Apache compatible licenses and will be > addressed as we enter incubation. One dependency is log4cpp, which is > licensed under the LGPL. A compatible alternative might be Apache incubat= or > project log4cxx. The other dependency is unixodbc, which is used as the > ODBC driver manager. We will look into how Apache Hive manages being able > to use this incompatible software and do similar. All other dependencies > have Apache compatible licenses, including Apache 2.0, MIT/X11, MIT, and > BSD. > > Cryptography > > Trafodion does not contain any cryptographic code. It does call > cryptographic libraries: OpenSSL for C++ code and Java Cryptography > Extension (JCE) for Java code. > > Required Resources > > Mailing Lists > > private@trafodion.incubator.apache.org > dev@trafodion.incubator.apache.org commits@trafodion.incubator.apache.org > > Git Repository > > https://git-wip-us.apache.org/repos/afs/incubator-trafodion.git > > Issue Tracking > > JIRA: JIRA Trafodion (Trafodion) > > > Initial Committers and Affiliation > > Dave Birdsall, Hewlett-Packard Company, Dave.Birdsallhpcom > Matt Brown, Hewlett-Packard Company, mattbrownhpcom > Tharak Capirala, Hewlett-Packard Company, Tharak.Capiralahpcom > Alice Chen, Hewlett-Packard Company, Alice.Chenhpcom > John DeRoo, Hewlett-Packard Company, John.Deroohpcom > Roberta Marton, Hewlett-Packard Company, Roberta.Martonhpcom > Amanda Moran, Hewlett-Packard Company, Amanda.Kay.Moranhpcom > Suresh Subbiah, Hewlett-Packard Company, Suresh.Subbiahhpcom > Sandyha Sundaresan, Hewlett-Packard Company, > Sandhya.Sundaresanhpcom > > Sponsors > > Champion > > Michael Stack, Stackapacheorg > > Nominated Mentors > > Andrew Purtell apurtellapacheorg > Devaraj Das, ddasapacheor > Enis S=C3=B6ztutar, Enisapacheorg > Lars Hofhansl, larshapacheorg > Michael Stack, Stackapacheorg > Roman Shaposhnik, rshaposhnikpivotalio > > Sponsoring Entity > > Apache Incubator PMC > --047d7b86deb258205405169a776d--