Return-Path: X-Original-To: apmail-incubator-general-archive@www.apache.org Delivered-To: apmail-incubator-general-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id F26C010C7B for ; Thu, 17 Sep 2015 17:29:33 +0000 (UTC) Received: (qmail 59348 invoked by uid 500); 17 Sep 2015 17:29:33 -0000 Delivered-To: apmail-incubator-general-archive@incubator.apache.org Received: (qmail 59099 invoked by uid 500); 17 Sep 2015 17:29:33 -0000 Mailing-List: contact general-help@incubator.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: general@incubator.apache.org Delivered-To: mailing list general@incubator.apache.org Received: (qmail 58977 invoked by uid 99); 17 Sep 2015 17:29:32 -0000 Received: from Unknown (HELO spamd2-us-west.apache.org) (209.188.14.142) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 17 Sep 2015 17:29:32 +0000 Received: from localhost (localhost [127.0.0.1]) by spamd2-us-west.apache.org (ASF Mail Server at spamd2-us-west.apache.org) with ESMTP id 7FCD11A2224 for ; Thu, 17 Sep 2015 17:29:32 +0000 (UTC) X-Virus-Scanned: Debian amavisd-new at spamd2-us-west.apache.org X-Spam-Flag: NO X-Spam-Score: 4.417 X-Spam-Level: **** X-Spam-Status: No, score=4.417 tagged_above=-999 required=6.31 tests=[DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, HTML_MESSAGE=3, HTTP_EXCESSIVE_ESCAPES=1.516, URIBL_BLOCKED=0.001] autolearn=disabled Authentication-Results: spamd2-us-west.apache.org (amavisd-new); dkim=pass (2048-bit key) header.d=gmail.com Received: from mx1-eu-west.apache.org ([10.40.0.8]) by localhost (spamd2-us-west.apache.org [10.40.0.9]) (amavisd-new, port 10024) with ESMTP id OPCk_dfARYtj for ; Thu, 17 Sep 2015 17:29:18 +0000 (UTC) Received: from mail-wi0-f174.google.com (mail-wi0-f174.google.com [209.85.212.174]) by mx1-eu-west.apache.org (ASF Mail Server at mx1-eu-west.apache.org) with ESMTPS id 510D8204C8 for ; Thu, 17 Sep 2015 17:29:18 +0000 (UTC) Received: by wicgb1 with SMTP id gb1so477767wic.1 for ; Thu, 17 Sep 2015 10:29:18 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=mime-version:in-reply-to:references:date:message-id:subject:from:to :content-type; bh=bflC3IP+vY87FGc/896seXJGqR3D3Audkcafonfft4k=; b=h6PWVFbaESXe8SRI0X2MM0ytR0Wdi90B2sintiAK5D/9y9GziXpOb78QhKvmAAUoTc kGQPC/Z5mzkim6vWp3nKkD73Q8whQYJcSL94SvPYDxobOo37YQpmx5gDnXubpLjD/Mut XJQoXuf50THVLHO962cyDngCes/ZBhOZA9WmDZ21VJVP6OGG8Na//1+SlpwVB++rReLB CI3Wq4AX9ddpLsn7ZPcnTqhU9kJcGNPOlpAyS3Ng2Nj4diPJEXBThDXaEg0QIxd5uIXq SguwvHwVIxyMXQJu8yskLu/uQP4Osn5MSSLoJlqc5VpnZ18bQeQPZW4mliiMJqjbbIUb bVpg== MIME-Version: 1.0 X-Received: by 10.180.108.175 with SMTP id hl15mr9991253wib.1.1442510957992; Thu, 17 Sep 2015 10:29:17 -0700 (PDT) Received: by 10.27.130.85 with HTTP; Thu, 17 Sep 2015 10:29:17 -0700 (PDT) In-Reply-To: References: Date: Thu, 17 Sep 2015 13:29:17 -0400 Message-ID: Subject: Re: [VOTE] Accept Rya into the Apache Incubator From: Phillip Rhodes To: general@incubator.apache.org Content-Type: multipart/alternative; boundary=e89a8f3ba55f3c5fb4051ff4c17b --e89a8f3ba55f3c5fb4051ff4c17b Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: quoted-printable +1 This message optimized for indexing by NSA PRISM On Thu, Sep 17, 2015 at 1:22 PM, Seetharam Venkatesh < venkatesh@innerzeal.com> wrote: > +1, the proposal looks very interesting. We have a metadata and governanc= e > solution built in Apache Atlas (http://atlas.incubator.apache.org) and us= e > an array of technologies to store the relationships using Titan and HBase= . > This will be of interest to Atlas community. > > If you are still looking for mentoring, I can volunteer and help. > > Thanks, > Venkatesh > > On Thu, Sep 17, 2015 at 10:03 AM Sean Busbey wrote: > > > My apologies, I was on vacation and missed the start of this thread. > > > > late +1 (binding) > > > > On Wed, Sep 16, 2015 at 8:01 AM, Adina Crainiceanu > wrote: > > > > > +1 of course :) I'm very excited at the prospect of joining the Apach= e > > > community! > > > > > > > > > --Adina Crainiceanu > > > > > > On Mon, Sep 14, 2015 at 11:17 AM, Adam Fuchs > wrote: > > > > > > > Thanks again for the healthy discussion on Rya. With that, I would > like > > > to > > > > call a VOTE for accepting Rya as a new incubator project. > > > > > > > > The proposal text is included below, and is posted on the wiki here= : > > > > https://wiki.apache.org/incubator/RyaProposal > > > > > > > > The discussion thread on Rya starts here: > > > > > > > > > > > > > > http://mail-archives.apache.org/mod_mbox/incubator-general/201509.mbox/%3= CCALt5_xJKtRcUr3WGjfrY77DYWF0-8DWi%3DzyS7hrMFTg%2BYAORjQ%40mail.gmail.com%3= E > > > > > > > > The vote will be open until Thu Sep 17 15:15:00 UTC 2015. > > > > > > > > [ ] +1 accept Rya in the Incubator > > > > [ ] =C2=B10 > > > > [ ] -1 because... > > > > > > > > Thanks, > > > > Adam > > > > > > > > > > > > =3D Rya Proposal =3D > > > > =3D=3D Abstract =3D=3D > > > > Rya (pronounced "ree-uh" /r=C4=93=C9=99/) is a cloud-based RDF trip= le store > that > > > > supports SPARQL queries. > > > > > > > > =3D=3D Proposal =3D=3D > > > > Rya is a scalable RDF data management system built on top of > Accumulo. > > > Rya > > > > uses novel storage methods, indexing schemes, and query processing > > > > techniques that scale to billions of triples across multiple nodes. > Rya > > > > provides fast and easy access to the data through SPARQL, a > > conventional > > > > query mechanism for RDF data. > > > > > > > > =3D=3D Background =3D=3D > > > > RDF is a World Wide Web Consortium (W3C) standard used in describin= g > > > > resources on the Web. The smallest data unit is a triple consisting > of > > > > subject, predicate, and object. Using this framework, it is very ea= sy > > to > > > > describe any resource, not just Web related. For example, if you wa= nt > > to > > > > say that Alice is a professor, you can represent this as an RDF > triple > > > like > > > > (Alice, rdf:type, Professor). In general, RDF is an open world > > framework > > > > that allows anyone to make any statement about any resource, which > > makes > > > it > > > > a popular choice for expressing a large variety of data. > > > > > > > > RDF is used in conjunction with the Web Ontology Language (OWL). OW= L > > is a > > > > framework for describing models or ontologies for RDF. It defines > > > concepts, > > > > relationships, and/or structure of RDF documents. These models can = be > > > used > > > > to 'reason/infer' information about entities within a given domain. > For > > > > example, you can express that a Professor is a sub class of Faculty= , > > > > (Professor, rdfs:subClassOf, Faculty) and knowing that (Alice, > > rdf:type, > > > > Professor), it can be inferred that (Alice, rdf:type, Faculty). > > > > > > > > SPARQL is an RDF query language. Similar with SQL, SPARQL has SELEC= T > > and > > > > WHERE clauses; however, it is based on querying and retrieving RDF > > > triples. > > > > > > > > Work on Rya, a large scale distributed system for storing and > querying > > > RDF > > > > data, started in 2010. > > > > > > > > =3D=3D Rationale =3D=3D > > > > With the increase in data size, there is a need for scalable system= s > > for > > > > storing and retrieving RDF data in a cluster of nodes. We believe > that > > > Rya > > > > can fulfill that role. We expect that communities within government= , > > > health > > > > care, finance, and others who generate large amounts of RDF data wi= ll > > be > > > > most interested in this project. > > > > > > > > From its inception, the project operated with an Apache-style > license, > > > but > > > > it was open to mostly US government-related projects only. We belie= ve > > > that > > > > having the project and the development open for all will benefit bo= th > > the > > > > project and the interested communities. > > > > > > > > =3D=3D Current Status =3D=3D > > > > The project source code and documentation are currently hosted in a > > > private > > > > repository on Github. New users are added to the repository upon > > request. > > > > > > > > =3D=3D=3D Meritocracy =3D=3D=3D > > > > Meritocracy is the model that we currently follow, and we want to > > build a > > > > larger and more diverse developer community by becoming an Apache > > > project. > > > > > > > > =3D=3D=3D Community =3D=3D=3D > > > > Rya has being building a community of users and developers for the > > past 3 > > > > years. There is currently an active workgroup with monthly meetings > and > > > the > > > > number of participants in the meeting is increasing. > > > > > > > > =3D=3D=3D Core Developers =3D=3D=3D > > > > The core developers are a diverse group of people who are either > > > government > > > > employees or former / current government contractors from different > > > > companies. > > > > > > > > =3D=3D=3D Alignment =3D=3D=3D > > > > Rya is built on top of Accumulo, an Apache project. > > > > > > > > =3D=3D Known Risks =3D=3D > > > > =3D=3D=3D Orphaned Products =3D=3D=3D > > > > There is a very small risk of becoming orphaned. The current > > contributors > > > > are strongly committed to the project, there is a large enough numb= er > > of > > > > developers interested in contributing to the project, and we believ= e > > that > > > > the support for the project will continue to grow from the interest= ed > > > > communities. > > > > > > > > =3D=3D=3D Inexperience with Open Source =3D=3D=3D > > > > The initial committers have various degrees of experience with open > > > source > > > > projects - from very new to experienced. This project was open sour= ce > > > > within government from the beginning. We are aware that it will be > > > > different and more difficult functioning in a real open source > > > environment. > > > > We are enthusiastic and committed to learning the Apache way and > being > > > > successful in operating under Apache's development process. > > > > > > > > =3D=3D=3D Homogenous Developers =3D=3D=3D > > > > The current list of developers form a heterogeneous group, with > people > > > for > > > > academia, government, and industry, collaborating from distributed > > > > geographic locations. We aim to expand the list of contributors wit= h > > the > > > > help of the Apache incubation process. > > > > > > > > =3D=3D=3D Reliance on Salaried Developers =3D=3D=3D > > > > Many but not all of the developers working on the project are > salaried > > > > employees, paid to work on this project. They will continue to > > contribute > > > > to the open source project. Some of the initial committers continue= d > as > > > > volunteers even if no longer employed to work on this project and > they > > > plan > > > > to continue supporting the project. > > > > > > > > =3D=3D=3D Relationships with Other Apache Products =3D=3D=3D > > > > Rya uses Apache Accumulo, Hadoop, Zookeeper, Maven. > > > > > > > > *Apache Jena API or Apache Commons RDF API could become the RDF AP= I > > used > > > > by Rya, but such a decision was not made. > > > > *Apache Clerezza is database/triple store agnostic, and as such > could > > be > > > > complementary to Rya. > > > > *Apache Stanbol focuses on providing semantic services, while Rya > > > focuses > > > > on providing a distributed triple store solution, with support for > > SPARQL > > > > and OWL reasoning. > > > > *Apache Marmotta provides an implementation of a Linked Data > Platform, > > > and > > > > overlaps in some of the goals and functionality with Rya (RDF tripl= e > > > store, > > > > SPARQL support among others). There are many opportunities for > > > > collaboration with these projects and we are looking forward to suc= h > a > > > > collaboration. > > > > > > > > =3D=3D=3D Apache Brand =3D=3D=3D > > > > Rya has generated interest in the government. It also generated > > interest > > > > within academia and industry. We believe that everyone could benefi= t > > from > > > > having Rya as an open source project. Due to its strong ties to > > Accumulo, > > > > an Apache project, and due to the values of the Apache Foundation, = we > > > > believe that Apache incubator is the right place for Rya. > > > > > > > > =3D=3D Documentation =3D=3D > > > > Two peer-reviewed publications [1,2] about Rya were published in 20= 12 > > and > > > > 2015. More documentation is available in the code. > > > > > > > > [1] Roshan Punnoose, Adina Crainiceanu, David Rapp. [[ > > > > > > > > > > > > > > http://www.usna.edu/Users/cs/adina/research/Rya%5FCloudI%32%30%31%32.pdf|= Rya > > < > http://www.usna.edu/Users/cs/adina/research/Rya%5FCloudI%32%30%31%32.pdf%= 7CRya > > > > > < > > > http://www.usna.edu/Users/cs/adina/research/Rya%5FCloudI%32%30%31%32.pdf%= 7CRya > > > > > > > : > > > > A Scalable RDF Triple Store for the Clouds]]. Proceedings of the 1s= t > > > > International Workshop on Cloud Intelligence, Pages 4:1-4:8, August > > 2012 > > > > > > > > [2] Roshan Punnoose, Adina Crainiceanu, David Rapp. [[ > > > > > > http://www.usna.edu/Users/cs/adina/research/Rya_ISjournal2013.pdf|SPARQ= L > > > > in > > > > the Clouds Using Rya]]. Information Systems, Volume 48, Pages > 181-195, > > > > March 2015 (Available online 23 July 2013) > > > > > > > > =3D=3D Initial Source =3D=3D > > > > The code is currently in a private Github repository, due to securi= ty > > and > > > > IP review processes. We intend to open it up via transferring the > code > > to > > > > an ASF repository. > > > > > > > > =3D=3D Source and Intellectual Property Submission Plan =3D=3D > > > > The source code has been released under the Apache License, Version > 2. > > > > Software grant, and CCLAs have been submitted. ICLAs for initial > > > committers > > > > have been submitted or are in progress. > > > > > > > > =3D=3D External Dependencies =3D=3D > > > > * [[http://rdf4j.org|OpenRDF Sesame]] (BSD license) > > > > * [[http://www.geomesa.org/|GeoMesa]] (Apache License, Version 2.0= ) > > > > * [[https://accumulo.apache.org/|Accumulo]] (Apache License, > Version > > > 2.0) > > > > * [[https://hadoop.apache.org/|Hadoop]] (Apache License, Version > 2.0) > > > > * [[https://pig.apache.org/|Pig]] (Apache License, Version 2.0) > > > > * [[http://tinkerpop.incubator.apache.org/|TinkerPop]] (Apache > > License, > > > > Version 2.0) > > > > > > > > =3D=3D Cryptography =3D=3D > > > > The proposal does not involve any cryptographic code. > > > > > > > > =3D=3D Required Resources =3D=3D > > > > =3D=3D=3D Mailing lists =3D=3D=3D > > > > * private@rya.incubator.apache.org > > > > * dev@rya.incubator.apache.org > > > > * commits@rya.incubator.apache.org > > > > > > > > =3D=3D=3D Git Repository =3D=3D=3D > > > > https://git-wip-us.apache.org/repos/asf/incubator-rya.git > > > > > > > > =3D=3D=3D Issue Tracking =3D=3D=3D > > > > JIRA Rya > > > > > > > > =3D=3D Initial Committers =3D=3D > > > > * Roshan Punnoose, roshanp at gmail dot com > > > > * David Rapp, dnrapp at ncsu dot edu > > > > * Adina Crainiceanu, adinancr at gmail dot com > > > > * Aaron Mihalik, aaron.mihalik at gmail dot com > > > > * Puja Valiyil, pujav65 at gmail dot com > > > > * Jennifer Brown, jennifer.brown at parsons dot com > > > > * Steve Wagner, steve.r.wagner at gmail dot com > > > > > > > > =3D=3D Affiliations =3D=3D > > > > * Roshan Punnoose, Enlighten IT Consulting > > > > * David Rapp, North Carolina State University > > > > * Adina Crainiceanu, US Naval Academy > > > > * Aaron Mihalik, Parsons > > > > * Puja Valiyil, Parsons > > > > * Jennifer Brown, Parsons > > > > * Steve Wagner, Enlighten IT Consulting > > > > > > > > =3D=3D Sponsors =3D=3D > > > > =3D=3D=3D Champion =3D=3D=3D > > > > * Adam Fuchs, ASF Member, afuchs at apache dot org > > > > > > > > =3D=3D=3D Nominated Mentors =3D=3D=3D > > > > * Josh Elser josh dot elser at gmail dot com > > > > * Edward J. Yoon edwardyoon at apache dot org > > > > * Sean Busbey busbey at cloudera dot com > > > > > > > > We are seeking additional mentors > > > > > > > > =3D=3D=3D Sponsoring Entity =3D=3D=3D > > > > Apache Incubator > > > > > > > > > > > > > > > > -- > > > Dr. Adina Crainiceanu > > > http://www.usna.edu/Users/cs/adina/ > > > > > > > > > > > -- > > Sean > > > --e89a8f3ba55f3c5fb4051ff4c17b--