Return-Path: X-Original-To: apmail-incubator-general-archive@www.apache.org Delivered-To: apmail-incubator-general-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 9B0311077C for ; Mon, 14 Sep 2015 16:26:19 +0000 (UTC) Received: (qmail 48563 invoked by uid 500); 14 Sep 2015 16:26:12 -0000 Delivered-To: apmail-incubator-general-archive@incubator.apache.org Received: (qmail 48370 invoked by uid 500); 14 Sep 2015 16:26:12 -0000 Mailing-List: contact general-help@incubator.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: general@incubator.apache.org Delivered-To: mailing list general@incubator.apache.org Received: (qmail 48359 invoked by uid 99); 14 Sep 2015 16:26:12 -0000 Received: from Unknown (HELO spamd2-us-west.apache.org) (209.188.14.142) by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 14 Sep 2015 16:26:12 +0000 Received: from localhost (localhost [127.0.0.1]) by spamd2-us-west.apache.org (ASF Mail Server at spamd2-us-west.apache.org) with ESMTP id F2A181A19EF for ; Mon, 14 Sep 2015 16:26:11 +0000 (UTC) X-Virus-Scanned: Debian amavisd-new at spamd2-us-west.apache.org X-Spam-Flag: NO X-Spam-Score: 1.517 X-Spam-Level: * X-Spam-Status: No, score=1.517 tagged_above=-999 required=6.31 tests=[HEADER_FROM_DIFFERENT_DOMAINS=0.001, HTTP_EXCESSIVE_ESCAPES=1.516, SPF_PASS=-0.001, URIBL_BLOCKED=0.001] autolearn=disabled Received: from mx1-us-east.apache.org ([10.40.0.8]) by localhost (spamd2-us-west.apache.org [10.40.0.9]) (amavisd-new, port 10024) with ESMTP id ou_wkS79paOq for ; Mon, 14 Sep 2015 16:26:00 +0000 (UTC) Received: from mail-wi0-f178.google.com (mail-wi0-f178.google.com [209.85.212.178]) by mx1-us-east.apache.org (ASF Mail Server at mx1-us-east.apache.org) with ESMTPS id 083A34414C for ; Mon, 14 Sep 2015 16:26:00 +0000 (UTC) Received: by wicgb1 with SMTP id gb1so149156453wic.1 for ; Mon, 14 Sep 2015 09:25:53 -0700 (PDT) X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20130820; h=subject:to:references:from:message-id:date:user-agent:mime-version :in-reply-to:content-type:content-transfer-encoding; bh=4xSpX1qNOzBsgnGi2FGhNSKPb7ys6CsyxLCWYOmH4qo=; b=b2GUtEU4AgK86ZxQRlqhvS5+MWxpqDPOvbC2Fir7lsjjeR6QaZe23Oz4hLe4++OUQ9 vbJ/g2aKj4c3w1m6/ZvZK25IogVdW+Sza6j6YkX7W6AiVAQhDbX6M74T/tOJBMZZTbb1 836/QXbBf3w8R+1IBfQOkJVjZxJKq00YU18pt9Gyoo5copfBM0jdLtVubrS2dIJepWrI JZrF7poyBNVJmM9AiCgb0w38714/48r3xSZpyCX9Hnkh26paWcWxgA4bng9uKxorsLda RQ+zJiKmTpjUF7ULW1kt5qhvHiZCnxi19fX5o1yqueP8WEmTsVsOcIx92JBzn0wuTPuZ Hn3g== X-Received: by 10.195.13.200 with SMTP id fa8mr12575824wjd.9.1442247953130; Mon, 14 Sep 2015 09:25:53 -0700 (PDT) Received: from [192.168.0.5] (cpc37-aztw23-2-0-cust35.18-1.cable.virginm.net. [94.174.128.36]) by smtp.googlemail.com with ESMTPSA id go5sm15187189wib.3.2015.09.14.09.25.51 for (version=TLSv1/SSLv3 cipher=OTHER); Mon, 14 Sep 2015 09:25:51 -0700 (PDT) Subject: Re: [VOTE] Accept Rya into the Apache Incubator To: general@incubator.apache.org References: From: Andy Seaborne Message-ID: <55F6F50F.8080801@apache.org> Date: Mon, 14 Sep 2015 17:25:51 +0100 User-Agent: Mozilla/5.0 (X11; Linux x86_64; rv:38.0) Gecko/20100101 Thunderbird/38.2.0 MIME-Version: 1.0 In-Reply-To: Content-Type: text/plain; charset=utf-8; format=flowed Content-Transfer-Encoding: 8bit +1 (binding) On 14/09/15 16:17, Adam Fuchs wrote: > Thanks again for the healthy discussion on Rya. With that, I would like to > call a VOTE for accepting Rya as a new incubator project. > > The proposal text is included below, and is posted on the wiki here: > https://wiki.apache.org/incubator/RyaProposal > > The discussion thread on Rya starts here: > http://mail-archives.apache.org/mod_mbox/incubator-general/201509.mbox/%3CCALt5_xJKtRcUr3WGjfrY77DYWF0-8DWi%3DzyS7hrMFTg%2BYAORjQ%40mail.gmail.com%3E > > The vote will be open until Thu Sep 17 15:15:00 UTC 2015. > > [ ] +1 accept Rya in the Incubator > [ ] ±0 > [ ] -1 because... > > Thanks, > Adam > > > = Rya Proposal = > == Abstract == > Rya (pronounced "ree-uh" /rēə/) is a cloud-based RDF triple store that > supports SPARQL queries. > > == Proposal == > Rya is a scalable RDF data management system built on top of Accumulo. Rya > uses novel storage methods, indexing schemes, and query processing > techniques that scale to billions of triples across multiple nodes. Rya > provides fast and easy access to the data through SPARQL, a conventional > query mechanism for RDF data. > > == Background == > RDF is a World Wide Web Consortium (W3C) standard used in describing > resources on the Web. The smallest data unit is a triple consisting of > subject, predicate, and object. Using this framework, it is very easy to > describe any resource, not just Web related. For example, if you want to > say that Alice is a professor, you can represent this as an RDF triple like > (Alice, rdf:type, Professor). In general, RDF is an open world framework > that allows anyone to make any statement about any resource, which makes it > a popular choice for expressing a large variety of data. > > RDF is used in conjunction with the Web Ontology Language (OWL). OWL is a > framework for describing models or ontologies for RDF. It defines concepts, > relationships, and/or structure of RDF documents. These models can be used > to 'reason/infer' information about entities within a given domain. For > example, you can express that a Professor is a sub class of Faculty, > (Professor, rdfs:subClassOf, Faculty) and knowing that (Alice, rdf:type, > Professor), it can be inferred that (Alice, rdf:type, Faculty). > > SPARQL is an RDF query language. Similar with SQL, SPARQL has SELECT and > WHERE clauses; however, it is based on querying and retrieving RDF triples. > > Work on Rya, a large scale distributed system for storing and querying RDF > data, started in 2010. > > == Rationale == > With the increase in data size, there is a need for scalable systems for > storing and retrieving RDF data in a cluster of nodes. We believe that Rya > can fulfill that role. We expect that communities within government, health > care, finance, and others who generate large amounts of RDF data will be > most interested in this project. > > From its inception, the project operated with an Apache-style license, but > it was open to mostly US government-related projects only. We believe that > having the project and the development open for all will benefit both the > project and the interested communities. > > == Current Status == > The project source code and documentation are currently hosted in a private > repository on Github. New users are added to the repository upon request. > > === Meritocracy === > Meritocracy is the model that we currently follow, and we want to build a > larger and more diverse developer community by becoming an Apache project. > > === Community === > Rya has being building a community of users and developers for the past 3 > years. There is currently an active workgroup with monthly meetings and the > number of participants in the meeting is increasing. > > === Core Developers === > The core developers are a diverse group of people who are either government > employees or former / current government contractors from different > companies. > > === Alignment === > Rya is built on top of Accumulo, an Apache project. > > == Known Risks == > === Orphaned Products === > There is a very small risk of becoming orphaned. The current contributors > are strongly committed to the project, there is a large enough number of > developers interested in contributing to the project, and we believe that > the support for the project will continue to grow from the interested > communities. > > === Inexperience with Open Source === > The initial committers have various degrees of experience with open source > projects - from very new to experienced. This project was open source > within government from the beginning. We are aware that it will be > different and more difficult functioning in a real open source environment. > We are enthusiastic and committed to learning the Apache way and being > successful in operating under Apache's development process. > > === Homogenous Developers === > The current list of developers form a heterogeneous group, with people for > academia, government, and industry, collaborating from distributed > geographic locations. We aim to expand the list of contributors with the > help of the Apache incubation process. > > === Reliance on Salaried Developers === > Many but not all of the developers working on the project are salaried > employees, paid to work on this project. They will continue to contribute > to the open source project. Some of the initial committers continued as > volunteers even if no longer employed to work on this project and they plan > to continue supporting the project. > > === Relationships with Other Apache Products === > Rya uses Apache Accumulo, Hadoop, Zookeeper, Maven. > > *Apache Jena API or Apache Commons RDF API could become the RDF API used > by Rya, but such a decision was not made. > *Apache Clerezza is database/triple store agnostic, and as such could be > complementary to Rya. > *Apache Stanbol focuses on providing semantic services, while Rya focuses > on providing a distributed triple store solution, with support for SPARQL > and OWL reasoning. > *Apache Marmotta provides an implementation of a Linked Data Platform, and > overlaps in some of the goals and functionality with Rya (RDF triple store, > SPARQL support among others). There are many opportunities for > collaboration with these projects and we are looking forward to such a > collaboration. > > === Apache Brand === > Rya has generated interest in the government. It also generated interest > within academia and industry. We believe that everyone could benefit from > having Rya as an open source project. Due to its strong ties to Accumulo, > an Apache project, and due to the values of the Apache Foundation, we > believe that Apache incubator is the right place for Rya. > > == Documentation == > Two peer-reviewed publications [1,2] about Rya were published in 2012 and > 2015. More documentation is available in the code. > > [1] Roshan Punnoose, Adina Crainiceanu, David Rapp. [[ > http://www.usna.edu/Users/cs/adina/research/Rya%5FCloudI%32%30%31%32.pdf|Rya: > A Scalable RDF Triple Store for the Clouds]]. Proceedings of the 1st > International Workshop on Cloud Intelligence, Pages 4:1-4:8, August 2012 > > [2] Roshan Punnoose, Adina Crainiceanu, David Rapp. [[ > http://www.usna.edu/Users/cs/adina/research/Rya_ISjournal2013.pdf|SPARQL in > the Clouds Using Rya]]. Information Systems, Volume 48, Pages 181-195, > March 2015 (Available online 23 July 2013) > > == Initial Source == > The code is currently in a private Github repository, due to security and > IP review processes. We intend to open it up via transferring the code to > an ASF repository. > > == Source and Intellectual Property Submission Plan == > The source code has been released under the Apache License, Version 2. > Software grant, and CCLAs have been submitted. ICLAs for initial committers > have been submitted or are in progress. > > == External Dependencies == > * [[http://rdf4j.org|OpenRDF Sesame]] (BSD license) > * [[http://www.geomesa.org/|GeoMesa]] (Apache License, Version 2.0) > * [[https://accumulo.apache.org/|Accumulo]] (Apache License, Version 2.0) > * [[https://hadoop.apache.org/|Hadoop]] (Apache License, Version 2.0) > * [[https://pig.apache.org/|Pig]] (Apache License, Version 2.0) > * [[http://tinkerpop.incubator.apache.org/|TinkerPop]] (Apache License, > Version 2.0) > > == Cryptography == > The proposal does not involve any cryptographic code. > > == Required Resources == > === Mailing lists === > * private@rya.incubator.apache.org > * dev@rya.incubator.apache.org > * commits@rya.incubator.apache.org > > === Git Repository === > https://git-wip-us.apache.org/repos/asf/incubator-rya.git > > === Issue Tracking === > JIRA Rya > > == Initial Committers == > * Roshan Punnoose, roshanp at gmail dot com > * David Rapp, dnrapp at ncsu dot edu > * Adina Crainiceanu, adinancr at gmail dot com > * Aaron Mihalik, aaron.mihalik at gmail dot com > * Puja Valiyil, pujav65 at gmail dot com > * Jennifer Brown, jennifer.brown at parsons dot com > * Steve Wagner, steve.r.wagner at gmail dot com > > == Affiliations == > * Roshan Punnoose, Enlighten IT Consulting > * David Rapp, North Carolina State University > * Adina Crainiceanu, US Naval Academy > * Aaron Mihalik, Parsons > * Puja Valiyil, Parsons > * Jennifer Brown, Parsons > * Steve Wagner, Enlighten IT Consulting > > == Sponsors == > === Champion === > * Adam Fuchs, ASF Member, afuchs at apache dot org > > === Nominated Mentors === > * Josh Elser josh dot elser at gmail dot com > * Edward J. Yoon edwardyoon at apache dot org > * Sean Busbey busbey at cloudera dot com > > We are seeking additional mentors > > === Sponsoring Entity === > Apache Incubator > --------------------------------------------------------------------- To unsubscribe, e-mail: general-unsubscribe@incubator.apache.org For additional commands, e-mail: general-help@incubator.apache.org