incubator-general mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Seetharam Venkatesh <venkat...@innerzeal.com>
Subject Re: [VOTE] Accept Rya into the Apache Incubator
Date Thu, 17 Sep 2015 17:22:48 GMT
+1, the proposal looks very interesting. We have a metadata and governance
solution built in Apache Atlas (http://atlas.incubator.apache.org) and use
an array of technologies to store the relationships using Titan and HBase.
This will be of interest to Atlas community.

If you are still looking for mentoring, I can volunteer and help.

Thanks,
Venkatesh

On Thu, Sep 17, 2015 at 10:03 AM Sean Busbey <busbey@cloudera.com> wrote:

> My apologies, I was on vacation and missed the start of this thread.
>
> late +1 (binding)
>
> On Wed, Sep 16, 2015 at 8:01 AM, Adina Crainiceanu <adina@usna.edu> wrote:
>
> > +1 of course :) I'm very excited at the prospect of joining the Apache
> > community!
> >
> >
> > --Adina Crainiceanu
> >
> > On Mon, Sep 14, 2015 at 11:17 AM, Adam Fuchs <afuchs@apache.org> wrote:
> >
> > > Thanks again for the healthy discussion on Rya. With that, I would like
> > to
> > > call a VOTE for accepting Rya as a new incubator project.
> > >
> > > The proposal text is included below, and is posted on the wiki here:
> > > https://wiki.apache.org/incubator/RyaProposal
> > >
> > > The discussion thread on Rya starts here:
> > >
> > >
> >
> http://mail-archives.apache.org/mod_mbox/incubator-general/201509.mbox/%3CCALt5_xJKtRcUr3WGjfrY77DYWF0-8DWi%3DzyS7hrMFTg%2BYAORjQ%40mail.gmail.com%3E
> > >
> > > The vote will be open until Thu Sep 17 15:15:00 UTC 2015.
> > >
> > > [ ] +1 accept Rya in the Incubator
> > > [ ] ±0
> > > [ ] -1 because...
> > >
> > > Thanks,
> > > Adam
> > >
> > >
> > > = Rya Proposal =
> > > == Abstract ==
> > > Rya (pronounced "ree-uh" /rēə/) is a cloud-based RDF triple store that
> > > supports SPARQL queries.
> > >
> > > == Proposal ==
> > > Rya is a scalable RDF data management system built on top of Accumulo.
> > Rya
> > > uses novel storage methods, indexing schemes, and query processing
> > > techniques that scale to billions of triples across multiple nodes. Rya
> > > provides fast and easy access to the data through SPARQL, a
> conventional
> > > query mechanism for RDF data.
> > >
> > > == Background ==
> > > RDF is a World Wide Web Consortium (W3C) standard used in describing
> > > resources on the Web. The smallest data unit is a triple consisting of
> > > subject, predicate, and object. Using this framework, it is very easy
> to
> > > describe any resource, not just Web related. For example, if you want
> to
> > > say that Alice is a professor, you can represent this as an RDF triple
> > like
> > > (Alice, rdf:type, Professor). In general, RDF is an open world
> framework
> > > that allows anyone to make any statement about any resource, which
> makes
> > it
> > >  a popular choice for expressing a large variety of data.
> > >
> > > RDF is used in conjunction with the Web Ontology Language (OWL). OWL
> is a
> > > framework for describing models or ontologies for RDF. It defines
> > concepts,
> > > relationships, and/or structure of RDF documents. These models can be
> > used
> > > to 'reason/infer' information about entities within a given domain. For
> > > example, you can express that a Professor is a sub class of Faculty,
> > > (Professor, rdfs:subClassOf, Faculty) and knowing that (Alice,
> rdf:type,
> > > Professor), it can be inferred that (Alice, rdf:type, Faculty).
> > >
> > > SPARQL is an RDF query language. Similar with SQL, SPARQL has SELECT
> and
> > > WHERE clauses; however, it is based on querying and retrieving RDF
> > triples.
> > >
> > > Work on Rya, a large scale distributed system for  storing and querying
> > RDF
> > > data, started in 2010.
> > >
> > > == Rationale ==
> > > With the increase in data size, there is a need for scalable systems
> for
> > > storing and retrieving RDF data in a cluster of nodes. We believe that
> > Rya
> > > can fulfill that role. We expect that communities within government,
> > health
> > > care, finance, and others who generate large amounts of RDF data will
> be
> > > most interested in this project.
> > >
> > > From its inception, the project operated with an Apache-style license,
> > but
> > > it was open to mostly US government-related projects only. We believe
> > that
> > > having the project and the development open for all will benefit both
> the
> > > project and the interested communities.
> > >
> > > == Current Status ==
> > > The project source code and documentation are currently hosted in a
> > private
> > > repository on Github. New users are added to the repository upon
> request.
> > >
> > > === Meritocracy ===
> > > Meritocracy is the model that we currently follow, and we want to
> build a
> > > larger and more diverse developer community by becoming an Apache
> > project.
> > >
> > > === Community ===
> > > Rya has being building a community of users and developers for the
> past 3
> > > years. There is currently an active workgroup with monthly meetings and
> > the
> > > number of participants in the meeting is increasing.
> > >
> > > === Core Developers ===
> > > The core developers are a diverse group of people who are either
> > government
> > > employees or former / current government contractors from different
> > > companies.
> > >
> > > === Alignment ===
> > > Rya is built on top of Accumulo, an Apache project.
> > >
> > > == Known Risks ==
> > > === Orphaned Products ===
> > > There is a very small risk of becoming orphaned. The current
> contributors
> > > are strongly committed to the project, there is a large enough number
> of
> > > developers interested in contributing to the project, and we believe
> that
> > > the support for the project will continue to grow from the interested
> > > communities.
> > >
> > > === Inexperience with Open Source ===
> > > The initial committers have various degrees of experience with open
> > source
> > > projects - from very new to experienced. This project was open source
> > > within government from the beginning. We are aware that it will be
> > > different and more difficult functioning in a real open source
> > environment.
> > > We are enthusiastic and committed to learning the Apache way and being
> > > successful in operating under Apache's development process.
> > >
> > > === Homogenous Developers ===
> > > The current list of developers form a heterogeneous group, with people
> > for
> > > academia, government, and industry, collaborating from distributed
> > > geographic locations. We aim to expand the list of contributors with
> the
> > > help of the Apache incubation process.
> > >
> > > === Reliance on Salaried Developers ===
> > > Many but not all of the developers working on the project are salaried
> > > employees, paid to work on this project. They will continue to
> contribute
> > > to the open source project. Some of the initial committers continued as
> > > volunteers even if no longer employed to work on this project and they
> > plan
> > > to continue supporting the project.
> > >
> > > === Relationships with Other Apache Products ===
> > > Rya uses Apache Accumulo, Hadoop, Zookeeper, Maven.
> > >
> > >  *Apache Jena API or Apache Commons RDF API could become the RDF API
> used
> > > by Rya, but such a decision was not made.
> > >  *Apache Clerezza is database/triple store agnostic, and as such could
> be
> > > complementary to Rya.
> > >  *Apache Stanbol focuses on providing semantic services, while Rya
> > focuses
> > > on providing a distributed triple store solution, with support for
> SPARQL
> > > and OWL reasoning.
> > >  *Apache Marmotta provides an implementation of a Linked Data Platform,
> > and
> > > overlaps in some of the goals and functionality with Rya (RDF triple
> > store,
> > > SPARQL support among others). There are many opportunities for
> > > collaboration with these projects and we are looking forward to such a
> > > collaboration.
> > >
> > > === Apache Brand ===
> > > Rya has generated interest in the government. It also generated
> interest
> > > within academia and industry. We believe that everyone could benefit
> from
> > > having Rya as an open source project. Due to its strong ties to
> Accumulo,
> > > an Apache project, and due to the values of the Apache Foundation, we
> > > believe that Apache incubator is the right place for Rya.
> > >
> > > == Documentation ==
> > > Two peer-reviewed publications [1,2] about Rya were published in 2012
> and
> > > 2015. More documentation is available in the code.
> > >
> > > [1] Roshan Punnoose, Adina Crainiceanu, David Rapp. [[
> > >
> > >
> >
> http://www.usna.edu/Users/cs/adina/research/Rya%5FCloudI%32%30%31%32.pdf|Rya
> <http://www.usna.edu/Users/cs/adina/research/Rya%5FCloudI%32%30%31%32.pdf%7CRya>
> > <
> http://www.usna.edu/Users/cs/adina/research/Rya%5FCloudI%32%30%31%32.pdf%7CRya
> >
> > > :
> > > A Scalable RDF Triple Store for the Clouds]]. Proceedings of the 1st
> > > International Workshop on Cloud Intelligence, Pages 4:1-4:8, August
> 2012
> > >
> > > [2] Roshan Punnoose, Adina Crainiceanu, David Rapp. [[
> > >
> http://www.usna.edu/Users/cs/adina/research/Rya_ISjournal2013.pdf|SPARQL
> > > in
> > > the Clouds Using Rya]]. Information Systems, Volume 48, Pages 181-195,
> > > March 2015 (Available online 23 July 2013)
> > >
> > > == Initial Source ==
> > > The code is currently in a private Github repository, due to security
> and
> > > IP review processes. We intend to open it up via transferring the code
> to
> > > an ASF repository.
> > >
> > > == Source and Intellectual Property Submission Plan ==
> > > The source code has been released under the Apache License, Version 2.
> > > Software grant, and CCLAs have been submitted. ICLAs for initial
> > committers
> > > have been submitted or are in progress.
> > >
> > > == External Dependencies ==
> > >  * [[http://rdf4j.org|OpenRDF Sesame]] (BSD license)
> > >  * [[http://www.geomesa.org/|GeoMesa]] (Apache License, Version 2.0)
> > >  * [[https://accumulo.apache.org/|Accumulo]] (Apache License, Version
> > 2.0)
> > >  * [[https://hadoop.apache.org/|Hadoop]] (Apache License, Version 2.0)
> > >  * [[https://pig.apache.org/|Pig]] (Apache License, Version 2.0)
> > >  * [[http://tinkerpop.incubator.apache.org/|TinkerPop]] (Apache
> License,
> > > Version 2.0)
> > >
> > > == Cryptography ==
> > > The proposal does not involve any cryptographic code.
> > >
> > > == Required Resources ==
> > > === Mailing lists ===
> > >  * private@rya.incubator.apache.org
> > >  * dev@rya.incubator.apache.org
> > >  * commits@rya.incubator.apache.org
> > >
> > > === Git Repository ===
> > > https://git-wip-us.apache.org/repos/asf/incubator-rya.git
> > >
> > > === Issue Tracking ===
> > > JIRA Rya
> > >
> > > == Initial Committers ==
> > >  * Roshan Punnoose, roshanp at gmail dot com
> > >  * David Rapp, dnrapp at ncsu dot edu
> > >  * Adina Crainiceanu, adinancr at gmail dot com
> > >  * Aaron Mihalik, aaron.mihalik at gmail dot com
> > >  * Puja Valiyil, pujav65 at gmail dot com
> > >  * Jennifer Brown, jennifer.brown at parsons dot com
> > >  * Steve Wagner, steve.r.wagner at gmail dot com
> > >
> > > == Affiliations ==
> > >  * Roshan Punnoose, Enlighten IT Consulting
> > >  * David Rapp, North Carolina State University
> > >  * Adina Crainiceanu, US Naval Academy
> > >  * Aaron Mihalik, Parsons
> > >  * Puja Valiyil, Parsons
> > >  * Jennifer Brown, Parsons
> > >  * Steve Wagner, Enlighten IT Consulting
> > >
> > > == Sponsors ==
> > > === Champion ===
> > >  * Adam Fuchs, ASF Member, afuchs at apache dot org
> > >
> > > === Nominated Mentors ===
> > >  * Josh Elser josh dot elser at gmail dot com
> > >  * Edward J. Yoon edwardyoon at apache dot org
> > >  * Sean Busbey busbey at cloudera dot com
> > >
> > > We are seeking additional mentors
> > >
> > > === Sponsoring Entity ===
> > > Apache Incubator
> > >
> >
> >
> >
> > --
> > Dr. Adina Crainiceanu
> > http://www.usna.edu/Users/cs/adina/
> >
>
>
>
> --
> Sean
>

Mime
  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message