incubator-gora-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Enis Söztutar <enis....@gmail.com>
Subject Re: [DISCUSS] the Re: [VOTE] Gora Graduation Resolution on general@
Date Wed, 21 Dec 2011 02:48:02 GMT
Hi guys,

I just checked "the definition" of gora in terms of the recent discussions
here, and at general@incubator, and I am OK with updating the project
definition, and update the site documentation as well, if we can come up
with a short, but comprehensive definition.

I think the start point should be the project goals section on the
homepage, which states:

The overall goal for Gora is to become the standard data representation and
persistence framework for big data. The roadmap of Gora can be grouped as
follows.


   - *Data Persistence :* Persisting objects to Column stores such as
   HBase, Cassandra, Hypertable; key-value stores such as Voldermort, Redis,
   etc; SQL databases, such as MySQL, HSQLDB, flat files in local file system
   of Hadoop HDFS.
   - *Data Access :* An easy to use Java-friendly common API for accessing
   the data regardless of its location.
   - *Indexing :* Persisting objects to Lucene and Solr indexes,
   accessing/querying the data with Gora API.
   - *Analysis :* Accesing the data and making analysis through adapters
   for Apache Pig, Apache Hive and Cascading
   - *MapReduce support :* Out-of-the-box and extensive MapReduce (Apache
   Hadoop) support for data in the data store.

I really like this extended definition, which captures all of the aspects
for the current and possibly future modules. We can of course revise it, if
you think that it no longer represents the overall project vision.

>From the above definition, it is clear that key-value stores, column
oriented databases, cache/indexing layers, analysis, and mapreduce support
is targeted. I agree that the definition "ORM(-like) framework for NoSQL"
does not quite cut it, since NoSQL is, as mentioned before, very vague, and
most of the time it is just OM, instead of ORM.  For whether to include
Hadoop or not in the definition, it really does not matter that much. For
example, Apache Hive has it, but Pig does not in their respective home
pages.

So, my proposal is:
Apache Gora is an open source framework providing an in memory data model
and persistence for big data. Gora supports persisting to column stores,
key value stores, document stores and RDBMSs, and analyzing the data with
extensive Hadoop MapReduce support.

Feel free to edit / accept / or provide an alternative.

Thanks,
Enis


On Tue, Dec 20, 2011 at 4:50 PM, Mattmann, Chris A (388J) <
chris.a.mattmann@jpl.nasa.gov> wrote:

> Cool Lewis. Yeah I think GORA-71 and Sebb's issue will be
> important to address, and then we'll hopefully move forward to
> a successful VOTE completion next week.
>
> Cheers,
> Chris
>
> On Dec 20, 2011, at 12:08 PM, Lewis John Mcgibbney wrote:
>
> > Hi Chris,
> >
> > Yeah I would like us to pick the low hanging fruits before end of
> > week. If anything, it will at least appear that we've taken the
> > valuable IPMC opinions into consideration.
> >
> > Are we happy with changing, the core Gora project description to
> > "open-source software for mapping objects to NoSQL databases", as this
> > was Marcel's suggestion before he voiced his +1. The only other
> > alternative that was voiced was to head over to board@... wdyt?
> > Another critical point here was the possible ambiguity of including
> > datastores for Apache Hadoop e.g. whether or not Gora relied upon
> > Hadoop. Can we clarify whether this should be made explicit as well.
> > In my opinion this is not a potential block but it would be grand to
> > get it ironed out.
> >
> > If we also address Sebb's GORA-71 it would be good, I'm waiting on him
> > getting back to us via Jira, however if anyone else has their own
> > solutions then fire them up and I will begin work.
> >
> > Thanks
> >
> > On Tue, Dec 20, 2011 at 6:39 PM, Mattmann, Chris A (388J)
> > <chris.a.mattmann@jpl.nasa.gov> wrote:
> >> Hey Lewis,
> >>
> >> Great discuss thread, thanks for opening this conversation up.
> >> It would be nice to get Sebb's, Niclas, and Daniel's VOTE. I'll
> >> ping them to see what they would like to VOTE and then check perhaps at
> the
> >> end of this week/beginning of next week. At this point, we already have
> >> enough VOTEs to graduate, considering that Enis and Julien are members
> >> of the IPMC.
> >>
> >> I don't think it would hurt though to address the doc issues, maybe we
> can
> >> update it in the meanwhile before end of this week or beginning of next
> >> week?
> >>
> >> Cheers,
> >> Chris
> >>
> >> On Dec 20, 2011, at 7:21 AM, Lewis John Mcgibbney wrote:
> >>
> >>> Hi Guys,
> >>>
> >>> I think so far things have been going OK regarding the above
> >>> conversation which Chris directed towards general@.
> >>>
> >>> Is there anything we could do to push this through e.g. make the
> >>> necessary steps as a community to convince Sebb, Niclas Hendhman,
> >>> Daniel Shahaf that we a worthy of getting thier VOTE?
> >>>
> >>> There was some consensus regarding a possible change in documentation
> >>> which will hopefully paint a more accurate descriptive picture of what
> >>> Gora actually is... can we formally agree if this is necessary, also I
> >>> think it would be great to address the recently created GORA-71 issue.
> >>> By the looks of it, this is a fundamental barrier to us getting Sebb's
> >>> VOTE.
> >>>
> >>> Can anyone else provide any suggestions at this stage?
> >>>
> >>> Thanks.
> >>>
> >>> --
> >>> Lewis
> >>
> >>
> >> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
> >> Chris Mattmann, Ph.D.
> >> Senior Computer Scientist
> >> NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
> >> Office: 171-266B, Mailstop: 171-246
> >> Email: chris.a.mattmann@nasa.gov
> >> WWW:   http://sunset.usc.edu/~mattmann/
> >> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
> >> Adjunct Assistant Professor, Computer Science Department
> >> University of Southern California, Los Angeles, CA 90089 USA
> >> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
> >>
> >
> >
> >
> > --
> > Lewis
>
>
> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
> Chris Mattmann, Ph.D.
> Senior Computer Scientist
> NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
> Office: 171-266B, Mailstop: 171-246
> Email: chris.a.mattmann@nasa.gov
> WWW:   http://sunset.usc.edu/~mattmann/
> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
> Adjunct Assistant Professor, Computer Science Department
> University of Southern California, Los Angeles, CA 90089 USA
> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>
>

Mime
  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message