lucene-dev mailing list archives

From Otis Gospodnetic <>
Subject Re: Ocean and GData
Date Sat, 27 Sep 2008 13:46:34 GMT

Maybe it's just terminology, but to me GData is like RMI - something that facilitates transport.
If Google had a system that's not real-time (i.e. you add something and you can't get it
to show up in search results immediately) they could still be using GData with that system
for indexing, querying, etc.

To me, bringing GData and GBase into the picture confuses things.  You could also compare the
real-time Lucene/Solr search goal to "like any normal RDBMS - you INSERT X and you can SELECT
X immediately".  Maybe the desire to compare with G* is because it's Google's, and Google is
known for search?

Otis --
Sematext -- -- Lucene - Solr - Nutch

----- Original Message ----
> From: Jason Rutherglen <>
> To:
> Sent: Saturday, September 27, 2008 8:03:50 AM
> Subject: Re: Ocean and GData
> Hello Otis,
> GData and GBase to me sound like they are short for Google Database.
> The goal with Ocean is to provide a Lucene-based search database that
> provides out-of-the-box functionality like what Google Data/Base
> offers.  Unlike MapReduce, there are no infrastructure whitepapers on
> how GData/Base works so I had to make a broad comparison rather than a
> specific one.  Realtime seems like a feature a search database should
> have to qualify as such and so GData is mentioned as the only known
> realtime solution (other than Twitter's Summize, which I found out about
> later).  The service Google provides through the GData protocol seems
> to also be referred to as GData, but could simply be called the
> "infrastructure supporting Google's realtime search web services".
> Jason
> On Sat, Sep 27, 2008 at 1:17 AM, Otis Gospodnetic
> wrote:
> > Hi,
> >
> > Here is one thing that's been confusing me.  [...] often mentions GData and
> > relates it to real-time search (to Ocean), as if it is GData that provides
> > real-time search functionality.  But isn't GData simply a communication
> > protocol (Atom with some custom additions by Google)?  If so, are statements
> > like "Ocean addresses this by providing the same functionality as GData open
> > sourced for use in any project" really correct?  If GData is just a
> > communication protocol, and Ocean is really primarily the search engine that
> > is capable of real-time search, then is it really correct to compare Ocean
> > with GData?  My feeling is that the thinking is:
> > "When I access Google's databases using GData I can see my changes to those
> > databases immediately".
> > But that doesn't make GData this real-time thing, but rather the backend, no?
> >
> >
> > Please enlighten me if I'm misunderstanding what GData is.  Thanks,
> >
> > Otis
> > --
> > Sematext -- -- Lucene - Solr - Nutch
> >
> >
> > ---------------------------------------------------------------------
> > To unsubscribe, e-mail:
> > For additional commands, e-mail:
> >
> >