lucene-java-commits mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Apache Wiki <wikidi...@apache.org>
Subject [Lucene-java Wiki] Update of "OceanRealtimeSearch" by JasonRutherglen
Date Mon, 29 Sep 2008 15:00:32 GMT
Dear Wiki user,

You have subscribed to a wiki page or wiki category on "Lucene-java Wiki" for change notification.

The following page has been changed by JasonRutherglen:
http://wiki.apache.org/lucene-java/OceanRealtimeSearch

------------------------------------------------------------------------------
  = Introduction =
  
- Ocean enables realtime search written in Java using Lucene.  It is currently in patch phase
at [http://issues.apache.org/jira/browse/LUCENE-1313 LUCENE-1313].  Ocean offers a way for
Lucene based applications to take advantage of realtime search.  Realtime search makes search
systems more like a database.  This is probably why Google calls it's system [http://code.google.com/apis/gdata/
GData].  GData is offered as an online service and not software.  Ocean addresses this by
providing the same functionality as GData open sourced for use in any project.  GData does
not provide facets, this is something that Ocean can provide in the future.  [http://code.google.com/apis/base/
GBase] which is a cousin of GData offers location based search.  Ocean offers location based
search using [http://sourceforge.net/projects/locallucene/ LocalLucene].  By open sourcing
realtime search more functionality may be built in over time by the community which is something
GData being an online se
 rvice cannot do.  Google does not offer realtime search in it's search appliance.  I am unaware
of other search vendors offering realtime search.  
+ Ocean enables realtime search written in Java using Lucene.  It is currently in patch phase
at [http://issues.apache.org/jira/browse/LUCENE-1313 LUCENE-1313].  Ocean offers a way for
Lucene based applications to take advantage of realtime search.  Realtime search makes search
systems more like a database.  Realtime search offers similar functionality to that of a relational
database such as atomicity and transactional updates.  Like an RDBMS, where after an insert
one can immediately select what was just added without incurring any kind of extra cost, updated
documents are also immediately searchable.  
  
- There is a good [http://acmqueue.com/modules.php?name=Content&pa=showpage&pid=337
article] written by Adam Bosworth who seems to have headed up the GData project at Google.
 I think many of his points are quite valid.  It is worth mentioning the main points of the
article here as they also define the positive attributes of the Ocean open source search system.
+ There is a good [http://acmqueue.com/modules.php?name=Content&pa=showpage&pid=337
article] written by Adam Bosworth.  I think many of his points are quite valid.  It is worth
mentioning the main points of the article here as they also define the positive attributes
of the Ocean open source search system.
  
   * It is worth making things simple enough that one can harness Moore’s law in parallel
   * It is acceptable to be stale much of the time
@@ -56, +56 @@

  Example:
  
  <snapshot id="29.02" numDocs="10" maxDoc="25" deletedDocs="15">
- 
- <index snapshotid="974" id="787" segmentGeneration="401" type="disk" maxDoc="466" numDocs="442"
deletedDoc="95" minDocumentId="117" maxDocumentId="483" minSnapshotId="693" maxSnapshotId="116"
deleteFlushId="876" lastAppliedId="780" />
+ <index snapshotid="974" id="787" segmentGeneration="401" type="disk" maxDoc="466" numDocs="442"
deletedDoc="95" minDocumentId="117" maxDocumentId="483" minSnapshotId="693" maxSnapshotId="116"
deleteFlushId="876" lastAppliedId="780" /></snapshot>
- </snapshot>
  
  ||Name||Value||
  ||snapshotid||The id of the snapshot||

Mime
View raw message