jackrabbit-users mailing list archives

From dave_gough <bagel10002...@googlemail.com>
Subject Replication Strategy for Jackrabbit
Date Wed, 10 Feb 2010 09:22:11 GMT

Hi,

Can anyone help regarding the best way of migrating and syncing from one
"live" repository to a "fail-over" repository on a totally different server?
Here are some background facts:

1) We are using the
org.apache.jackrabbit.core.persistence.bundle.MySqlPersistenceManager.
2) Blobs are stored externally on disk (a SAN) and are up to 1 TB.
3) Indexes are about 300 MB.
4) We typically have about 100MB of multimedia content ingested into the
repository every hour. 

We were initially thinking of using MySQL replication for the database and
rsync for the indexes and blobs (after an initial transfer of blobs by
external disk), both to start things off and as an ongoing means of
replicating the repository to a fail-over system. However, we have concerns
about the consistency between the data in MySQL and the indexes/blobs on the
file system.

Some questions we have are:

How fault-tolerant is the repository if blobs referenced by the database
are missing?

Does a cleanup occur if a blob is not found, or is a warning issued? I am
aware of the "consistencyCheck/Fix" attributes in the repository.xml config
and of the GarbageCollector, which we run every day at 02h00. I presume
these all affect inconsistencies between the database and the file system
in some way.

We are not afraid of losing some data, so if we have to fail over and some
data is missing that is OK as we can just re-ingest any missing content, but
we are concerned about corrupting the repository through inconsistent
database/file-system and indexes.
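
To make the concern concrete, a check like the one below could be run on the
fail-over copy after each sync to list blobs that the database references but
that rsync has not yet delivered. It is only a sketch under assumptions that
are not confirmed for our setup: that blob identifiers are hex strings, that
each blob is stored under three directory levels taken from the first six
characters of its identifier (root/ab/cd/ef/abcdef...), and that the list of
referenced identifiers has already been exported from MySQL by some other
means. The function names are hypothetical.

```python
import tempfile
from pathlib import Path


def blob_path(root: Path, identifier: str) -> Path:
    # ASSUMPTION: blobs live at root/ab/cd/ef/<identifier>, where ab, cd, ef
    # are the first three pairs of characters of the identifier.
    return root / identifier[0:2] / identifier[2:4] / identifier[4:6] / identifier


def find_missing_blobs(root, identifiers):
    # Return the identifiers that have no regular file on disk, in input order.
    root = Path(root)
    return [i for i in identifiers if not blob_path(root, i).is_file()]


def demo():
    # Build a throwaway store holding one blob, then ask about two identifiers.
    with tempfile.TemporaryDirectory() as tmp:
        root = Path(tmp)
        present = "0123456789abcdef0123456789abcdef01234567"
        absent = "fedcba9876543210fedcba9876543210fedcba98"
        target = blob_path(root, present)
        target.parent.mkdir(parents=True)
        target.write_bytes(b"example blob content")
        return find_missing_blobs(root, [present, absent])


if __name__ == "__main__":
    for identifier in demo():
        print("missing blob:", identifier)
```

Anything this reports would be content we would expect to re-ingest after a
fail-over; it says nothing about how the repository itself reacts to the gap.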

We cannot use Jackrabbit clustering because the file systems are not shared:
one of the servers is an off-site fail-over server.

Does this seem a workable solution, or is there a better way of doing this?

regards

Dave Gough

