manifoldcf-dev mailing list archives

From Gustavo Beneitez <gustavo.benei...@gmail.com>
Subject ManifoldCF database model
Date Tue, 16 Oct 2018 15:50:55 GMT
Hi all,

how do you do? I was wondering whether there is any technical document
that explains the meaning of each database table and the relationships
between documents, repositories, jobs, and output connectors (some kind of
database model).

We are facing some "garbage issues": jobs are created, duplicated,
associated with transformations, linked to outputs (Elasticsearch), run,
and finally deleted. In the end, though, documents that should also have
been deleted from the output connector are sometimes still there. We don't
know whether they remain visible because they point to an existing job,
because a job ended unexpectedly, or because of some other failure.

We need to understand the database model in order to determine when
documents stored in Elasticsearch can be safely removed because they are no
longer referenced by any process. This check would be run periodically, for
example every week.

Thanks in advance!
