incubator-cvs mailing list archives

From Apache Wiki
Subject [Incubator Wiki] Update of "GriffinProposal" by alexlv
Date Fri, 18 Nov 2016 19:02:42 GMT
Dear Wiki user,

You have subscribed to a wiki page or wiki category on "Incubator Wiki" for change notification.

The "GriffinProposal" page has been changed by alexlv:

New page:
= Griffin Proposal =

== Abstract ==
Griffin is a Data Quality Service platform built on Apache Hadoop and Apache Spark. It provides
a framework for defining data quality models, executing data quality measurements, and automating
data profiling and validation, as well as unified data quality visualization across multiple
data systems. It aims to address data quality challenges in big data and streaming contexts.

== Proposal ==
Griffin is an open source Data Quality solution for distributed data systems at any scale, in
both streaming and batch data contexts. When people use open source products (e.g. Apache Hadoop,
Apache Spark, Apache Kafka, Apache Storm), they need a data quality service to build
confidence in the quality of the data processed by those platforms. Griffin creates a unified
process for defining and constructing data quality measurement pipelines across multiple data
systems to provide:
 * Automatic quality validation of the data
 * Data profiling and anomaly detection
 * Data quality lineage from upstream to downstream data systems
 * Data quality health monitoring visualization
 * Shared infrastructure resource management
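
To make "automatic quality validation" more concrete, here is a minimal sketch of two common data quality measures: completeness of a field, and accuracy of an upstream dataset against a downstream copy. This is not Griffin's API; the function names, the record layout, and the sample data are assumptions made purely for illustration, and a real deployment would run such measures on Apache Spark rather than on in-memory lists.

```python
from typing import Mapping, Optional, Sequence

# Hypothetical record shape: a flat mapping of field name to string value.
Record = Mapping[str, Optional[str]]


def completeness(records: Sequence[Record], field: str) -> float:
    """Fraction of records whose `field` is present and non-empty."""
    if not records:
        return 1.0
    filled = sum(1 for r in records if r.get(field) not in (None, ""))
    return filled / len(records)


def accuracy(source: Sequence[Record], target: Sequence[Record],
             key: str, fields: Sequence[str]) -> float:
    """Fraction of source records with a target record that has the
    same key and identical values for every field in `fields`."""
    if not source:
        return 1.0
    target_by_key = {r[key]: r for r in target}
    matched = 0
    for r in source:
        t = target_by_key.get(r[key])
        if t is not None and all(r.get(f) == t.get(f) for f in fields):
            matched += 1
    return matched / len(source)


# Illustrative data only: an upstream table and its downstream copy.
upstream = [
    {"id": "1", "amount": "10"},
    {"id": "2", "amount": "20"},
    {"id": "3", "amount": "30"},
    {"id": "4", "amount": None},
]
downstream = [
    {"id": "1", "amount": "10"},
    {"id": "2", "amount": "25"},  # value drifted downstream
    {"id": "3", "amount": "30"},
]

upstream_completeness = completeness(upstream, "amount")
pipeline_accuracy = accuracy(upstream, downstream, "id", ["amount"])
```

In this sketch, a monitoring job could compare such scores against thresholds and feed them to a dashboard, which is roughly the role the health monitoring item above describes.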
