Top
image credit: Adobe Stock

Testing and Monitoring Data Pipelines: Part Two

June 19, 2023

In part one of this article, we discussed how data testing can specifically test a data object (e.g., table, column, metadata) at one particular point in the data pipeline. While this technique is practical for in-database verifications – as tests are embedded directly in their data modeling efforts – it is tedious and time-consuming when end-to-end data pipelines are to be examined.

Data monitoring, on the other hand, helps build a holistic picture of your pipelines and their health. By tracking various metrics in multiple components in a data pipeline over time, data engineers can interpret anomalies in relation to the whole data ecosystem.

Read More on DATAVERSITY