Tag: Data Quality

Data Catalog

Beyond Enterprise Data Lineage: The Case for a Platform-Independent Data Catalog

Most organizations start their data governance journey by asking how to track where data comes from and where it goes. They quickly discover a harder

18. May 2026

Apache Flink

Shift Left Architecture at Siemens: Real-Time Innovation in Manufacturing and Logistics with Data Streaming

Industrial enterprises face increasing pressure to move faster, automate more, and adapt to constant change—without compromising reliability. Siemens Digital Industries addresses this challenge by combining

11. April 2025

Apache Flink

The Top 20 Problems with Batch Processing (and How to Fix Them with Data Streaming)

Batch processing introduces delays, complexity, and data quality issues that modern businesses can no longer afford. This article outlines the most common problems with batch

1. April 2025

Apache Flink

Open Standards for Data Lineage: OpenLineage for Batch AND Streaming

One of the greatest wishes of companies is end-to-end visibility in their operational and analytical workflows. Where does data come from? Where does it go?

13. May 2024

Comparison: Data Preparation vs. Inline Data Wrangling in Machine Learning and Deep Learning Projects

Data Preparation: Comparison of Programming Languages, Frameworks and Tools for Data Preprocessing and (Inline) Data Wrangling in Machine Learning / Deep Learning Projects.

13. February 2017