Confluent Archives - Kai Waehner

6.6K views
16 minute read

Complex Event Processing (CEP) with Apache Flink: What It Is and When (Not) to Use It

ByKai Waehner
14. April 2026

Complex Event Processing is the most underused capability in Apache Flink. It detects meaningful event sequences in real time, fires only when a pattern is confirmed, and even catches events that never arrive. This guide covers what Flink CEP is, when to reach for it instead of stream processing, how to implement it, and what happened to the legacy CEP market.

Qantas Airline Data Streaming Platform with Apache Kafka for Airline Operations

4.5K views
5 minute read

From Takeoff to Touchdown: Real-Time Aviation with Data Streaming at Qantas

ByKai Waehner
16. February 2026

This blog post explores how data streaming transforms airline operations by enabling real-time visibility, faster decision-making, and improved customer experience. Using Qantas as a leading example, it highlights how a modern data streaming platform powered by Apache Kafka supports flight operations, crew coordination, baggage handling, and airport collaboration. It also explains technical integrations using Kafka Connect for AIDX message processing. The Qantas story illustrates how real-time data creates tangible business value across the aviation industry.

4.9K views
5 minute read

Diskless Kafka at FinTech Robinhood for Cost-Efficient Log Analytics and Observability

ByKai Waehner
22. January 2026

Diskless Kafka is transforming how fintech and financial services organizations handle observability and log analytics. By using the Kafka protocol with cloud-native object storage, companies like Robinhood reduce infrastructure costs and gain elastic scalability. This article explores how Robinhood leverages Kafka, Flink, and WarpStream to build a real-time platform that supports trading, monitoring, and compliance at scale.

30.0K views
16 minute read

The Data Streaming Landscape 2026

ByKai Waehner
5. December 2025

Data streaming is now a core software category in modern data architecture. It powers real-time use cases like fraud prevention, personalization, supply chain optimization, and AI automation. What started with open source Apache Kafka and Flink has grown into a critical layer for business operations. The 2026 Data Streaming Landscape shows the most relevant Data Streaming Platform evolution. These platforms connect systems, process data in motion, enforce governance, and support mission-critical workloads at scale. Kafka is the standard protocol, but protocol support alone is not enough. Enterprises need full feature compatibility, 24/7 support, and expert guidance for security, resilience, and cloud strategy.

Amazon MSK Forces a Kafka Cluster Migration from ZooKeeper to KRaft

14.9K views
4 minute read

In-Place Kafka Cluster Upgrades from ZooKeeper to KRaft are Not Possible with Amazon MSK

ByKai Waehner
7. September 2025

The Apache Kafka community introduced KIP-500 to remove ZooKeeper and replace it with KRaft, a new built-in consensus layer. This was a major milestone. It simplified operations, improved scalability, and reduced complexity. Importantly, Kafka supports smooth, zero downtime migrations from ZooKeeper to KRaft, even for large, business critical clusters. But NOT with Amazon MSK…

Mainframe Modernization and Integration with Data Streaming using Apache Kafka IBM MQ IIDR CDC Precisely Qlik

18.1K views
14 minute read

Mainframe Integration with Data Streaming: Architecture, Business Value, Real-World Success

ByKai Waehner
13. June 2025

The mainframe is evolving—not fading. With cloud-native features, AI acceleration, and quantum-safe encryption, platforms like IBM z16 and z17 remain central to critical industries. But modern demands require real-time data access and system agility. Apache Kafka and Flink make this possible by streaming data bi-directionally between DB2, IMS, and MQ and cloud analytics platforms. This enables event-driven architectures without disrupting core systems. This post outlines proven strategies—offloading, integration, and replacement—and includes real-world examples across industries. The result: lower costs, faster innovation, and smarter use of legacy systems.

Data Streaming with Confluent Meets SAP and Databricks for Agentic AI at Sapphire in Madrid

8.0K views
6 minute read

Data Streaming Meets the SAP Ecosystem and Databricks – Insights from SAP Sapphire Madrid

ByKai Waehner
28. May 2025

SAP Sapphire 2025 in Madrid brought together global SAP users, partners, and technology leaders to showcase the future of enterprise data strategy. Key themes included SAP’s Business Data Cloud (BDC) vision, Joule for Agentic AI, and the deepening SAP-Databricks partnership. A major topic throughout the event was the increasing need for real-time integration across SAP and non-SAP systems—highlighting the critical role of event-driven architectures and data streaming platforms like Confluent. This blog shares insights on how data streaming enhances SAP ecosystems, supports AI initiatives, and enables industry-specific use cases across transactional and analytical domains.

Enterprise Application Integration with Confliuent and Databricks for Oracle SAP Salesforce Servicenow et al

12.6K views
12 minute read

Databricks and Confluent in the World of Enterprise Software (with SAP as Example)

ByKai Waehner
12. May 2025

Enterprise data lives in complex ecosystems—SAP, Oracle, Salesforce, ServiceNow, IBM Mainframes, and more. This article explores how Confluent and Databricks integrate with SAP to bridge operational and analytical workloads in real time. It outlines architectural patterns, trade-offs, and use cases like supply chain optimization, predictive maintenance, and financial reporting, showing how modern data streaming unlocks agility, reuse, and AI-readiness across even the most SAP-centric environments.

Shift Left Architecture with Confluent Data Streaming and Databricks Lakehouse Medallion

12.9K views
9 minute read

Shift Left Architecture for AI and Analytics with Confluent and Databricks

ByKai Waehner
9. May 2025

Confluent and Databricks enable a modern data architecture that unifies real-time streaming and lakehouse analytics. By combining shift-left principles with the structured layers of the Medallion Architecture, teams can improve data quality, reduce pipeline complexity, and accelerate insights for both operational and analytical workloads. Technologies like Apache Kafka, Flink, and Delta Lake form the backbone of scalable, AI-ready pipelines across cloud and hybrid environments.

Confluent and Databricks for Data Integration and Stream Processing

18.0K views
10 minute read

Confluent Data Streaming Platform vs. Databricks Data Intelligence Platform for Data Integration and Processing

ByKai Waehner
5. May 2025

This blog explores how Confluent and Databricks address data integration and processing in modern architectures. Confluent provides real-time, event-driven pipelines connecting operational systems, APIs, and batch sources with consistent, governed data flows. Databricks specializes in large-scale batch processing, data enrichment, and AI model development. Together, they offer a unified approach that bridges operational and analytical workloads. Key topics include ingestion patterns, the role of Tableflow, the shift-left architecture for earlier data validation, and real-world examples like Uniper’s energy trading platform powered by Confluent and Databricks.

Technology Evangelist

Kai Waehner

Confluent

Complex Event Processing (CEP) with Apache Flink: What It Is and When (Not) to Use It

Global Executive Technology Strategist

Apache Kafka vs. Middleware (MQ, ETL, ESB) – Slides + Video

Deep Learning Example: Apache Kafka + Python + Keras + TensorFlow + Deeplearning4j

Why Databricks and Snowflake Speak the Kafka Protocol: Ingestion vs. Architecture

Process Intelligence Explained: Mining, Orchestration, and the Decision Gate