Category: Apache Spark

Shift Left Architecture with Confluent Data Streaming and Databricks Lakehouse Medallion

Analytics

Shift Left Architecture for AI and Analytics with Confluent and Databricks

Confluent and Databricks enable a modern data architecture that unifies real-time streaming and lakehouse analytics. By combining shift-left principles with the structured layers of the

9. May 2025

Confluent and Databricks for Data Integration and Stream Processing

Apache Kafka

Confluent Data Streaming Platform vs. Databricks Data Intelligence Platform for Data Integration and Processing

This blog explores how Confluent and Databricks address data integration and processing in modern architectures. Confluent provides real-time, event-driven pipelines connecting operational systems, APIs, and

5. May 2025

Fraud Prevention in Mobility Services with Data Streaming using Apache Kafka and Flink with AI Machine Learning

Allgemein

Fraud Detection in Mobility Services (Ride-Hailing, Food Delivery) with Data Streaming using Apache Kafka and Flink

Mobility services like Uber, Grab, and FREE NOW (Lyft) rely on real-time data to power seamless trips, deliveries, and payments. But this real-time nature also

28. April 2025

Amazon MSK

The Data Streaming Landscape 2025

Data streaming is a new software category. It has grown from niche adoption to becoming a fundamental part of modern data architecture, leveraging open source

4. December 2024

Data Streaming Trends for 2025 - Leading with Apache Kafka and Flink

Amazon MSK

Top Trends for Data Streaming with Apache Kafka and Flink in 2025

Apache Kafka and Apache Flink are leading open-source frameworks for data streaming that serve as the foundation for cloud services, enabling organizations to unlock the

2. December 2024

Data Streaming Landscape 2024 around Kafka Flink and Cloud

Apache Flink

The Data Streaming Landscape 2024

The research company Forrester defines data streaming platforms as a new software category in a new Forrester Wave. Apache Kafka is the de facto standard

21. December 2023

Data Streaming Landscape 2023 with Apache Kafka Flink and much more

Apache Flink

The Data Streaming Landscape 2023

Data streaming is a new software category to process data in motion. Apache Kafka is the de facto standard used by over 100,000 organizations. Plenty

21. December 2022

Case Studies for Cloud Native Analytics with Data Warehouse Data Lake Data Streaming Lakehouse

Apache Kafka

Case Studies: Cloud-native Data Streaming for Data Warehouse Modernization

The concepts and architectures of a data warehouse, a data lake, and data streaming are complementary to solving business problems. Unfortunately, the underlying technologies are

18. July 2022

Machine Learning Trends of 2018 combined with the Apache Kafka Ecosystem

At OOP 2018 conference in Munich, I presented an updated version of my talk about building scalable, mission-critical microservices with the Apache Kafka ecosystem and

13. February 2018

Apache Kafka Streams + Machine Learning (Spark, TensorFlow, H2O.ai)

Apache Kafka Streams to build Real Time Streaming Microservices. Apply Machine Learning / Deep Learning using Spark, TensorFlow, H2O.ai, etc. to add AI. Embed Kafka

23. May 2017