Streaming Analytics Comparison of Open Source Frameworks, Products, Cloud Services

In November 2016, I am at Big Data Spain in Madrid for the first time. A great conference with many awesome speakers and sessions about very hot topics such as Apache Hadoop, Spark Spark, Streaming Processing / Streaming Analytics and Machine Learning. If you are interested in big data, then this conference is for you! My two talks:

  • How to Apply Machine Learning to Real Time Processing” (see slides and video recording from a similar conference talk).
  • Comparison of Streaming Analytics Options” (the reason for this blog post; an updated version of my talk from JavaOne 2015)

Here I wanna share the slides and a video recording of the latter one…

Abstract: Comparison of Stream Processing Options

This session discusses the technical concepts of stream processing / streaming analytics and how it is related to big data, mobile, cloud and internet of things. Different use cases such as predictive fault management or fraud detection are used to show and compare alternative frameworks and products for stream processing and streaming analytics.

The focus of the session lies on comparing

  • different open source frameworks such as Apache Apex, Apache Flink or Apache Spark Streaming
  • engines from software vendors such as IBM InfoSphere Streams, TIBCO StreamBase
  • cloud offerings such as AWS Kinesis.
  • real time streaming UIs such as Striim, Zoomdata or TIBCO Live Datamart.  Live demos will give the audience a good feeling about how to use these frameworks and tools.

The session will also discuss how stream processing is related to Apache Hadoop frameworks (such as MapReduce, Hive, Pig or Impala) and machine learning (such as R, Spark ML or H2O.ai).

Slides – Alternatives for Streaming Analytics

The following slide deck is a more extensive version of the talk at Big Data Spain (as the conference talks were only 30 minutes):

Click on the button to load the content from www.slideshare.net.

Load content

The video recording walks you through the above slide deck:

As always, I appreciate any comments, questions or other feedback.

Kai Waehner

bridging the gap between technical innovation and business value for real-time data streaming, processing and analytics

Recent Posts

10 FinTech Predictions That Depend on Real Time Data Streaming

Financial services companies are moving from batch processing to real time data flow. A data…

5 hours ago

Top Trends for Data Streaming with Apache Kafka and Flink in 2026

Each year brings new momentum to the data streaming space. In 2026, six key trends…

1 week ago

The Data Streaming Landscape 2026

Data streaming is now a core software category in modern data architecture. It powers real-time…

2 weeks ago

Life as a Lufthansa HON Circle Member: Inside the Ultimate Frequent Flyer Status

Reaching Lufthansa HON Circle status was both a personal milestone and a significant financial investment.…

2 weeks ago

CARIAD’s Unified Data Platform: A Data Streaming Automotive Success Story Behind Volkswagen’s Software-Defined Vehicles

The automotive industry transforms rapidly. Cars are now software-defined vehicles (SDVs) that demand constant, real-time…

3 weeks ago

Data Streaming Meets Lakehouse: Apache Iceberg for Unified Real-Time and Batch Analytics

Apache Iceberg is gaining momentum as the open table format of choice for modern data…

4 weeks ago