Categories: AnalyticsBusiness IntelligenceHadoopIn Memory

Comparison of Stream Processing Frameworks and Products

See how products, libraries, and frameworks that full under ‘streaming data analytics’ use cases are categorized and compared.

Streaming Analytics processes data in real time while it is in motion. This concept and technology emerged several years ago in financial trading, but it is growing increasingly important these days due to digitalization and Internet of Things (IoT). The following slide deck from a recent talk at a conference covers:

Real world success stories from different industries (Manufacturing, Retailing, Sports)
Alternative Frameworks and Products for Stream Processing
Complementary Relationship to Data Warehouse, Apache Hadoop, Statistics, Machine Learning, Open Source R, SAS, Matlab, etc.

Stream Processing Frameworks and Products

The following picture shows the key differences between frameworks (no matter if open source such as Apache Storm, Apache Flink, Apache Spark or closed source such as Amazon Kinesis) and products (such as TIBCO StreamBase / Live Datamart, IBM InfoSphere Streams, Software AG’s Apama).

Of course, you can implement everything by writing code and using one or more frameworks. However, besides several other benefits, the key differentiator of using a product is time to market. You can realize projects in weeks instead of months or even years. Delivering quickly is the number one priority of most enterprises these days in a world where the only constant is change!

I recommend that you choose one or two frameworks and one or two products to implement a proof of concept (POC); spend e.g. five days with each one to implement a streaming analytics use case, which includes integration of input feeds or sensors, correlation / sliding windows / patterns, simulation and testing, and a live user interface to monitor and act proactively. At the end, you can compare the results and decide which fits you best.

Fast Data and Streaming Analytics in the Era of Hadoop, R and Apache Spark

The following slide deck discusses the above topics in much more detail:

You are currently viewing a placeholder content from Default. To access the actual content, click the button below. Please note that doing so will share data with third-party providers.

Unblock content Accept required service and unblock content

More Information

Parts of this (extensive) slide deck were used for talks at several international conferences such as JavaOne 2015 in San Francisco. I appreciate any feedback about the content to improve it continuously…If you want to learn more about Streaming Analytics and its relation to Big Data and Apache Hadoop, I recommend the following InfoQ article: Real-Time Stream Processing as Game Changer in a Big Data World with Hadoop and Data Warehouse.

Kai Waehner

bridging the gap between technical innovation and business value for real-time data streaming and applied AI.

Next Microservices = Death of the ESB? (2016, Meetup Dublin) »

Previous « Difference between a Data Warehouse and a Live Datamart?

Published by

Kai Waehner

10 years ago

UFC VIP Experience Worth the Price? Fan Review. Business Perspective. Tech Vision.

The Ultimate Fighting Championship (UFC) held Fight Night London on March 21, 2026, at The…

1 day ago

Analytics

Dashboards and Queries for Apache Kafka: Operational, Explorative, and the Role of the Context Engine

Dashboards are a popular way to make streaming data visible and useful, but they are…

2 weeks ago

Telecom

Data Streaming at MWC 2026: How Apache Kafka, Flink and Agentic AI Power Telecom Trends

Mobile World Congress (MWC) 2026 highlights the shift from batch systems to real time data…

3 weeks ago

Aviation

From Takeoff to Touchdown: Real-Time Aviation with Data Streaming at Qantas

This blog post explores how data streaming transforms airline operations by enabling real-time visibility, faster…

1 month ago

Book

The Ultimate Data Streaming Guide is Back – Second Edition of the Book and Industry Editions Now Available

The second edition of The Ultimate Data Streaming Guide is now available as a free…

2 months ago

Queues for Kafka

When (Not) to Use Queues for Kafka?

Apache Kafka has long been the foundation for real-time data streaming. With the release of…