Categories: AnalyticsBig DataBusiness IntelligenceHadoop

Comparison of Stream Processing and Streaming Analytics Alternatives (Apache Storm, Spark, IBM InfoSphere Streams, TIBCO StreamBase, Software AG Apama)

The demand for stream processing is increasing a lot these days. Frameworks (Apache Storm, Spark) and products (e.g. IBM InfoSphere Streams, TIBCO StreamBase, Software AG Apama) for stream processing and streaming analytics are getting a lot of attention these days. The reason is that often processing big volumes of data is not enough. Data has to be processed fast, so that a firm can react to changing business conditions in real time. This is required for trading, fraud detection, system monitoring, and many other examples. A “too late architecture” cannot realize these use cases.

There is not much literature available about stream processing and streaming analytics in 2014. Analyst Forrester has published a report recently: The Forrester Wave™: Big Data Streaming Analytics Platforms, Q3 2014. Leaders are Software AG (Apama), IBM (InfoSphere Streams), TIBCO (StreamBase), SAP (Event Stream Processor) and Informatica.

Real-Time Stream Processing as Game Changer in a Big Data World with Hadoop and Data Warehouse

I have written an article for InfoQ, which gives some more details about stream processing and streaming analytics:

Real-Time Stream Processing as Game Changer in a Big Data World with Hadoop and Data Warehouse

This article discusses what stream processing is, how it fits into a big data architecture with Hadoop and a data warehouse (DWH), when stream processing makes sense, and what technologies and products you can choose from. The following is just a snippet of the article describing the components of a stream processing solution.

Comparison of Alternatives for Stream Processing / Streaming Analytics

Stream processing can be implemented by doing-it-yourself, using a framework or a product. Doing-it-yourself should not be an option in most cases, because there are good open source frameworks available for free. However, a stream processing product might solve many of your issues out-of-the-box, while a framework still requires a lot of self-coding and the Total Cost of Ownership might be much higher than expected compared to a product.

From a technical perspective, the following components are required to solve all “streaming challenges” and implement a stream processing use case:

Server: An ultra-low-latency application server optimized for processing real-time streaming event data at high throughputs and low latency (usually in-memory).
IDE: A development environment, which ideally offers visual development, debugging and testing of stream processing processes using streaming operators for filtering, aggregation, correlation, time windows, transformation, etc. Extendibility, e.g. integration of libraries or building custom operators and connectors, is also important.
Connectors: Pre-built data connectivity to communicate with data sources such as database (e.g. MySQL, Oracle, IBM DB2), DWH (e.g. HP Vertica), market data (e.g. Bloomberg, FIX, Reuters), statistics (e.g. R, MATLAB, TERR) or technology (e.g. JMS, Hadoop, Java, .NET).
Streaming Analytics: A user interface, which allows monitoring, management and real-time analytics for live streaming data. Automated alerts and human reactions should also be possible.
Live Data Mart and/or Operational Business Intelligence: Aggregates streaming data for ad-hoc, end-user, query access, alerting, dynamic aggregation, and user management. Live stream visualization, graphing, charting, slice and dice are also important.

As of end-2014, only a few products are available on the market that offer these components. Often, a lot of custom coding is required instead of using a full product for stream processing.

Apache Storm, Spark, IBM InfoSphere Streams, TIBCO StreamBase, SAG Apama

The two most-widespread open source frameworks for stream processing are Apache Storm and Spark. IBM InfoSphere Streams, TIBCO StreamBase and Software AG’s Apama are important players from proprietary vendors.

Please read my InfoQ article for more details about these products, and how they relate to a Data Warehouse (DWH) and Apache Hadoop. Here is the link: Real-Time Stream Processing as Game Changer in a Big Data World with Hadoop and Data Warehouse.

As always, I appreciate all feedback and discussions…

Kai Waehner

bridging the gap between technical innovation and business value for real-time data streaming and applied AI.

Next A New Front for SOA: Open API and API Management as Game Changer »

Previous « Intelligent Business Process Management Suites (iBPMS) - The Next-Generation BPM for a Big Data World

Published by

Kai Waehner

Tags: Apache SparkApache StormApamaBig DataBusiness Intelligencedata warehousedwhHadoopIBMIBM InfoSphere StreamsLiveViewopen sourceReal TimeSoftware AGStream ProcessingStreamBaseTIBCO

12 years ago

Dashboards and Queries for Apache Kafka: Operational, Explorative, and the Role of the Context Engine

Dashboards are a popular way to make streaming data visible and useful, but they are…

6 days ago

Telecom

Data Streaming at MWC 2026: How Apache Kafka, Flink and Agentic AI Power Telecom Trends

Mobile World Congress (MWC) 2026 highlights the shift from batch systems to real time data…

2 weeks ago

Aviation

From Takeoff to Touchdown: Real-Time Aviation with Data Streaming at Qantas

This blog post explores how data streaming transforms airline operations by enabling real-time visibility, faster…

4 weeks ago

Book

The Ultimate Data Streaming Guide is Back – Second Edition of the Book and Industry Editions Now Available

The second edition of The Ultimate Data Streaming Guide is now available as a free…

1 month ago

Queues for Kafka

When (Not) to Use Queues for Kafka?

Apache Kafka has long been the foundation for real-time data streaming. With the release of…

2 months ago

Diskless Kafka

Diskless Kafka at FinTech Robinhood for Cost-Efficient Log Analytics and Observability

Diskless Kafka is transforming how fintech and financial services organizations handle observability and log analytics.…

2 months ago

Comparison of Stream Processing and Streaming Analytics Alternatives (Apache Storm, Spark, IBM InfoSphere Streams, TIBCO StreamBase, Software AG Apama)

Real-Time Stream Processing as Game Changer in a Big Data World with Hadoop and Data Warehouse

Comparison of Alternatives for Stream Processing / Streaming Analytics

Apache Storm, Spark, IBM InfoSphere Streams, TIBCO StreamBase, SAG Apama

Related Post

Recent Posts

Dashboards and Queries for Apache Kafka: Operational, Explorative, and the Role of the Context Engine

Data Streaming at MWC 2026: How Apache Kafka, Flink and Agentic AI Power Telecom Trends

From Takeoff to Touchdown: Real-Time Aviation with Data Streaming at Qantas

The Ultimate Data Streaming Guide is Back – Second Edition of the Book and Industry Editions Now Available

When (Not) to Use Queues for Kafka?

Diskless Kafka at FinTech Robinhood for Cost-Efficient Log Analytics and Observability