EAI

Slides from NoSQLmatters: “Big Data beyond Apache Hadoop – How to integrate ALL your data with Apache Camel and Talend”

April 26, 2013

Slides from my talk “Big Data beyond Apache Hadoop – How to integrate ALL your data” at NoSQLmatters 2013 in Cologne are online.

Here the abstract:

Big data represents a significant paradigm shift in enterprise technology. Big data radically changes the nature of the data management profession as it introduces new concerns about the volume, velocity and variety of corporate data.
Apache Hadoop is the open source defacto standard for implementing big data solutions on the Java platform. Hadoop consists of its kernel, MapReduce, and the Hadoop Distributed Filesystem (HDFS). A challenging task is to send all data to Hadoop for processing and storage (and then get it back to your application later), because in practice data comes from many different applications (SAP, Salesforce, Siebel, etc.) and databases (File, SQL, NoSQL), uses different technologies and concepts for communication (e.g. HTTP, FTP, RMI, JMS), and consists of different data formats using CSV, XML, binary data, or other alternatives.
This session shows different open source frameworks and tools to solve this challenging task. Learn how to use every thinkable data with Hadoop – without plenty of complex or redundant boilerplate code.

Here the slides:

http://www.slideshare.net/KaiWaehner/big-data-beyond-apache-hadoop-how-to-integrate-all-your-data

Share this post :

Kai Waehner

bridging the gap between technical innovation and business value for data integration, workflow orchestration, and agentic AI.

EAI

Slides from NoSQLmatters: “Big Data beyond Apache Hadoop – How to integrate ALL your data with Apache Camel and Talend”

Don't miss my next post. Subscribe!

Share this post :

Latest Posts

Data Integration vs Workflow Orchestration: Connecting Systems Is Not Coordinating the Work

Process Intelligence Landscape 2026: Mining, Orchestration, and the Agentic AI Shift

When to Use AMQP, JMS, Kafka, or MQTT: Trade-offs, Not a Winner

Kafka vs Flink vs Spark: Do You Really Need Real-Time?

Don’t miss my next post. Subscribe!

EAI

Slides from NoSQLmatters: “Big Data beyond Apache Hadoop – How to integrate ALL your data with Apache Camel and Talend”

Don't miss my next post. Subscribe!

Share this post :

Tag Cloud

Data Integration vs Workflow Orchestration: Connecting Systems Is Not Coordinating the Work

Process Intelligence Landscape 2026: Mining, Orchestration, and the Agentic AI Shift

When to Use AMQP, JMS, Kafka, or MQTT: Trade-offs, Not a Winner

Kafka vs Flink vs Spark: Do You Really Need Real-Time?

Don’t miss my next post. Subscribe!