WJAX 2013 Slides online: Big Data beyond Apache Hadoop – How to integrate ALL your Data with Camel and Talend

Here is my updated slide deck from WJAX 2013 in Munich. I have added some real-world use cases that show Hadoop integration with Apache Camel and Talend Open Studio for Big Data.

Abstract

Big data represents a significant paradigm shift in enterprise technology. It radically changes the nature of the data management profession because it introduces new concerns about the volume, velocity and variety of corporate data. Apache Hadoop is the open source de facto standard for implementing big data solutions on the Java platform. Hadoop consists of its kernel, MapReduce, and the Hadoop Distributed Filesystem (HDFS). A challenging task is to send all data to Hadoop for processing and storage (and then get it back to your application later), because in practice data comes from many different applications (SAP, Salesforce, Siebel, etc.) and data stores (files, SQL, NoSQL), uses different technologies and concepts for communication (e.g. HTTP, FTP, RMI, JMS), and arrives in different formats such as CSV, XML, or binary data. This session shows different open source frameworks and products that solve this challenging task. Learn how to use every conceivable kind of data with Hadoop without writing lots of complex or redundant boilerplate code.

Slides

The slide deck is hosted on www.slideshare.net.

Kai Waehner

bridging the gap between technical innovation and business value for real-time data streaming, processing and analytics

Tags: Apache Camel, Big Data, ESB, Hadoop, Talend

12 years ago
