Categories: EAI

Slides online: “Big Data beyond Apache Hadoop – How to Integrate ALL your Data” – JavaOne 2013

Slides from my session “Big Data beyond Apache Hadoop – How to Integrate ALL your Data” at JavaOne 2013 in San Francisco are online.

Abstract

Big data represents a significant paradigm shift in enterprise technology. Big data radically changes the nature of the data management profession as it introduces new concerns about the volume, velocity and variety of corporate data.

Apache Hadoop is the open source defacto standard for implementing big data solutions on the Java platform. Hadoop consists of its kernel, MapReduce, and the Hadoop Distributed Filesystem (HDFS). A challenging task is to send all data to Hadoop for processing and storage (and then get it back to your application later), because in practice data comes from many different applications (SAP, Salesforce, Siebel, etc.) and databases (File, SQL, NoSQL), uses different technologies and concepts for communication (e.g. HTTP, FTP, RMI, JMS), and consists of different data formats using CSV, XML, binary data, or other alternatives.

This session shows different open source frameworks and products (especially Apache Camel and Talend Open Studio for Big Data) to solve this challenging task. Learn how to use every thinkable data with Hadoop – without plenty of complex or redundant boilerplate code.

Slides

You are currently viewing a placeholder content from Default. To access the actual content, click the button below. Please note that doing so will share data with third-party providers.

Unblock content Accept required service and unblock content

More Information

Kai Waehner

bridging the gap between technical innovation and business value for real-time data streaming and applied AI.

Next How to choose the right Open Source Integration Framework - Apache Camel (JBoss, Talend), Spring Integration (Pivotal) or Mule ESB? - JavaOne 2013 »

Previous « Slides online: "NoSQL takes over! Alternatives for Integration of NoSQL databases" - NoSQL Roadshow Zurich

Published by

Kai Waehner

Tags: Apache CamelappleBig DataClouderaEAIEnterprise Application IntegrationHadoopHortonworksMap ReduceMapRPivotalSpring IntegrationtalendTalend Open Studio

13 years ago

Why Databricks and Snowflake Speak the Kafka Protocol: Ingestion vs. Architecture

Databricks and Snowflake now speak the Kafka protocol. But using the Kafka API to feed…

2 days ago

Choosing an ERP for Manufacturing: How AI Is Reshaping the Vendor Landscape

ERP vendor selection for manufacturing is not a product decision. It is a strategic bet…

1 week ago

Process Intelligence

Process Intelligence Explained: Mining, Orchestration, and the Decision Gate

Process intelligence is not a single tool. It combines process mining, process orchestration, and a…

2 weeks ago

ERP Migration to SAP S/4HANA and Beyond: Lessons Learned from German Manufacturing

ERP modernization fails when the technology leads and the process work follows. Three German manufacturers…

3 weeks ago

Data Catalog

Beyond Enterprise Data Lineage: The Case for a Platform-Independent Data Catalog

Most organizations start their data governance journey by asking how to track where data comes…

1 month ago

Data Ownership in the Age of Agentic AI: Why SAP’s API Policy Forces a Data Integration Reckoning for Every Enterprise

Every enterprise is being told to go agentic. Meanwhile, the platforms holding your most critical…

2 months ago