Apache Kafka Streams + Machine Learning (Spark, TensorFlow, H2O.ai)

Posted in Analytics, Apache Kafka, Apache Spark, Big Data, Confluent, Hadoop, Integration, Kafka Connect, Kafka Streams, Machine Learning, Messaging, Microservices, Open Source, Stream Processing on May 23rd, 2017 by Kai Wähner

I started at Confluent in May 2017 to work as Technology Evangelist focusing on topics around the open source framework Apache Kafka. I think Machine Learning is one of the hottest buzzwords these days as it can add huge business value in any industry. Therefore, you will see various other posts from me around Apache Kafka (messaging), Kafka Connect (integration), Kafka Streams (stream processing), Confluent’s additional open source add-ons on top of Kafka (Schema Registry, Replicator, Auto Balancer, etc.). I will explain how to leverage all this for machine learning and other big data technologies in real world production scenarios.

Tags: , , , , , , , , , , , , , , , , , , ,

Why I Move (Back) to Open Source for Messaging, Integration and Stream Processing

Posted in Analytics, API Management, Big Data, Blockchain, Cloud, Cloud-Native, Docker, ESB, Hadoop, Internet of Things, Java / JEE, Machine Learning, Microservices, Middleware, SOA on May 1st, 2017 by Kai Wähner

After three great years at TIBCO Software, I move back to open source and join Confluent, a company focusing on the open source project Apache Kafka to build mission-critical, scalable infrastructures for messaging, integration and streaming analytics. Confluent is a Silicon Valley startup, still in the beginning of its journey, with a 700% growing business in 2016, and is exjustpected to grow significantly in 2017 again.

In this blog post, I want to share why I see the future for middleware and big data analytics in open source technologies, why I really like Confluent, what I will focus on in the next months, and why I am so excited about this next step in my career.

Tags: , , , , , , , , , , , , , , , , , ,

Visual Analytics + Open Source Deep Learning Frameworks

Posted in Analytics, Big Data, Cloud, Hadoop, Machine Learning on April 24th, 2017 by Kai Wähner

Deep Learning gets more and more traction. It basically focuses on one section of Machine Learning: Artificial Neural Networks. This article explains why Deep Learning is a game changer in analytics, when to use it, and how Visual Analytics allows business analysts to leverage the analytic models built by a (citizen) data scientist.

Tags: , , , , , , , , , , , , , , , , , , , , , ,

Blockchain, Integration, Streaming Analytics, Ethereum, Hyperledger

Posted in Analytics, Blockchain, ESB, Machine Learning, Middleware, SOA on February 24th, 2017 by Kai Wähner

In the fast few weeks, I have published a few articles, slide decks and videos around Blockchain, Middleware, Integration, Streaming Analytics, Ethereum, Hyperledger. I want to share the links here…

Blockchain – The Next Big Thing for Middleware

InfoQ article: “Blockchain – The Next Big Thing for Middleware”

Key takeaways:

  • Blockchain is not just for Bitcoin
  • A blockchain is a protocol and ledger for building an immutable historical record of transactions
  • There is no new technology behind blockchain, just established components combined in a new way
Tags: , , , , ,

Comparison: Data Preparation vs. Inline Data Wrangling in Machine Learning and Deep Learning Projects

Posted in Analytics, Big Data, Business Intelligence, Hadoop on February 13th, 2017 by Kai Wähner

I want to highlight a new presentation about Data Preparation in Data Science projects:

“Comparison of Programming Languages, Frameworks and Tools for Data Preprocessing and (Inline) Data Wrangling  in Machine Learning / Deep Learning Projects”

Data Preparation as Key for Success in Data Science Projects

A key task to create appropriate analytic models in machine learning or deep learning is the integration and preparation of data sets from various sources like files, databases, big data storages, sensors or social networks. This step can take up to 80% of the whole project.

Tags: , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , ,

Streaming Analytics Comparison of Open Source Frameworks, Products, Cloud Services

Posted in Analytics, Big Data, Business Intelligence, Cloud, Hadoop on November 15th, 2016 by Kai Wähner

In November 2016, I am at Big Data Spain in Madrid for the first time. A great conference with many awesome speakers and sessions about very hot topics such as Apache Hadoop, Spark Spark, Streaming Processing / Streaming Analytics and Machine Learning. If you are interested in big data, then this conference is for you! My two talks:

  • How to Apply Machine Learning to Real Time Processing” (see slides and video recording from a similar conference talk).
  • Comparison of Streaming Analytics Options” (the reason for this blog post; an updated version of my talk from JavaOne 2015)
Tags: , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , ,

Comparison of Open Source IoT Integration Frameworks

Posted in API Management, Cloud, Cloud-Native, Microservices, SOA on November 3rd, 2016 by Kai Wähner

In November 2016, I attended Devoxx conference in Casablanca. Around 1500 developers participated. A great event with many awesome speakers and sessions. Hot topics this year besides Java: Open Source Frameworks, Microservices (of course!), Internet of Things (including IoT Integration), Blockchain, Serverless Architectures.

I had three talks:

  • How to Apply Machine Learning to Real Time Processing
  • Comparison of Open Source IoT Integration Frameworks
  • Tools in Action – Live Demo of Open Source Project Flogo

In addition, I was interviewed by the Voxxed team about Big Data, Machine Learning and Internet of Things. The video will be posted on Voxxed website in the next weeks.

Tags: , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , ,

Open Source Project Flogo – Overview, Architecture and Live Demo

Posted in API Management, Cloud, Cloud-Native, Docker, ESB, Microservices, Middleware, SOA on November 3rd, 2016 by Kai Wähner

In October 2016, the open source IoT integration framework Flogo was published as first developer preview. This blog post is intended to give a first overview about Flogo. You can either browse through the slide deck or watch the videos.

Flogo

What is Project Flogo?

In short, Flogo is an ultra-lightweight integration framework powered by Go programming language. It is open source under the permissive BSD license and easily extendable for your own use cases. Flogo is used to develop IoT edge apps or cloud-native / serverless microservices. Therefore, it is complementary to other integration solutions and IoT cloud platforms.

Tags: , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , ,

Machine Learning Applied to Microservices

Posted in Analytics, Big Data, Business Intelligence, Cloud, Docker, Hadoop, Microservices, Middleware on October 20th, 2016 by Kai Wähner

I had two sessions at O’Reilly Software Architecture Conference in London in October 2016. It is the first #OReillySACon in London. A very good organized conference with plenty of great speakers and sessions. I can really recommend this conference and its siblings in other cities such as San Francisco or New York if you want to learn about good software architectures and new concepts, best practices and technologies. Some of the hot topics this year besides microservices are DevOps, serverless architectures and big data analytics respectively machine learning.

Tags: , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , ,

Comparison Of Log Analytics for Distributed Microservices – Open Source Frameworks, SaaS and Enterprise Products

Posted in Analytics, Big Data, Business Intelligence, Cloud, Hadoop, Microservices, SOA on October 20th, 2016 by Kai Wähner

I had two sessions at O’Reilly Software Architecture Conference in London in October 2016. It is the first #OReillySACon in London. A very good organized conference with plenty of great speakers and sessions. I can really recommend this conference and its siblings in other cities such as San Francisco or New York if you want to learn about good software architectures and new concepts, best practices and technologies. Some of the hot topics this year besides microservices are DevOps, serverless architectures and big data analytics.

I want to share the slide of my session about comparing open source frameworks, SaaS and Enterprise products regarding log analytics for distributed microservices:

Tags: , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , ,