Apache Kafka Streams + Machine Learning (Spark, TensorFlow, H2O.ai)

Posted in Analytics, Apache Kafka, Apache Spark, Big Data, Confluent, Hadoop, Integration, Kafka Connect, Kafka Streams, Machine Learning, Messaging, Microservices, Open Source, Stream Processing on May 23rd, 2017 by Kai Wähner

I started at Confluent in May 2017 to work as Technology Evangelist focusing on topics around the open source framework Apache Kafka. I think Machine Learning is one of the hottest buzzwords these days as it can add huge business value in any industry. Therefore, you will see various other posts from me around Apache Kafka (messaging), Kafka Connect (integration), Kafka Streams (stream processing), Confluent’s additional open source add-ons on top of Kafka (Schema Registry, Replicator, Auto Balancer, etc.). I will explain how to leverage all this for machine learning and other big data technologies in real world production scenarios.

Tags: , , , , , , , , , , , , , , , , , , ,

Visual Analytics + Open Source Deep Learning Frameworks

Posted in Analytics, Big Data, Cloud, Hadoop, Machine Learning on April 24th, 2017 by Kai Wähner

Deep Learning gets more and more traction. It basically focuses on one section of Machine Learning: Artificial Neural Networks. This article explains why Deep Learning is a game changer in analytics, when to use it, and how Visual Analytics allows business analysts to leverage the analytic models built by a (citizen) data scientist.

Tags: , , , , , , , , , , , , , , , , , , , , , ,

Streaming Analytics Comparison of Open Source Frameworks, Products, Cloud Services

Posted in Analytics, Big Data, Business Intelligence, Cloud, Hadoop on November 15th, 2016 by Kai Wähner

In November 2016, I am at Big Data Spain in Madrid for the first time. A great conference with many awesome speakers and sessions about very hot topics such as Apache Hadoop, Spark Spark, Streaming Processing / Streaming Analytics and Machine Learning. If you are interested in big data, then this conference is for you! My two talks:

  • How to Apply Machine Learning to Real Time Processing” (see slides and video recording from a similar conference talk).
  • Comparison of Streaming Analytics Options” (the reason for this blog post; an updated version of my talk from JavaOne 2015)
Tags: , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , ,

Comparison of Open Source IoT Integration Frameworks

Posted in API Management, Cloud, Cloud-Native, Microservices, SOA on November 3rd, 2016 by Kai Wähner

In November 2016, I attended Devoxx conference in Casablanca. Around 1500 developers participated. A great event with many awesome speakers and sessions. Hot topics this year besides Java: Open Source Frameworks, Microservices (of course!), Internet of Things (including IoT Integration), Blockchain, Serverless Architectures.

I had three talks:

  • How to Apply Machine Learning to Real Time Processing
  • Comparison of Open Source IoT Integration Frameworks
  • Tools in Action – Live Demo of Open Source Project Flogo

In addition, I was interviewed by the Voxxed team about Big Data, Machine Learning and Internet of Things. The video will be posted on Voxxed website in the next weeks.

Tags: , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , ,

Open Source Project Flogo – Overview, Architecture and Live Demo

Posted in API Management, Cloud, Cloud-Native, Docker, ESB, Microservices, Middleware, SOA on November 3rd, 2016 by Kai Wähner

In October 2016, the open source IoT integration framework Flogo was published as first developer preview. This blog post is intended to give a first overview about Flogo. You can either browse through the slide deck or watch the videos.

Flogo

What is Project Flogo?

In short, Flogo is an ultra-lightweight integration framework powered by Go programming language. It is open source under the permissive BSD license and easily extendable for your own use cases. Flogo is used to develop IoT edge apps or cloud-native / serverless microservices. Therefore, it is complementary to other integration solutions and IoT cloud platforms.

Tags: , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , ,

Machine Learning Applied to Microservices

Posted in Analytics, Big Data, Business Intelligence, Cloud, Docker, Hadoop, Microservices, Middleware on October 20th, 2016 by Kai Wähner

I had two sessions at O’Reilly Software Architecture Conference in London in October 2016. It is the first #OReillySACon in London. A very good organized conference with plenty of great speakers and sessions. I can really recommend this conference and its siblings in other cities such as San Francisco or New York if you want to learn about good software architectures and new concepts, best practices and technologies. Some of the hot topics this year besides microservices are DevOps, serverless architectures and big data analytics respectively machine learning.

Tags: , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , ,

Comparison Of Log Analytics for Distributed Microservices – Open Source Frameworks, SaaS and Enterprise Products

Posted in Analytics, Big Data, Business Intelligence, Cloud, Hadoop, Microservices, SOA on October 20th, 2016 by Kai Wähner

I had two sessions at O’Reilly Software Architecture Conference in London in October 2016. It is the first #OReillySACon in London. A very good organized conference with plenty of great speakers and sessions. I can really recommend this conference and its siblings in other cities such as San Francisco or New York if you want to learn about good software architectures and new concepts, best practices and technologies. Some of the hot topics this year besides microservices are DevOps, serverless architectures and big data analytics.

I want to share the slide of my session about comparing open source frameworks, SaaS and Enterprise products regarding log analytics for distributed microservices:

Tags: , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , ,

Streaming Analytics with Analytic Models (R, Spark MLlib, H20, PMML)

Posted in Analytics, Big Data, Business Intelligence, Hadoop, In Memory, NoSQL on March 3rd, 2016 by Kai Wähner

In March 2016, I had a talk at Voxxed Zurich about “How to Apply Machine Learning and Big Data Analytics to Real Time Processing”.

Kai_Waehner_at_Voxxed_Zurich

Finding Insights with R, H20, Apache Spark MLlib, PMML and TIBCO Spotfire

Big Data” is currently a big hype. Large amounts of historical data are stored in Hadoop or other platforms. Business Intelligence tools and statistical computing are used to draw new knowledge and to find patterns from this data, for example for promotions, cross-selling or fraud detection. The key challenge is how these findings can be integrated from historical data into new transactions in real time to make customers happy, increase revenue or prevent fraud.

Tags: , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , ,

Microservices = Death of the ESB? (2016, Meetup Dublin)

Posted in API Management, Big Data, Cloud, Docker, EAI, ESB, IT Conferences, Microservices, SOA on January 29th, 2016 by Kai Wähner

I was invited to speak at Microservices Meetup Dublin this week. I updated my slide deck “Microservices – Death of the ESB?” … The meetup was fully booked with a waiting list; around 120 attendees came to Gild‘s office. (see attached link).

If you have not seen the slide deck last year, you should definitely take a look at this updated version with more recent information. I also incorporated valuable information from discussions with attendees in 2015’s sessions about this topic.

Tags: , , , , , , , , , , , , , , , , , , , , , , , , , , , , , ,

Comparison of Stream Processing Frameworks and Products

Posted in Analytics, Business Intelligence, Hadoop, In Memory on October 25th, 2015 by Kai Wähner

See how products, libraries, and frameworks that full under ‘streaming data analytics’ use cases are categorized and compared.

Streaming Analytics processes data in real time while it is in motion. This concept and technology emerged several years ago in financial trading, but it is growing increasingly important these days due to digitalization and Internet of Things (IoT). The following slide deck from a recent talk at a conference covers:

  • Real world success stories from different industries (Manufacturing, Retailing, Sports)
  • Alternative Frameworks and Products for Stream Processing
  • Complementary Relationship to Data Warehouse, Apache Hadoop, Statistics, Machine Learning, Open Source R, SAS, Matlab, etc.
Tags: , , , , , , , , , , , , , , , , , , , , ,