Blockchain, Integration, Streaming Analytics, Ethereum, Hyperledger

Posted in Analytics, Blockchain, ESB, Machine Learning, Middleware, SOA on February 24th, 2017 by Kai Wähner

In the fast few weeks, I have published a few articles, slide decks and videos around Blockchain, Middleware, Integration, Streaming Analytics, Ethereum, Hyperledger. I want to share the links here…

Blockchain – The Next Big Thing for Middleware

InfoQ article: “Blockchain – The Next Big Thing for Middleware”

Key takeaways:

  • Blockchain is not just for Bitcoin
  • A blockchain is a protocol and ledger for building an immutable historical record of transactions
  • There is no new technology behind blockchain, just established components combined in a new way
Tags: , , , , ,

Comparison: Data Preparation vs. Inline Data Wrangling in Machine Learning and Deep Learning Projects

Posted in Analytics, Big Data, Business Intelligence, Hadoop on February 13th, 2017 by Kai Wähner

I want to highlight a new presentation about Data Preparation in Data Science projects:

“Comparison of Programming Languages, Frameworks and Tools for Data Preprocessing and (Inline) Data Wrangling  in Machine Learning / Deep Learning Projects”

Data Preparation as Key for Success in Data Science Projects

A key task to create appropriate analytic models in machine learning or deep learning is the integration and preparation of data sets from various sources like files, databases, big data storages, sensors or social networks. This step can take up to 80% of the whole project.

Tags: , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , ,

Streaming Analytics Comparison of Open Source Frameworks, Products, Cloud Services

Posted in Analytics, Big Data, Business Intelligence, Cloud, Hadoop on November 15th, 2016 by Kai Wähner

In November 2016, I am at Big Data Spain in Madrid for the first time. A great conference with many awesome speakers and sessions about very hot topics such as Apache Hadoop, Spark Spark, Streaming Processing / Streaming Analytics and Machine Learning. If you are interested in big data, then this conference is for you! My two talks:

  • How to Apply Machine Learning to Real Time Processing” (see slides and video recording from a similar conference talk).
  • Comparison of Streaming Analytics Options” (the reason for this blog post; an updated version of my talk from JavaOne 2015)
Tags: , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , ,

Machine Learning Applied to Microservices

Posted in Analytics, Big Data, Business Intelligence, Cloud, Docker, Hadoop, Microservices, Middleware on October 20th, 2016 by Kai Wähner

I had two sessions at O’Reilly Software Architecture Conference in London in October 2016. It is the first #OReillySACon in London. A very good organized conference with plenty of great speakers and sessions. I can really recommend this conference and its siblings in other cities such as San Francisco or New York if you want to learn about good software architectures and new concepts, best practices and technologies. Some of the hot topics this year besides microservices are DevOps, serverless architectures and big data analytics respectively machine learning.

Tags: , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , ,

Comparison Of Log Analytics for Distributed Microservices – Open Source Frameworks, SaaS and Enterprise Products

Posted in Analytics, Big Data, Business Intelligence, Cloud, Hadoop, Microservices, SOA on October 20th, 2016 by Kai Wähner

I had two sessions at O’Reilly Software Architecture Conference in London in October 2016. It is the first #OReillySACon in London. A very good organized conference with plenty of great speakers and sessions. I can really recommend this conference and its siblings in other cities such as San Francisco or New York if you want to learn about good software architectures and new concepts, best practices and technologies. Some of the hot topics this year besides microservices are DevOps, serverless architectures and big data analytics.

I want to share the slide of my session about comparing open source frameworks, SaaS and Enterprise products regarding log analytics for distributed microservices:

Tags: , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , ,

Characteristics of a Good Visual Analytics and Data Discovery Tool

Posted in Analytics, Big Data, Business Intelligence, Hadoop on July 28th, 2016 by Kai Wähner

Visual Analytics and Data Discovery allow analysis of big data sets to find insights and valuable information. This is much more than just classical Business Intelligence (BI). See this article for more details and motivation: “Using Visual Analytics to Make Better Decisions: the Death Pill Example“. Let’s take a look at important characteristics to choose the right tool for your use cases.

Visual Analytics Tool Comparison and Evaluation

Several tools are available on the market for Visual Analytics and Data Discovery. Three of the most well known options are Tableau, Qlik and TIBCO Spotfire. Use the following list to compare and evaluate different tools to make the right decision for your project:

Tags: , , , , , , , , , , , , , , , , , , , ,

Streaming Analytics with Analytic Models (R, Spark MLlib, H20, PMML)

Posted in Analytics, Big Data, Business Intelligence, Hadoop, In Memory, NoSQL on March 3rd, 2016 by Kai Wähner

In March 2016, I had a talk at Voxxed Zurich about “How to Apply Machine Learning and Big Data Analytics to Real Time Processing”.

Kai_Waehner_at_Voxxed_Zurich

Finding Insights with R, H20, Apache Spark MLlib, PMML and TIBCO Spotfire

Big Data” is currently a big hype. Large amounts of historical data are stored in Hadoop or other platforms. Business Intelligence tools and statistical computing are used to draw new knowledge and to find patterns from this data, for example for promotions, cross-selling or fraud detection. The key challenge is how these findings can be integrated from historical data into new transactions in real time to make customers happy, increase revenue or prevent fraud.

Tags: , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , ,

Framework and Product Comparison for Big Data Log Analytics and ITOA

Posted in Analytics, Big Data, Hadoop, Microservices on February 4th, 2016 by Kai Wähner

In February 2016, I presented a brand new talk at OOP in Munich: “Comparison of Frameworks and Tools for Big Data Log Analytics and IT Operations Analytics”. The focus of the talk is to discuss different open source frameworks, SaaS cloud offerings and enterprise products for analyzing big masses of distributed log events. This topic is getting much more traction these days with the emerging architecture concept of Microservices.

Key Take-Aways

  • Log Analytics enables IT Operations Analytics for Machine Data
  • Correlation of Events is the Key for Added Business Value
  • Log Management is complementary to other Big Data Components
Tags: , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , ,

Comparison of Stream Processing Frameworks and Products

Posted in Analytics, Business Intelligence, Hadoop, In Memory on October 25th, 2015 by Kai Wähner

See how products, libraries, and frameworks that full under ‘streaming data analytics’ use cases are categorized and compared.

Streaming Analytics processes data in real time while it is in motion. This concept and technology emerged several years ago in financial trading, but it is growing increasingly important these days due to digitalization and Internet of Things (IoT). The following slide deck from a recent talk at a conference covers:

  • Real world success stories from different industries (Manufacturing, Retailing, Sports)
  • Alternative Frameworks and Products for Stream Processing
  • Complementary Relationship to Data Warehouse, Apache Hadoop, Statistics, Machine Learning, Open Source R, SAS, Matlab, etc.
Tags: , , , , , , , , , , , , , , , , , , , , ,

Difference between a Data Warehouse and a Live Datamart?

Posted in Analytics, Big Data, Business Intelligence, In Memory on October 9th, 2015 by Kai Wähner

Data Warehouses have existed for many years in almost every company. While they are still as good and relevant for the same use cases as they were 20 years ago, they cannot solve new, existing challenges and those sure to come in a ever-changing digital world. The upcoming sections will clarify when to still use a Data Warehouse and when to use a modern Live Datamart instead.

What is a Data Warehouse (DWH)?

A Data Warehouse is a central repository of integrated data from more disparate sources. It stores historical data to create analytical reports for knowledge workers throughout the enterprise. A DWH includes a server, which stores the historical data and a client for analysis and reporting.

Tags: , , , , , , , , , , , , ,