Interesting Stuff - Week 35, 2021

Throughout the week, I read a lot of blog-posts, articles, and so forth that have to do with things that interest me:

AI/data science
data in general
data architecture
streaming
distributed computing
SQL Server
transactions (both db as well as non db)
and other “stuff”

This blog-post is the “roundup” of the things that have been most interesting to me for the week just ending.

Big Data / Machine Learning

Churn Prediction With BigQueryML to Increase Mobile Game Revenue. Given what we do at Derivco, this post is exciting. It looks at how machine learning can identify high-value mobile game players who are dangerously close to churning. Very interesting!

Data Architecture

Five Predictions for the Future of the Modern Data Stack. This post looks at the developments of the modern data stack and the bright side of “Modern Data Stack V2”, focusing on AI, Data Sharing, Data Governance, Streaming & Application Serving.

Azure Data Explorer

Timeseries Analytics Capabilities, and Azure Data Explorer (ADX). I guess that for those of you who read my blog, it doesn’t come as a surprise that I have a thing for Azure Data Explorer. The post here looks at time-series analytics and explores the core functionality typical of time-series data processing applications. It further looks at how functionality built into ADX is exceptionally well suited to meeting these challenges head-on.

Streaming

Real-time anomaly detection with Apache Kafka and Python. In this post, the author looks at making real-time anomaly predictions over streaming data coming from Kafka using Python.
How ksqlDB Works: Internal Architecture and Advanced Features. To use ksqlDB effectively, you should, apart from being familiar with its features and syntax, also understand what’s happening “under the covers” of ksqlDB. This post covers some of the “under the covers” topics and points to resources at Confluent Developer.

WIND (What Is Niels Doing)

By now, you probably know that I:

Figure 1: Breakout Session

Yes, as we see in Figure 1, I am presenting at the 2021 Data Platform Summit:

How to do Real-Time Analytics Using Apache Kafka and Azure Data Explorer. We are looking at how to stream events from Apache Kafka to Azure Data Explorer and perform user-facing analytics in near real-time.

I mentioned in a previous roundup how the organizers have managed to increase the capacity of the virtual platform to 10,000! So, they have opened up FREE booking for LIVE attendance for a limited time. They have an internal quota, and once that is full, the free booking will close. So, what are you waiting for? Hurry up to register for FREE!

Oh, I am not only doing the conference session above, but also a post-conference training class; 4 hours per day over 2 days:

Big Data & Analytics with SQL Server 2019 Big Data Cluster.

There are still a couple of seats (virtual) available for my class, so - if you are interested - register here.

~ Finally

That’s all for this week. I hope you enjoy what I put together. Please comment on this post or ping me if you have ideas for what to cover.