Interesting Stuff - Week 14, 2023

This week’s exciting stuff: cool video from an Azure Data Explorer (ADX) workshop. Speaking of ADX, I published my third post in the series of building a real-time leaderboard using Kafka and ADX. On the AI/ML side, “stuff” about building a GPT-powered application and a really interesting intro to reinforcement learning.

It wouldn’t be a roundup without something about Kafka. So, a link to a post about Kafka and fraud detection. Also, a link to the second post in the series about machine learning and streaming data.

Azure Data Explorer

Azure Data Explorer L300 workshop – Time Series analytics, high concurrency apps and visualization. This YouTube video is from an Azure Data Explorer workshop. The video is from the last day of the workshop, and it covers time series analytics, optimising for concurrency, and visualising data. It is excellent!
Develop a Real-Time Leaderboard Using Kafka and Azure Data Explorer - II. This is my third post in the series about using Kafka and Azure Data Explorer (ADX) to develop a real-time leaderboard for online gaming. In this post, I cover more about ingesting data into ADX and some “cool stuff” related to that. Specifically, what to do if you need to update/transform the data you ingest.

AI/ML

LangChain 101: Build Your Own GPT-Powered Applications. LangChain is a framework for developing applications powered by language models. This post explores LangChain and what you can build with it. The post also contains an example of creating a simple question-answering app with LangChain.
Our Learnings from the Early Days of Generative AI. The blog post discusses LinkedIn’s experience with generative AI in its early stages, including the challenges faced and lessons learned. It emphasizes the importance of ethics, transparency, and collaboration when working with AI. It suggests that organizations prioritize investing in people and technology to ensure responsible and effective use of AI in the future.
Introduction to Reinforcement Learning Series. Tutorial 1; Motivation, States, Actions, and Rewards. This article is the first in a series of tutorials around reinforcement learning: Introduction to Reinforcement Learning. The article covers the basic concepts of RL, including the Markov Decision Process (MDP), rewards, and value functions. It also provides an overview of different RL algorithms, such as Q-learning and policy gradient methods, and discusses their strengths and weaknesses. The article concludes by highlighting some of the current challenges in RL and potential future developments in the field.

Streaming

Fraud Detection with Apache Kafka, KSQL and Apache Flink. Fraud detection is vital in a digital world, and real-time data processing with Apache Kafka is the de facto standard to correlate and prevent fraud continuously before it happens. This blog post explores case studies for fraud prevention from companies such as Paypal, Capital One, ING Bank, Grab, and Kakao Games that leverage stream processing technologies like Kafka Streams, KSQL, and Apache Flink.
Uniting the Machine Learning and Data Streaming Ecosystems - Part 2. This is the second post in a series looking at machine learning and streaming data. This post explores the opportunities and pitfalls of SQL as an ecosystem bridge. It looks at solutions to interface the JVM and Python interpreter and, finally, dives into code examples that show how to build a streaming machine-learning solution today.

~ Finally

That’s all for this week. I hope you enjoy what I did put together. Please comment on this post or ping me if you have ideas for what to cover.

Azure Data Explorer

AI/ML

Streaming

~ Finally

CATALOG