Throughout the week, I read a lot of blog-posts, articles, and so forth that has to do with things that interest me:
- AI/data science
- data in general
- data architecture
- streaming
- distributed computing
- SQL Server
- transactions (both db as well as non db)
- and other “stuff”
This blog post is the “roundup” of the things that have been most interesting to me for the week just ending.
Misc.
- 10 VSCode Productivity Hacks for Data Scientists. Yes, I know; the title of this post says Data Scientists. With that in mind, I should have put it under the AI/ML section. However, the “hacks” in the post are not only useful for Data Scientists but for anyone using Python and VSCode. The post is not so much about hacks as it is about useful VSCode extensions, and I picked up a couple that I want to test.
Azure Data Explorer
-
Fun With KQL – Format_DateTime. I hate dealing with date-time! That’s why I really like the Kusto Query Language (KQL) feature that this post discusses: the
format_datime
function. Read the post to see how it can be used. Very cool! - Kusto Detective Agency. The Kusto Detective Agency is a way to promote Azure Data Explorer and the KQL (the Kusto Query Language). You join the “agency” by creating a free Azure Data Explorer cluster and providing an answer to a question. Looks really cool. I joined earlier in the week, and I’ll keep you informed on how it goes.
AI/ML
- Serving ML Models with Apache Spark. This post covers more than what the title says. Yes, the post definitely looks at how to serve up MLLib models, but it also gives a comprehensive introduction to Spark. Very useful!
Data Architecture
- Stretching my Legs in the Data Engineering Ecosystem in 2022. This is the first post in a series by Mr Kafka ( Robin Moffat), where he looks at what is going on in the data engineering world nowadays. This is a must-read for anyone interested in data engineering!
- Data Engineering: Resources. This is from the link above. This post lists useful links to resources around data engineering.
Streaming
- Real-time analytics on network flow data with Apache Pinot. Having observability of your infrastructure is of vital importance to be able to detect, diagnose, and remediate issues. This post looks at what LinkedIn has done to provide observability into network flows. Very interesting!
- Real-Time Gaming Infrastructure for Millions of Users with Apache Kafka, ksqlDB, and WebSockets. This post which shows a real-time gamification demo to integrate gamers via WebSockets to the data streaming backend infrastructure is interesting to me from a couple of points. It is interesting as we ( Derivco) are in the iGaming industry. It is also interesting as we are doing something very similar right now!
WIND (What Is Niels Doing)
Just reminding you of this (especially you, Lee-Anne lol)
Don’t forget to register for Data Platform Summit 2022. Having registered, you can attend my session: ksqlDB - The Real Time Streaming Database, and obviously many other cool sessions as well.
Oh, and attending the conference is FREE, so what are you waiting for: register!
~ Finally
That’s all for this week. I hope you enjoy what I did put together. Please comment on this post or ping me if you have ideas for what to cover.
comments powered by Disqus