Throughout the week, I read a lot of blog-posts, articles, and so forth that has to do with things that interest me:
- AI/data science
- data in general
- data architecture
- streaming
- distributed computing
- SQL Server
- transactions (both db as well as non db)
- and other “stuff”
This blog post is the “roundup” of the things that have been most interesting to me for the week just ending.
Data Architecture / Engineering
- Engineers Shouldn’t Write ETL: A Guide to Building a High Functioning Data Science Department. This post is from 2016, but still very relevant, and it resonates with me due to what we are doing at Derivco right now. Without giving too much away (go and read the post), the post lays out a blueprint for building a data science team that can pivot and react quickly.
- Data Engineering Annotated Monthly – October 2022. Quite a few interesting things in this monthly Data Engineering newsletter. What I found of particular interest: Apache Doris, the official Spark Docker image, and the article about Data Engineers and plumbers.
- Data’s day of reckoning. First, if you are into data but not subscribing to Benn’s newsletter, click on the link and go ahead and sign up. I will be waiting here… Cool, so where were we? Ah, this post essentially says: we cannot all be Netflix, Google, etc. - expect to have different data and that we should be more targeted in our ambitions.
Azure Data Explorer
- Five Reasons to Dive into Azure Data Explorer. This post contains a link to a downloadable e-book focusing on the five reasons that make ADX a good fit for businesses. If you are interested in ADX (and you should be), download the book. In addition, come to Azure BootCamp South Africa 2022 in two weeks and hear me talk about ADX.
Azure Functions
- Announcing public preview of the Azure SQL trigger for Azure Functions. Azure Functions is “da’ bomb”. We all know that! This post announces something that makes it even better: the ability to trigger a function from changes to a table(s) in Azure SQL DB. We have had for a while the ability to use Azure SQL Bindings to connect and write to Azure SQL DB, but this now gives us the ability to “listen” to changes in the database. Very cool!
Streaming
- Debezium Releases Version 2.0 of Its Change Data Capture Tool. At Derivco, we are looking at using Debezium for quite a few use cases. Therefore the announcement in this post comes as music to our ears.
WIND (What Is Niels Doing)
Well as I mentioned in last weeks roundup I am off to Seattle and the PASS Data Community Summit 2022, where I present:
- ksqlDB - The Real-Time Streaming Database. As the title implies, the presentation is about ksqlDB.
I am flying out this evening and will be back in SA next Sunday. Then the following Saturday (November 26), I present at the Azure BootCamp South Africa 2022 in Cape Town. There I am doing two talks:
- Analyze Billions of Rows of Data in Real-Time Using Azure Data Explorer
- ksqlDB and Azure Serverless Functions: A Match Made in the Cloud
This will be fun, registration is FREE, so you have no excuse not to attend! 😄
~ Finally
That’s all for this week. I hope you enjoy what I did put together. Please comment on this post or ping me if you have ideas for what to cover.
comments powered by Disqus