Throughout the week, I read a lot of blog posts, articles, and so forth that have to do with things that interest me:
- AI/data science
- data in general
- data architecture
- streaming
- distributed computing
- SQL Server
- transactions (both db as well as non db)
- and other “stuff”
This blog post is the “roundup” of the things that have been most interesting to me for the week just ending.
Data Architecture
- Data chess game: Databricks vs. Snowflake, part 1. There is no doubt that Snowflake and Databricks are competing to become the leading cloud data platform vendor. This post looks at how the two vendors come from different backgrounds (data lake and warehousing) and how they are now “duking” it out in the lakehouse.
- Recap of Databricks Lakehouse Platform Announcements at Data and AI Summit 2022. The linked post summarises the major Databricks Lakehouse Platform innovations and contributions to open source and data and AI projects announced at the Data and AI Summit 2022.
Azure Data Explorer
- Scalable Data Analytics with Azure Data Explorer: Modern ways to query, analyze, and perform real-time data analysis on large volumes of data. A while ago, I was asked to do an editorial review of a book about Azure Data Explorer. Well, the link here is to that book. If you are into ADX, then this book is a must-read.
Streaming
- 4 Must-Have Tests for Your Apache Kafka CI/CD with GitHub Actions. This post looks at applying GitOps principles to the development lifecycle of a Kafka client application using GitHub Actions: for testing locally and in Confluent Cloud, with and without Schema Registry, and for schema evolution.
- The Heart of the Data Mesh Beats Real-Time with Apache Kafka. We have all heard about the “data mesh” by now. A requirement of the data mesh infrastructure is that it is real-time, reliable, and scalable. This post explores how Apache Kafka, as an open and scalable decentralized real-time platform, can be the basis of a data mesh infrastructure and – complemented by many other data platforms like a data warehouse, data lake, and lakehouse – solve real business problems.
~ Finally
That’s all for this week. I hope you enjoy what I have put together. Please comment on this post or ping me if you have ideas for what to cover.