Interesting Stuff - Week 26, 2021

Posted by nielsb on Sunday, June 27, 2021

Throughout the week, I read a lot of blog-posts, articles, and so forth that has to do with things that interest me:

  • AI/data science
  • data in general
  • data architecture
  • streaming
  • distributed computing
  • SQL Server
  • transactions (both db as well as non db)
  • and other “stuff”

This blog-post is the “roundup” of the things that have been most interesting to me for the week just ending.

Machine Learning / AI

  • Need for Data-centric ML Platforms. When we do ML projects, it comes naturally to have a model-centric approach to the project. This Databricks blog post argues that having a data-centric view increases the chances for success. Very interesting!

Big Data

  • Azure Data Explorer and the Kusto Query Language. This blog post is part one of a two-part series looking at Azure Data Explorer (ADX) and Kusto; its query language. This post introduces ADX and looks briefly into what you can do with Kusto. After having read this post, I look forward to part two!
  • Update on geospatial functions. In last weeks roundup, I mentioned a post about geospatial functions in Apache Pinot. The post I link to is also about geospatial functions, but this time in Azure Data Explorer. Man, you can do some cool stuff with geospatial queries!
  • Get Your Free Copy of Delta Lake: The Definitive Guide (Early Release). Databricks has for a long time been talking about the Lakehouse architecture and how their Delta Lake table format can help create Lakehouses. This post announces the first book about Delta Lake and Lakehouses. I have already downloaded the book, and I suggest you do the same.

Streaming / Data Architecture

  • Saxo Bank’s Best Practices for a Distributed Domain-Driven Architecture Founded on the Data Mesh. I have covered posts about the Data Mesh in quite a few previous roundups, and this post is another one around data meshes. It looks at how Saxo Bank in Denmark has moved to a domain-driven data architecture with Kafka as an integral part and the lessons learned/best practices. The post also contains very, very useful references! The post is a must-read if you are interested in data architecture!

WIND (What Is Niels Doing)

Figure 1: I Am Speaking

As I mentioned in the roundup for week 24, I am speaking about Apache Kafka and Azure Data Explorer at the 2021 Data Platform Summit. So right now, I am researching and prepping for that.

~ Finally

That’s all for this week. I hope you enjoy what I did put together. Please comment on this post or ping me if you have ideas for what to cover.


comments powered by Disqus