Interesting Stuff - Week 49, 2021

Posted by nielsb on Sunday, December 5, 2021

Throughout the week, I read a lot of blog-posts, articles, and so forth that has to do with things that interest me:

  • AI/data science
  • data in general
  • data architecture
  • streaming
  • distributed computing
  • SQL Server
  • transactions (both db as well as non db)
  • and other “stuff”

This blog-post is the “roundup” of the things that have been most interesting to me for the week just ending.

Big Data

  • The Foundation of Your Lakehouse Starts With Delta Lake. The Databricks Delta Lake has continuously evolved during the last few years, and in May 2021, Delta Lake 1.0 was announced. The evolution of Delta Lake doesn’t stop with the 1.0 release, and this blog post reviews the major features released so far and provides an overview of the upcoming roadmap.
  • What Is Trino And Why Is It Great At Processing Big Data. Trino is an open-source distributed SQL query engine for ad-hoc and batch ETL queries against multiple types of data sources. It previously went under the name of Presto, but due to various reasons, it had to change its name. The post linked to looks at Trino and covers its positives and negatives. At Derivco we have contemplated using Trino. Let us see what the future brings.

Streaming

WIND (What Is Niels Doing)

SQLBITS 2022 - The Greatest Data Show - is just around the corner, and I am happy to announce that I am doing a full-day training session:

Figure 1: SQLBITS 2022 - A Day of Azure Data Explorer

Yes, I am doing a whole day of Azure Data Explorer. Read more at: A Day of Azure Data Explorer.

~ Finally

That’s all for this week. I hope you enjoy what I did put together. Please comment on this post or ping me if you have ideas for what to cover.


comments powered by Disqus