Interesting Stuff - Week 45, 2021

Posted by nielsb on Sunday, November 7, 2021

Throughout the week, I read a lot of blog-posts, articles, and so forth that has to do with things that interest me:

  • AI/data science
  • data in general
  • data architecture
  • streaming
  • distributed computing
  • SQL Server
  • transactions (both db as well as non db)
  • and other “stuff”

This blog-post is the “roundup” of the things that have been most interesting to me for the week just ending.

SQL Server

  • Announcing the SQL Server 2022 Early Adoption Program. The Microsoft Ignite conference was held during the week just gone by. As expected, there were quite a few announcements around new and improved products. One such announcement was related to this post; the next version of SQL Server is in the works, and Microsoft has just opened the Early Adoption Program (EAP) for SQL Server 2022. If you are interested in shaping the next version of SQL Server, I suggest you sign up!

Streaming

  • Architecting a Kafka-centric Retail Analytics Platform — Part 2. In last weeks roundup, I linked to the first post in a series, of which this post is the second instalment. In this post, the author, Dunith Dhanushka, looks at data ingestion into Kafka in detail. As the series is about retail, the post looks at the retail data landscape, what data to capture, and how it can be ingested into Kafka using the Kafka ecosystem.
  • Readings in Streaming Database Systems. This post is the introduction/prequel to a series looking at streaming databases: The Streaming Database Series. This post gives a high level overview of what’s coming in the posts in the series. It also provides an overview of streaming databases.
  • The Future of SQL: Databases Meet Stream Processing. This post is the first in the The Streaming Database Series mentioned above. The post discusses why the database world needs enhancements to handle data both at rest and in transit. The enhancements looked at are the STREAM abstraction, new query types, and extended semantics for handling time.
  • 4 Key Design Principles and Guarantees of Streaming Databases. The second in the series mentioned above, this post summarizes a few challenging design principles for modern streaming databases that act as a source of truth for stream data management and query processing systems. The post also presents ksqlDB’s persistent log-based approach to following the design principles.
  • How Do You Change a Never-Ending Query?. The post linked to is the third in the The Streaming Database Series. The post looks at how we can evolve queries in a streaming database and some of the pitfalls that may occur.

WIND (What Is Niels Doing)

Now is the conference season, and I am presenting at Cloud Data Driven:

Figure 1: Cloud Data Driven

I will be talking about how to calculate Customer Lifetime Value using Azure Databricks. If you are interested, the registration is FREE, so go ahead and register.

As you see, the presentation is on Thursday, November 11. If you read the last week’s roundup, you may have noticed this:

Figure 2: PASS Session

Yes, I am doing a live PASS Q&A the same day. The PASS session id for my Analyze Billions of Rows of Data in Real-Time Using Azure Data Explorer presentation. Fortunately, the PASS session is 3:15 - 3:45 pm UTC, and my Cloud Data Driven presentation is at 4 pm UTC. Phew!

So here is an idea; get a double dose of Niels:

Yay, Niels on Thursday from 3:15 UTC. What could be better than that? Actually, don’t answer that question.

~ Finally

That’s all for this week. I hope you enjoy what I did put together. Please comment on this post or ping me if you have ideas for what to cover.


comments powered by Disqus