Interesting Stuff - Week 21, 2022

Posted by nielsb on Sunday, May 29, 2022

Throughout the week, I read a lot of blog-posts, articles, and so forth that has to do with things that interest me:

  • AI/data science
  • data in general
  • data architecture
  • streaming
  • distributed computing
  • SQL Server
  • transactions (both db as well as non db)
  • and other “stuff”

This blog post is the “roundup” of the things that have been most interesting to me for the week just ending.

Misc.

  • How to Manage Conda Environments on an Apple Silicon M1 Mac. As I mentioned in a previous roundup, I have switched over to the “dark side” for my travel laptop - I did get myself a MacBook Pro. Coming from a Windows background setting it up and configuring it is sometimes a chore, so I tend to save links to blog posts/articles covering setup/configuration of a Mac. This link is such a post, and it looks at how to manage different Python environments using Conda.
  • Microsoft Build event announcements. Last week (May 24 - 26), Microsoft held its annual Build conference. Many interesting things were announced, and the post linked to looks at some of the data platform announcements. One “biggie” is that SQL Server 2022 has been released into Public Preview.

Azure Data Explorer

  • Deleting individual records in a table. You who read my blog probably know that I like Azure Data Explorer. You also may know that my background is SQL Server. I mention my SQL Server background because sometimes doing things in ADX is somewhat different from what you’d do in SQL Server. An example of this is deleting records. That is what this post looks at. It covers deleting individual records from a table and how it works behind the scenes.
  • Optimized autoscale – predictive planning. Azure Data Explorer has supported autoscaling (out and in) for quite some time. The autoscaling has been reactive, i.e. the scaling is delayed and requires time to alleviate the pressure off the cluster. This post looks at predictive autoscaling. It was recently added, and it tracks main metrics and builds a usage pattern. This allows ADX to plan cluster scale out and scale in ahead of time to further improve cluster performance while reducing cost.

Streaming

  • Benchmarking Azure Event Hubs Premium for Kafka and AMQP workloads. Microsoft recently announced the general availability of Azure Event Hubs Premium. It is a new product tier meant to support high-end event streaming scenarios. This post benchmarks this new product tier against Kafka, and the numbers are quite interesting. It is probably worth having a look at Azure Event Hubs Premium.
  • What’s New in Apache Kafka 3.2.0. The title of this post kind if gives it away. Yes, the post looks at some of the new and improved features in the 3.2 release of Apache Kafka. What is of particular interest to me is the new features in KStreams. I hope they will make way for new functionality in ksqlDB as well.

~ Finally

That’s all for this week. I hope you enjoy what I did put together. Please comment on this post or ping me if you have ideas for what to cover.


comments powered by Disqus