Interesting Stuff - Week 35, 2022

Posted by nielsb on Sunday, September 4, 2022

Throughout the week, I read a lot of blog-posts, articles, and so forth that has to do with things that interest me:

  • AI/data science
  • data in general
  • data architecture
  • streaming
  • distributed computing
  • SQL Server
  • transactions (both db as well as non db)
  • and other “stuff”

This blog post is the “roundup” of the things that have been most interesting to me for the week just ending.

Misc.

  • Microsoft Dev Box Now in Public Preview. This InfoQ post looks at the recent announcement of the public preview of Microsoft Dev Box, a workstation in the cloud targeted to developers. Integrated with Windows 365, the new service supports any developer IDE, SDK, or tool that runs on Windows and simplifies the onboarding of new developers.

Azure Data Explorer

  • ADX Web Explorer updates – August 2022. We all know that Azure Data Explorer is awesome! One of the cool things with ADX is its Web UI: ADX Web Explorer. The post linked to looks at what new (and enhanced features) there are in the August 2022 release of Web Explorer. The Funnel visual is one of the things I am excited about.

Machine Learning / AI

  • Challenges and practical lessons from building a deep-learning-based ads CTR prediction model. In the ads business being able to predict click-through-rate is essential. This post describes how LinkedIn changed their previous CTR model for a deep-learning-based system, the challenges they experienced, and the lessons learned. Very interesting!
  • Using AI To Improve Your Literature Review. When researching ML/AI topics, finding white papers/publications around the topics you are interested in is vital. However, the past twenty years have witnessed exponential growth in the number of published scientific papers, and it can be hard finding what you need with so many papers to “trawl” through. That’s where what this post covers comes in. A research lab has released free software which allows you to write a query to search for papers you are interested in. The search functionality is backed by AI algorithms based on language models, such as GPT-3, and shows relevant papers to the query you wrote.

Streaming

  • Getting Started with the KRaft Protocol. The work to remove Apache Kafka’s ZooKeeper dependency in favor of the KRaft (Apache Kafka Raft) protocol has been going on for a long time. It is not yet ready for use in production environments, but it is getting close. If you want to test it out, this post lists several key resources to help you learn everything you need about the ins and outs of KRaft.

~ Finally

That’s all for this week. I hope you enjoy what I did put together. Please comment on this post or ping me if you have ideas for what to cover.


comments powered by Disqus