Throughout the week, I read a lot of blog-posts, articles, and so forth, that has to do with things that interest me:
- data science
- data in general
- distributed computing
- SQL Server
- transactions (both db as well as non db)
- and other “stuff”
This blog-post is the “roundup” of the things that have been most interesting to me, for the week just ending.
Distributed Computing
- Modern Banking in 1500 Microservices. This is an InfoQ presentation where the presenters explain how Monzo builds, operates, observes and maintains the banking infrastructure. They talk about how they compose microservices to add new functionality, deployment and incident tooling, monitoring practices and how they share knowledge effectively. Very interesting!
- Pat Helland on Software Architecture and Urban Planning. The link here is to a podcast by the legendary Pat Helland where he talks about the relationship between software architecture and urban planning. Pat explores planning for future growth, regulations/standards, and communication practices that cities - and software architecture - had to evolve to use. He uses these comparisons to distil lessons that architects can use in building distributed systems. If you are only going to listen to one podcast today, this week, month, etc., listen to this one!
Azure Data Studio
- The September 2020 release of Azure Data Studio is now available. As the title of the post says: a new version of Azure Data Studio is out. It has quite a few new features and enhancements, new notebook functionality, support for Kusto Query Language, (KQL), and other stuff. After I have published this post I will download this version and take it through its paces.
SQL Server
- Open sourcing the R and Python language extensions for SQL Server. Wow, this is big! Not only have the SQL Server language extensions for R and Python been open-sourced, but you can now use whatever version of R and Python you want, (well, more or less). Look out for some blog posts from me around this. Here are two links if you are interested in how it worked up until now: Sql Server Machine Learning Services, and Sql Server Extensibility Framework.
Streaming
- Apache Kafka DevOps with Kubernetes and GitOps. The post linked to here looks at how we can do DevOps with Kafka and Kubernetes.
- Streaming logging pipeline of Home timeline prediction system. Twitter recently built a streaming data logging pipeline for its home timeline prediction system using Apache Kafka and Kafka Streams to replace the existing offline batch pipeline. This post looks at the architecture and how Twitter did it. In the post, it is mentioned that Twitter had to customise how KStreams left join semantic, and that is covered in the post: Building a Machine Learning Logging Pipeline with Kafka Streams at Twitter.
~ Finally
That’s all for this week. I hope you enjoy what I did put together. If you have ideas for what to cover, please comment on this post or ping me.
comments powered by Disqus