Throughout the week, I read a lot of blog-posts, articles, and so forth, that has to do with things that interest me:
- data science
- data in general
- distributed computing
- SQL Server
- transactions (both db as well as non db)
- and other “stuff”
This blog-post is the “roundup” of the things that have been most interesting to me, for the week just ending.
- Building a Large-scale Transactional Data Lake at Uber Using Apache Hudi. Hudi, developed by Uber, is similar to Delta Lake in that it enables ACID semantics, (plus more), on data lakes. This post describes Uber’s journey to building, releasing, and optimizing Apache Hudi.
- Learning about distributed systems: where to start?. This post by Murat looks at resources to really learn and understand distributed computing. I urge you to look and read the resources Murat lists!
- Databases and Stream Processing: A Future of Consolidation. In this InfoQ presentation, the presenter digs into why both stream processors and databases are necessary from a technical standpoint but also by exploring industry trends that make consolidation in the future far more likely. He examines how these trends map onto common approaches from active databases like MongoDB to streaming solutions like Flink, Kafka Streams or ksqlDB.
- Use Cases and Architectures for Apache Kafka and Event Streaming in the Gaming Industry. Do.Not.Miss.This! The link here is for the registration page for a Confluent event where Kai Wähner looks at how Kafka and Confluent Cloud provides a scalable, reliable and efficient infrastructure to make gamers happy and gaming companies successful.
WIND (What Is Niels Doing)
Lockdown, lockdown, lockdown! What I would not give to be able to go out and have dinner with good friends at a restaurant right now!
Apart from that, things are not that bad. I am still working on the follow-up to the A Lap Around SQL Server 2019 Big Data Cluster: Background & Technology post. The upcoming post looks at the architecture. So when will it be published you may ask - I have no idea!
On Wednesday, (June 17), I am co-hosting the second The Azure Sketches, Architect Series webinar together with Gordon Beeming and Lawrance Reddy. We use a combination of traditional drawing & sketching with actual solution building to help reinforce the thought process in building Azure solutions and all of the dependencies in between. If you are interested, here is the link for registration.
That’s all for this week. I hope you enjoy what I did put together. If you have ideas for what to cover, please comment on this post or ping me.