Throughout the week, I read a lot of blog-posts, articles, and so forth that has to do with things that interest me:
- AI/data science
- data in general
- data architecture
- distributed computing
- SQL Server
- transactions (both db as well as non db)
- and other “stuff”
This blog post is the “roundup” of the things that have been most interesting to me for the week just ending.
- Data Warehousing Modeling Techniques and Their Implementation on the Databricks Lakehouse Platform. This post talks about the different data modelling techniques supported by the Databricks Lakehouse Platform and how each fits within each layer of the Bronze, Silver, and Gold architecture. For me, this post is very timely as we are right now implementing a Data Lakehouse here at Derivco!
- Migrating to a Multi-Cluster Managed Kafka with 0 Downtime. This post discusses how they at Wix migrated 2000 microservices from on-prem Kafka clusters to Confluent Cloud. The post looks at critical design decisions and best practices for this type of migration.
- Announcing GA launch of Kafka Trigger extension on Azure Functions. As the title implies, this post announces the GA release of Kafka triggers. Kafka triggers enable you to invoke functions in response to messages in Kafka topics and let you write values/messages out to Kafka topics using an output binding. Very, very cool, and this comes at the right time for a project we are starting at Derivco!
- Applying Data Pipeline Principles in Practice: Exploring the Kafka Summit Keynote Demo. The Kafka Summit 2022 key-note demo showed how a fictional airline company merges with another airline company and wants a data pipeline built on Confluent to enable analytical and operational workstreams. This blog post deconstructs the demo through excerpts of real code used to create it. Very interesting!
WIND (What Is Niels Doing)
Figure 1: Microsoft Open Hack
So last week, Microsoft ran an OpenHack here at Derivco. The topic of the OpenHack was Modern Data Warehousing, covering Azure Data Lake Storage, Azure Databricks, Azure Synapse Analytics and other cool stuff. Anyway, the OpenHack ran for three days, and on the last day, Microsoft had an event in our canteen where Derivco’s employees had the chance to talk to various people from Microsoft.
Part of this event was the “selfie booth”, and in the picture above (Figure 1), you see me sitting beside Lee-Anne James, who is our contact at Microsoft for all Data and AI. I cannot express enough how awesome she is. She really goes all out for us! Thanks, Lee-Anne!
So I mentioned the “selfie booth”, and part of the deal was that if you took a selfie and published it on your blog, Twitter, LinkedIn, etc., you got some swag:
Figure 2: Azure Synapse Analytics Swag
In Figure 1 you see the swag I managed to score - not bad!
Oh, I also managed to publish a blog post around some “weird” Git errors:
- Solution to GIT: “unsafe repository (‘some-repo’ is owned by someone else)”. I recently re-formatted my PC, and after everything was installed again, I got a “weird” error when I tried to do a
git pull. The post looks at why I got the error and how to fix it.
That’s all for this week. I hope you enjoy what I did put together. Please comment on this post or ping me if you have ideas for what to cover.
comments powered by Disqus