Throughout the week, I read a lot of blog-posts, articles, and so forth that has to do with things that interest me:
- AI/data science
- data in general
- data architecture
- streaming
- distributed computing
- SQL Server
- transactions (both db as well as non db)
- and other “stuff”
This blog post is the “roundup” of the things that have been most interesting to me for the week just ending.
AI/ML
- Building A Virtual Machine inside ChatGPT. In last week’s roundup, I mentioned ChatGPT, and here is more about its coolness. This post looks at running a virtual machine inside ChatGPT! I don’t know about you, but to me, this is plain scary!
- AI Homework. Related to ChatGPT - this post looks, among other things, at how ChatGPT works under the covers. Very interesting!
Data Architecture
- Build Reliable and Cost Effective Streaming Data Pipelines With Delta Live Tables’ Enhanced Autoscaling. Earlier this year, Databricks announced the general availability of Delta Live Tables (DLT). An ETL framework which introduces a simple, declarative approach to building reliable data pipelines. This post looks at a new feature of DTL: Enhanced Autoscaling for Delta Live Tables (DLT), which allows analysts and data engineers to quickly create production-ready streaming or batch data pipelines.
- How analysis dies. I found this post very thought-provoking, and I liked it a lot. So much so that I am forwarding it to other people at Derivco. However, I have difficulty summarizing it, so why don’t you just read it yourselves 😄!
Streaming
- What Are Apache Kafka Consumer Group IDs?. This an excellent post about an often misunderstood concept in Kafka: consumer groups. If you write any Kafka applications, read this post!
~ Finally
That’s all for this week. I hope you enjoy what I did put together. Please comment on this post or ping me if you have ideas for what to cover.
comments powered by Disqus