Throughout the week, I read a lot of blog-posts, articles, and so forth, that has to do with things that interest me:
- data science
- data in general
- distributed computing
- SQL Server
- transactions (both db as well as non db)
- and other “stuff”
This blog-post is the “roundup” of the things that have been most interesting to me, for the week just ending.
- 2020 Tech Predictions. As it is the beginning of a new year, (and a decade at that), you see prediction posts being published. Here is a predictions post for 2020 from an old colleague from the Developmentor days, Ted Neward.
- Are all samples created equal?: Boosting generative models via importance weighting. This is a very interesting post discussing how to correct the imperfections of generative models. Please go and read the post to see what techniques they use.
- Data Literacy in 10 Minutes. In last weeks roundup, I mentioned that Buck Woody just started a blog post series about data literacy. Well, the link here is to the whole series posted as one single post on LinkedIn.
- The Reunification of .NET 5. The post linked to here discusses how Microsoft tries to merge .NET Framework with .NET Core, in order to have one single .NET offering.
- BUILDING A GRAPH DATABASE USING KAFKA. Another excellent blog post about Kafka by Robert Yokota. In this post, he looks at how you can create a Kafka based graph database. Very exciting! Oh, and if you are interested in distributed systems, and/or, Kafka, his blog should be in your RSS feed.
- Gunnar Morling on Change Data Capture and Debezium. This is an InfoQ podcast. The podcast discusses Debezium, the open-source distributed platform for change data capture (CDC).
- Streams and Tables in Apache Kafka: A Primer. The post linked to here is the first in a series about Kafka, streams and tables. The series should be mandatory reading for anyone working with Kafka! I am making sure that anyone at Derivco working with Kafka reads this series!
Microsoft Ignite The Tour | Johannesburg
The tour comes to Johannesburg January 30 - 31, and I am lucky enough to present at the event. I am doing three presentations, and also some community “booth duty”:
- A Lap Around SQL Server Big Data Cluster
- Simplify and Scale Your Data Pipelines with Azure Delta Lake
- Improve Customer Lifetime Value using Azure Databricks & Delta Lake
The conference is free of charge, so register now and come and say Hi!
That’s all for this week. I hope you enjoy what I did put together. If you have ideas for what to cover, please comment on this post or ping me.