Interesting Stuff - Week 33, 2023

Posted by nielsb on Sunday, August 20, 2023

🚀 In this post: Unveil AI/ML marvels and streaming sensations! Python Vector Databases, Real-Time ML, Apache Flink, Confluent Platform, and more.

Plus, relive the record-breaking Data Saturday Durban event! Join the innovation journey.


  • Python Vector Databases and Vector Indexes: Architecting LLM Apps. This post provides an overview of vector databases and vector indexes in Python. It explains what they are, how they work, and their use cases and benefits. It also compares and contrasts vector databases and indexes, providing examples of Python libraries and tools that implement them. It concludes by highlighting the potential of vector search for developing AI-powered applications.
  • Real-Time Machine Learning: Architecture and Challenges. This InfoQ presentation covers a topic very dear to me, and one we are trying to solve at Derivco: Real-Time Machine Learning. The presenter discusses the value of fresh data, different types of architecture, and the challenges of online prediction. If you are into ML and real-time, you should watch this presentation!
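
To make the vector search idea from the first post above a bit more concrete, here is a minimal sketch in plain Python. It is not taken from the linked post and does not use any particular vector database: the document ids, the three-dimensional "embeddings", and the `top_k` helper are all made up for illustration. It does a brute-force cosine-similarity scan, which is the baseline that vector indexes try to speed up.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length, non-zero vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


def top_k(query, vectors, k=2):
    """Return the ids of the k stored vectors most similar to the query.

    This is an exhaustive scan over every stored vector; a real vector
    index would avoid comparing against all of them.
    """
    scored = [(cosine_similarity(query, vec), doc_id)
              for doc_id, vec in vectors.items()]
    scored.sort(reverse=True)
    return [doc_id for _, doc_id in scored[:k]]


# Hypothetical toy embeddings (real ones have hundreds of dimensions).
documents = {
    "kafka-intro": [0.9, 0.1, 0.0],
    "flink-windows": [0.8, 0.3, 0.1],
    "rugby-report": [0.0, 0.1, 0.9],
}
query = [1.0, 0.2, 0.0]

print(top_k(query, documents, k=2))
```

With these toy numbers, the two streaming-related documents rank ahead of the rugby one, since their vectors point in nearly the same direction as the query.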


  • Stream Processing Simplified: An Inside Look at Flink for Kafka Users. The blog post introduces Apache Flink, a stream and batch processing framework that can work with Kafka. It explains the features and benefits of Flink, such as event-time processing, stateful computations, exactly-once guarantees, and flexible APIs. It also describes the architecture and components of Flink and how it integrates with Kafka to form a data streaming stack. It gives examples of Flink applications in various domains and links to resources and tutorials.
  • How Alex Bank built a real-time banking experience with Confluent. This blog post from Confluent discusses how they helped a European bank create a real-time banking experience using their platform. It explains how the Confluent Platform addressed the challenges, and opened up the opportunities, of modernizing banking with real-time data and event streaming. It describes the use cases and solutions the Confluent Platform provided for the bank, such as fraud detection, customer interactions, payments, and analytics. The post also shares some best practices and lessons learned from the bank’s adoption of the Confluent Platform.
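
As a rough illustration of the event-time processing mentioned in the Flink post above, here is a small plain-Python sketch. It is not Flink code and does not use the Flink API; the function name, the event tuples, and the 10-unit window size are assumptions made up for this example. The point is that each event is assigned to a window by the timestamp it carries, not by the order in which it arrives.

```python
from collections import defaultdict


def tumbling_window_counts(events, window_size):
    """Count events per (window_start, key) using each event's own timestamp.

    `events` is an iterable of (event_timestamp, key) tuples, possibly
    out of order. Windows are fixed-size and non-overlapping.
    """
    counts = defaultdict(int)
    for timestamp, key in events:
        # Align the timestamp down to the start of its window.
        window_start = (timestamp // window_size) * window_size
        counts[(window_start, key)] += 1
    return dict(counts)


# Hypothetical stream: the event with timestamp 3 arrives late,
# after the event with timestamp 12.
events = [(1, "login"), (12, "login"), (3, "login"), (14, "payment")]

for (window_start, key), count in sorted(tumbling_window_counts(events, 10).items()):
    print(f"window [{window_start}, {window_start + 10}) {key}: {count}")
```

The late event still lands in the first window because grouping is by event time. A real stream processor also has to decide how long to wait for late events before emitting a window's result, which this sketch leaves out.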

WIND (What Is Niels Doing)

In last week’s roundup, I wrote that I was prepping for Data Saturday Durban. It took place yesterday (August 19) and was a resounding success! We had 100+ enthusiastic data professionals eager to sit in on the 14 presentations by 13 speakers. Having over 100 attendees at a free event on a Saturday in Durban, when there is a huge rugby game on (Boks vs Wales - go Boks), is a record!

Figure 1: Attendees

It is validation that people are interested in hearing about what is happening in the industry, and it inspires us to hold more events and grow the community! If you are interested, please sign up for the local user group, the Azure Transform User Group - we will use it as our communication vehicle!

~ Finally

That’s all for this week. I hope you enjoyed what I put together. Please comment on this post or ping me if you have ideas for what to cover.
