Throughout the week, I read a lot of blog-posts, articles, and so forth that has to do with things that interest me:
- AI/data science
- data in general
- data architecture
- streaming
- distributed computing
- SQL Server
- transactions (both db as well as non db)
- and other “stuff”
This blog-post is the “roundup” of the things that have been most interesting to me for the week just ending.
Misc.
- Beyond REST. This Netflix blog post discusses how Netflix use GraphQL microservices as a backend platform, facilitating rapid application development. It looks very interesting. I wonder if we at Derivco could use this?
Analytics
- Building a Climate Dashboard with Apache Pinot and Superset. Hmm, somehow, I must have missed this blog post from back in September 2020. Anyway, better late than never. The post discusses how Apache Pinot can easily ingest, query, and visualize millions of events. In this case, the events are climate events, sourced from the NOAA storm database.
- Fighting spam with Guardian, a real-time analytics and rules engine. The post linked here looks at the evolution of Pinterest’s spam-fighting rules and query and what they’ve learned throughout the process.
Data Architecture
- How To Modernize Your Data Architecture Part 1 – Data Analytics Strategy Consulting. This post is the first in a series about data architecture. It discusses what to avoid when building a data architecture and which questions to ask when building a future data
- From Data Lakes to Data Reservoirs. The post linked to here looks at how you can “tame” your data lakes. How you can create clean, beautiful, and protected data resources with Apache Spark and Databricks Delta Lake.
- The Building Blocks of a Modern Data Platform. If you wonder what a “modern data platform” means, then this post is for you. The post breaks down what a modern data platform means in practice today. This includes the three core characteristics, six fundamental building blocks, and the latest data tools. I found this post extremely valuable.
Streaming
- Lessons Learned from Running Apache Kafka at Scale at Pinterest. The post linked to shares how Pinterest runs Kafka and discusses some of the challenges they’ve faced and how they addressed them.
~ Finally
That’s all for this week. I hope you enjoy what I did put together. If you have ideas for what to cover, please comment on this post or ping me.
comments powered by Disqus