Throughout the week, I read a lot of blog-posts, articles, and so forth, that has to do with things that interest me:
- data science
- data in general
- distributed computing
- SQL Server
- transactions (both db as well as non db)
- and other “stuff”
This blog-post is the “roundup” of the things that have been most interesting to me, for the week just ending.
- Data Lake: Concepts, Characteristics, Architecture, and Case Studies. This is a long post where the authors try to explain what a Data Lake is, characteristics of a Data Lake, the architecture of a Data Lake, and a lot more. It is an excellent read!
Big Data Analytics
- Building Latency Sensitive User Facing Analytics via Apache Pinot. In this InfoQ presentation, the presenter discusses how LinkedIn, Uber and other companies managed to have low latency for analytical database queries despite high throughput.
- Introducing Confluent Platform 6.1. The post linked to here announces, as the title implies, the 6.1 version of Confluent Platform. There are quite a few new exciting features in this release. What excites me the most are the enhanced functionality of ksqlDB! That is something that will help us at Derivco a lot.
WIND (What Is Niels Doing)
While writing my blog posts about the open-sourced Python SQL Server language extension, I wanted to install SQL Server on a new, clean server. I decided to do it on an Azure VM instead of “messing” with VM’s on my box.
It was not as straight forward as I thought, so I am now about to finish a post about what I did.
That’s all for this week. I hope you enjoy what I did put together. If you have ideas for what to cover, please comment on this post or ping me.