Throughout the week, I read a lot of blog-posts, articles, and so forth, that have to do with things that interest me:
- data science
- data in general
- distributed computing
- SQL Server
- transactions (both database and non-database)
- and other “stuff”
This blog-post is the “roundup” of the things that have been most interesting to me, for the week just ending.
Distributed Computing
- Achieving High Throughput with Reliability in Transactional Systems. This presentation from InfoQ discusses architecting and designing a high-performance, high-throughput transactional data processing system that offers real-time access to large datasets via APIs.
- Getting from Monolith to Microservices. This InfoQ presentation looks at strategies to break a monolith, from the front-end to the back, including database refactoring and analysis tools to see dependencies in legacy code.
Data Science
- Training Your First Classifier with Spark and Scala. This post is an excellent introduction to machine learning with Spark and Scala.
Streaming
- Monitoring Apache Flink Applications 101. This blog post provides an introduction to Apache Flink’s built-in monitoring and metrics system, which allows developers to monitor their Flink jobs effectively.
- Journey to Event Driven – Part 3: The Affinity Between Events, Streams and Serverless. This post is the third part in the Journey to Event Driven series, and it looks at how event-driven streaming architectures fit with serverless.
SQL Server 2019
- SQL Server 2019 community technology preview 2.3 is now available. What the title of the post says: CTP2.3 of SQL Server 2019 is now available for download. I have already downloaded and installed the Windows version, and right now I am in the process of installing the SQL Server 2019 Big Data Cluster on Azure Kubernetes Service. Happy Days!
~ Finally
That’s all for this week. I hope you enjoy what I put together. If you have ideas for what to cover, please comment on this post or ping me.