Throughout the week, I read a lot of blog-posts, articles, and so forth, that has to do with things that interest me:
- data science
- data in general
- distributed computing
- SQL Server
- transactions (both db as well as non db)
- and other “stuff”
This blog-post is the “roundup” of the things that have been most interesting to me, for the week just ending.
.NET
- Building C# 8.0. A post about new functionality in the upcoming C# 8 release. Some very interesting new features, and it looks like C# gets more and more features that you normally find in F#.
- Sample performance investigation using BenchmarkDotNet and PerfView. In this blog post Adam describes how he approaches sample performance problem using available free .NET tools and best practices for performance engineering.
Misc.
- My Emacs journey. This blog post by Murat, where he talks about Emacs, brings me back to the early days of .NET, (pre 1.0), where I created an Emacs .NET intellisense extension. Ah, those were the days!
Distributed Computing
- Overload control for scaling WeChat microservices. In this blog post Adrian looks at a white paper discussing how WeChat handles overload control. Some very impressive numbers and quotes: “WeChat’s microservice system accommodates more than 3000 services running on over 20,000 machines in the WeChat business system, and these numbers keep increasing as WeChat is becoming immensely popular… As WeChat is ever actively evolving, its microservice system has been undergoing fast iteration of service updates. For instance, from March to May in 2018, WeChat’s microservice system experienced almost a thousand changes per day on average.”. Think about that; 20,000 machines, thousand code changes per day, wow!
Streaming
- Stream processing: An Introduction to Event Time in Apache Flink. Apache Flink supports multiple notions of time for stateful stream processing. This post focuses on event time support in Apache Flink.
- Kafka Connect Deep Dive – Converters and Serialization Explained. In this article Robin Moffat takes us through how serialization works in Kafka Connectors.
SQL Server 2019
- Creating a data hub for your analytics with PolyBase. SQL Server 2019 CTP 2.0 introduces new connectors for PolyBase, and this blog post discusses how the new connectors enable customers to leverage PolyBase for creating a virtual data hub for a wide variety of data sources within the enterprise. Very interesting!
Data Science / AI
- The Ultimate R Cheatsheet. As the title says, this post points to an impressive cheatsheet for R.
- Intro to Data Science for Managers [Mindmap]. Data science is incredibly broad and complex discipline and can be daunting trying to get your head around. So this blog post tries to mind-map the various key data science concepts and techniques in order to make data science easier to grasp.
- Announcing new AI Capabilities for Power BI to make AI Accessible for Everyone. I guess the title of this blog posts says it all. It announces new AI capabilities for Power BI. This is huge, and I can not wait to test it out!
- Big Data and Deep Learning: A Tale of Two Systems. An InfoQ presentation about how Uber tackles data caching in large-scale deep learning, its ML architecture and discusses how Uber uses Big Data. The presentation concludes by sharing AI use cases.
- Implementing Facebook Prophet efficiently. I wrote back in 2017 about Facebook Prophet and how to run it in Microsoft R Server as well as SQL Server Machine Learning Services. The blog post I link to here discusses how to optimize Prophet. Very interesting!
~ Finally
That’s all for this week. I hope you enjoy what I did put together. If you have ideas for what to cover, please comment on this post or ping me.
comments powered by Disqus