Throughout the week, I read a lot of blog-posts, articles, and so forth, that has to do with things that interest me:
- data science
- data in general
- distributed computing
- SQL Server
- transactions (both db as well as non db)
- and other “stuff”
This blog-post is the “roundup” of the things that have been most interesting to me, for the week just ending.
- The Changing Face of ETL. An article by Robin Moffat about the “new” ETL, based on event-driven architectures and streaming platforms.
- Kafka Donuts. This post is the introduction and TOC to a series of posts about Kafka. The author is my colleague Charl Lamprecht, and in the series, he discusses the use of Kafka in a company who manufactures and sells Donuts. Reading the series introduction post, it is clear that this series is a MUST for everyone interested in Kafka. The first episode: Donut Broker is here, and the second episode Donut Baker is here.
- Docker on Windows Subsystem for Linux using VirtualBox and Docker Machine. This post by Travis Illig discusses how you can enable both VirtualBox as well as Docker for Windows on the same Windows box.
So, Microsoft Ignite starts tomorrow (September 24). It looks to be an awesome conference with lots and lots of announcements of new “stuff”, I for one cannot wait!
If you, like me, are not attending but still want to follow the key-notes and various sessions, this link takes you to the live stream.
The other day I looked at the sessions and here are some that interests me:
- BRK2416 - The roadmap for SQL Server
- BRK2183 - SQL Server Machine Learning Services: An E2E platform for machine learning
- BRK3228 - What’s new in SQL Server on Linux and containers
- BRK2229 - The future of SQL Server and big data
- THR2168 - The next generation of SQL Server tools
- BRK4021 - Deep dive on SQL Server and big data
- THR2171 - Deploying a highly available SQL Server solution in Kubernetes
- BRK3154 - SQL Server in containers for application development and DevOps
- THR2308 - SQL Server vNext meets AI and Big Data
As you see, mostly SQL Server related sessions, and I must say that the sessions around SQL Server and Big Data intrigues me.
- How Machine Learning Works: 3 Resources To Learn And Develop ML Applications. The Lightbend team has put together some resources about how to design, build, run and manage machine learning applications in production.
- Simplify Market Basket Analysis using FP-growth on Databricks. In retail, you want to recommend to shoppers what to purchase, and often you base the recommendations on items that are frequently purchased together. A key technique to uncover associations between different items is known as market basket analysis. This blog post talks about how you run your market basket analysis using Apache Spark MLlib
FP-growthalgorithm on Databricks.
- Introduction to Deep Learning. This blog post is the first in a series about Deep Learning. At a quick glance, the series looks very informative.
SQL Server Machine Learning Services
I am still working on the third post in the Install R Packages in SQL Server ML Services series. I hope to be able to publish it soon:ish.
That’s all for this week. I hope you enjoy what I did put together. If you have ideas for what to cover, please comment on this post or ping me.