Interesting Stuff - Week 42, 2023

Posted by nielsb on Sunday, October 22, 2023

This week it is Generative AI only: a round-up of notable Large Language Model (LLM) papers, including “Prometheus” and “Table-GPT”, and a beginner’s guide to connecting GPT models with company data in Microsoft Azure.

Plus, a look at how LLMs can kickstart Infrastructure as Code projects, and why manual adjustments are still needed before the generated code is production-ready.

Generative AI

  • Top Important LLM Papers for the Week from 9/10 to 15/10. This post provides a summary of important papers in the field of Large Language Models (LLMs) published during the second week of October. The papers cover various aspects of LLM research, including reasoning capabilities, text generation and summarization, progress and benchmarking, and fine-tuning. Notable papers include “Prometheus”, an open-source LLM for evaluation tasks, “EIPE-text” for narrative text generation, and “Table-GPT” for table understanding. Additionally, there are papers on harmonizing natural language and code with “Lemur” and a zero-shot agent for computer control. Finally, one paper discusses joint language modelling for speech and text, and another introduces “LoftQ,” a quantization framework for LLMs. These papers contribute to advancing LLMs in various domains. Very interesting!
  • Models with Company Data in Microsoft Azure. The linked post is a beginner tutorial on connecting GPT models with company data in Microsoft Azure, using Azure OpenAI Studio, Cognitive Search and Storage Accounts. The author walks through uploading internal documents to a container in Azure Blob Storage, creating an Azure Cognitive Search resource to index them, creating an Azure OpenAI resource and deploying a GPT model, using the Chat section in Azure OpenAI Studio to ask the model questions that are answered from the indexed documents, and customizing the system message, parameters, and web app deployment options. The post also provides screenshots and code snippets to illustrate the steps. We are looking at doing this at Derivco, so this is very interesting to me. For a feel of what the underlying pattern looks like in code, see the first sketch after this list.
  • Mastering the Future: Evaluating LLM-Generated Data Architectures leveraging IaC technologies. This blog post discusses using Large Language Models (LLMs) in the application lifecycle, specifically for Infrastructure as Code (IaC). It explores how LLMs can kickstart projects, while noting their potential biases. The article covers three LLM tasks: provisioning AWS virtual machines, creating a FastAPI app for Elasticsearch, and generating Ansible scripts for Elasticsearch and Kibana. In each case the LLM provides a structured foundation but requires manual adjustments for production readiness (the second sketch after this list shows what such a generated scaffold typically looks like). In conclusion, LLMs offer valuable support but still require expertise to refine the code.
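
As an aside, here is a minimal Python sketch of the pattern the Azure tutorial sets up: query an Azure Cognitive Search index for relevant document chunks and pass them to a GPT deployment in Azure OpenAI as grounding context. This is my own sketch, not code from the post; the endpoints, keys, index name, field names and deployment name are placeholders, and it assumes the azure-search-documents and openai (v1) Python packages.

```python
# Minimal retrieval-augmented chat against Azure Cognitive Search + Azure OpenAI.
# All names (endpoints, keys, index, deployment, field names) are placeholders.
import os

from azure.core.credentials import AzureKeyCredential
from azure.search.documents import SearchClient
from openai import AzureOpenAI

# Client for the Cognitive Search index that holds the uploaded documents.
search_client = SearchClient(
    endpoint=os.environ["SEARCH_ENDPOINT"],      # e.g. https://<name>.search.windows.net
    index_name="company-docs",                   # hypothetical index name
    credential=AzureKeyCredential(os.environ["SEARCH_KEY"]),
)

# Client for the Azure OpenAI resource with a deployed GPT model.
openai_client = AzureOpenAI(
    azure_endpoint=os.environ["AOAI_ENDPOINT"],  # e.g. https://<name>.openai.azure.com
    api_key=os.environ["AOAI_KEY"],
    api_version="2023-07-01-preview",            # adjust to your resource's API version
)


def ask(question: str) -> str:
    # 1. Retrieve the most relevant chunks from the indexed documents.
    results = search_client.search(search_text=question, top=3)
    context = "\n\n".join(doc["content"] for doc in results)  # "content" is a hypothetical field

    # 2. Ask the GPT deployment, grounding the answer in the retrieved text.
    response = openai_client.chat.completions.create(
        model="gpt-35-turbo",                    # the deployment name from Azure OpenAI Studio
        messages=[
            {"role": "system", "content": "Answer only from the provided company documents."},
            {"role": "user", "content": f"Documents:\n{context}\n\nQuestion: {question}"},
        ],
    )
    return response.choices[0].message.content


print(ask("What is our travel expense policy?"))
```

The “add your data” feature in Azure OpenAI Studio wires Cognitive Search into the chat for you; the sketch above just makes the retrieve-then-prompt flow explicit.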

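And to make the second of the three IaC tasks concrete, this is roughly the kind of scaffold an LLM produces for a FastAPI app in front of Elasticsearch. Again a hypothetical sketch (the index name, the content field and the hard-coded localhost URL are made up), and it illustrates the article’s point: fine as a starting point, but authentication, error handling and configuration still need to be added by hand.

```python
# Minimal FastAPI + Elasticsearch scaffold, the kind of starting point an LLM generates.
# Index name, document fields and the hard-coded URL are illustrative placeholders.
from elasticsearch import Elasticsearch
from fastapi import FastAPI, HTTPException

app = FastAPI()
es = Elasticsearch("http://localhost:9200")  # would need auth/TLS/configuration for production


@app.get("/search")
def search(q: str, size: int = 10):
    # Simple full-text match query against a hypothetical "documents" index.
    response = es.search(
        index="documents",
        query={"match": {"content": q}},
        size=size,
    )
    hits = response["hits"]["hits"]
    if not hits:
        raise HTTPException(status_code=404, detail="No matching documents")
    return [{"id": h["_id"], "score": h["_score"], **h["_source"]} for h in hits]
```

Run it with uvicorn main:app and it works, but everything the article flags as missing (credentials, error handling, the infrastructure around it) is exactly what is missing here too.
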
~ Finally

That’s all for this week. I hope you enjoyed what I put together. Please comment on this post or ping me if you have ideas for what to cover.

