Mar 1 · Member-only
WebGPT: Improving the factual accuracy of language models through web browsing
In this blog post, which is part of my series on conversational AI and chatbots, I will review WebGPT, proposed by OpenAI. Introduction: The WebGPT article discusses the challenge of long-form question-answering (LFQA) in natural language processing (NLP). LFQA systems have the potential to become a primary source of learning…
Deep Learning · 5 min read

Published in AIGuys · Feb 26 · Member-only
Graphs to Graph Neural Networks: From Fundamentals to Applications — Part 1b: Graph Theory Fundamentals
In this post, the second in my blog post series on Graphs, we will go over another 10 questions about graph theory fundamentals. You can find the previous post here: Graphs to Graph Neural Networks: From Fundamentals to Applications — Part 1a: Graph Theory… (kargarisaac.medium.com)
Graph · 20 min read

Published in AIGuys · Feb 21 · Member-only
LaMDA: Language Models for Dialog Applications
Note: ChatGPT and other AI tools are used as assistants in this blog post. Introduction: This paper from Google proposes LaMDA, a family of transformer-based neural language models designed for dialog. The models range in size from 2B to 137B parameters, and they are pre-trained on a dataset of 1.56T…
Chatbots · 8 min read

Published in AIGuys · Feb 17 · Member-only
Graphs to Graph Neural Networks: From Fundamentals to Applications — Part 1a: Graph Theory Fundamentals
I decided to start a series of blog posts on Graphs, Knowledge Graphs, and Graph Neural Networks (GNNs) to learn more about these topics. I thought it would also be fun to test ChatGPT and the new Bing, which I got access to recently, and see how they can help…
Deep Learning · 19 min read

Published in AIGuys · Feb 11 · Member-only
BlenderBot v2
Note: ChatGPT and other AI writing assistant tools are used in this post to improve the writing quality. This article is part of my blog series on chatbots and conversational agents. In earlier posts, we reviewed ChatGPT, Meena, and BlenderBot v1. We will now review BlenderBot v2, which was developed…
NLP · 12 min read

Published in AIGuys · Feb 4 · Member-only
SCENE: Reasoning about Traffic Scenes using Heterogeneous Graph Neural Networks
Encoding the driving scene in an autonomous driving system is a very important task that has not yet been solved completely. Many researchers are working to encode driving scenes using graph neural networks, and this is one of the most recent works that I found interesting. Let's…
Autonomous Cars · 9 min read

Published in AIGuys · Jan 27 · Member-only
Recipes for building an open-domain chatbot: BlenderBot v1
In the previous blog post, we talked about Meena, a Google AI chatbot. In this post, I will review BlenderBot, the chatbot developed and open-sourced by Meta. I will cover the later versions of BlenderBot in the next few posts. Let’s get started! Introduction: BlenderBot is an open-domain…
Naturallanguageprocessing · 8 min read

Published in AIGuys · Jan 17 · Member-only
Meena — Towards a Human-like Open-Domain Chatbot
Note: This blog post is written with the help of ChatGPT! I decided to start a series of blog posts on chatbots and conversational agents, diving into the world of AI-powered language models and their capabilities. In my first post, I review Meena, the conversational agent introduced by Google AI. …
Deep Learning · 6 min read

Published in AIGuys · Jan 14 · Member-only
Chinchilla: Training Compute-Optimal Large Language Models
The article investigates the optimal model size and number of training tokens for a transformer language model under a given compute budget. The authors found that current large language models are undertrained, and by training over 400 language models with varying numbers of parameters and tokens, they found that…
Deep Learning · 5 min read

Published in DevOps.dev · Jan 14 · Member-only
MLOps project — part 4b: Machine Learning Model Monitoring
We reviewed Evidently.AI and Seldon ALIBI Detect for model monitoring in our previous blog post. In this post, I will go through managed services on Google Cloud and AWS for model monitoring. I would like to emphasize the importance of the MLOps components using the following photo from a paper published by…
Machine Learning · 13 min read