Reinforcement Learning from Human Feedback, InstructGPT, and ChatGPT

Isaac Kargar
AIGuys
Published in
9 min readJan 7, 2023

--

Note: some parts of this blog post are generated by ChatGPT! :)

Welcome to my blog post on ChatGPT! In this post, we will dive into the inner workings of ChatGPT and how it is trained. However, before we get into the specifics of ChatGPT, it’s important to first review some relevant prior works and concepts to give us a strong foundation. Once we have a solid understanding of these foundations, we can move on to exploring ChatGPT in depth.

--

--

Isaac Kargar
AIGuys

Co-Founder and CIO @ Resoniks | Ph.D. candidate at the Intelligent Robotics Group at Aalto University | https://kargarisaac.github.io/