Reinforcement Learning from Human Feedback, InstructGPT, and ChatGPT

Published in

AIGuys

9 min readJan 7, 2023

Note: some parts of this blog post are generated by ChatGPT! :)

Welcome to my blog post on ChatGPT! In this post, we will dive into the inner workings of ChatGPT and how it is trained. However, before we get into the specifics of ChatGPT, it’s important to first review some relevant prior works and concepts to give us a strong foundation. Once we have a solid understanding of these foundations, we can move on to exploring ChatGPT in depth.

Reinforcement Learning from Human Feedback, InstructGPT, and ChatGPT

Written by Isaac Kargar