Day 10
Section outline
-
April 3rd, Wednesday (10:30-12:30)
Fine-tuning
- Prompt learning
- Retrieval augmented generation
- Large language models and ethics
- Research papers
ChatBots
- Supervised fine-tuning
- Reward modeling from human feedback
- Reinforcement learning training
References
- Jurafsky and Martin, section 10.10
- Slides from lecture
Resources
-
Slides: Training pipeline of GPT assistants like ChatGPT by Andrej Karpathy, 2023. First part only: stop at slide #30.
-
External video: Training pipeline of GPT assistants like ChatGPT by Andrej Karpathy, 2023. First part only: stop at time-lapse 20:17