Section outline

  • April 3rd, Wednesday (10:30-12:30)

    Fine-tuning

    • Prompt learning
    • Retrieval augmented generation
    • Large language models and ethics
    • Research papers

    ChatBots

    • Supervised fine-tuning
    • Reward modeling from human feedback
    • Reinforcement learning training

    References

    • Jurafsky and Martin, section 10.10
    • Slides from lecture

    Resources

    • Slides: Training pipeline of GPT assistants like ChatGPT by Andrej Karpathy, 2023. First part only: stop at slide #30.

    • External video: Training pipeline of GPT assistants like ChatGPT by Andrej Karpathy, 2023. First part only: stop at time-lapse 20:17