Section outline

  • April 23th, Thursday (10:30-12:30)

    Large language models and post-training

    • Preference-based learning
    • Modeling preferences
    • Learning to score preferences
    • LLM alignment via preference learning
    • Direct preference optimization
    • Parameter efficient fine-tuning: adapters
    • Parameter efficient fine-tuning: LoRA
    • Transfer learning

    References

    • Jurafsky and Martin, chapter 10
    • Jurafsky and Martin, chapter 8
    • Voita, NLP Course | For You (web course): Transfer Learning
    • Slides from lecture