Day 14
Section outline
April 23rd, Thursday (10:30-12:30)
Large language models and post-training
- Preference-based learning
- Modeling preferences
- Learning to score preferences
- LLM alignment via preference learning
- Direct preference optimization
- Parameter-efficient fine-tuning: adapters
- Parameter-efficient fine-tuning: LoRA
- Transfer learning
References
- Jurafsky and Martin, chapter 10
- Jurafsky and Martin, chapter 8
- Voita, NLP Course | For You (web course): Transfer Learning
- Slides from lecture