Day 06
Section outline
-
March 15th, Friday (10:30-12:30)
Language models
- Out-of-vocabulary words
- Limitations of N-gram model
- Research papers
Neural language models (NLM)
- General architecture for NLM
- Feedforward NLM: inference
- Feedforward NLM: training
- Recurrent NLM: inference
Exercises
- Subword tokenization: BPE algorithm
References
- Jurafsky and Martin, chapter 3
- Voita, NLP Course | For You (web course): Language Modeling
- Jurafsky and Martin, section 7.5
- Jurafsky and Martin, section 7.7
- Jurafsky and Martin, section 9.2