Section: Day 06 | NATURAL LANGUAGE PROCESSING 2025-2026 - INQ0091105 | STEM

Macroarea STEM

Home Calendar Unipd Educational Offer Timetables Uniweb Webmail My Media

Section outline

March 20th, Friday (10:30-12:30)

Statistical language models

Linear interpolation

Out-of-vocabulary words

Limitations of N-gram model

Neural language models (NLM)

General architecture for NLM

Feedforward NLM: inference

Feedforward NLM: training

Recurrent NLM: inference

Recurrent NLM: training

Exercises

Subword tokenization: BPE algorithm

References

Jurafsky and Martin, section 6.5

Jurafsky and Martin, section 13.2

Voita, NLP Course | For You (web course): Language Modeling

Resources
- Select activity 05b_neural_language_modeling
  
  05b_neural_language_modeling File