Section: Day 06 | INQ0091105 - NATURAL LANGUAGE PROCESSING 2023-2024 | STEM

March 15th, Friday (10:30-12:30)

Language models

Out-of-vocabulary words

Limitations of N-gram models

Research papers

Neural language models (NLM)

General architecture for NLM

Feedforward NLM: inference

Feedforward NLM: training

Recurrent NLM: inference

Exercises

Subword tokenization: BPE algorithm

References

Jurafsky and Martin, chapter 3

Voita, NLP Course | For You (web course): Language Modeling

Jurafsky and Martin, section 7.5

Jurafsky and Martin, section 7.7

Jurafsky and Martin, section 9.2