Section outline

  • March 15th, Friday (10:30-12:30)

    Language models

    • Out-of-vocabulary words
    • Limitations of N-gram model
    • Research papers

    Neural language models (NLM)

    • General architecture for NLM
    • Feedforward NLM: inference
    • Feedforward NLM: training
    • Recurrent NLM: inference

    Exercises

    • Subword tokenization: BPE algorithm

    References

    • Jurafsky and Martin, chapter 3
    • Voita, NLP Course | For You (web course): Language Modeling
    • Jurafsky and Martin, section 7.5
    • Jurafsky and Martin, section 7.7
    • Jurafsky and Martin, section 9.2