Section outline

  • May 15th, Wednesday (10:30-12:30)

    Machine translation

    • RNN: autoregressive encoder-decoder
    • RNN: greedy inference algorithm
    • RNN: training algorithm and teacher forcing
    • RNN: attention and dynamic context vector
    • RNN: dot-product attention
    • RNN: bilinear attention
    • Transformer-based architecture
    • Cross-attention, query, key and value
    • Search tree and beam search
    • Evaluation: BLEU and METEOR
    • NMT and leaderboard
    • Parallel corpora
    • Research papers

    References

    • Jurafsky and Martin, chapter 13
    • Jurafsky and Martin, chapter 9
    • Jurafsky and Martin, chapter 10