Day 20
Section outline
-
May 15th, Wednesday (10:30-12:30)
Machine translation
- RNN: autoregressive encoder-decoder
- RNN: greedy inference algorithm
- RNN: training algorithm and teacher forcing
- RNN: attention and dynamic context vector
- RNN: dot-product attention
- RNN: bilinear attention
- Transformer-based architecture
- Cross-attention, query, key and value
- Search tree and beam search
- Evaluation: BLEU and METEOR
- NMT and leaderboard
- Parallel corpora
- Research papers
References
- Jurafsky and Martin, chapter 13
- Jurafsky and Martin, chapter 9
- Jurafsky and Martin, chapter 10