Transformers

by Giorgio Satta

Dear All, next week we will start using the so-called Transformer neural networks for NLP.

For those unfamiliar with this architecture, I have included a reference document prepared by a student of mine, Alessandro Viespoli, whose contributions are gratefully acknowledged.

Transformers are also introduced in chapter 8 of the Jurafsky and Martin textbook.