Lab Session I: word embeddings
Section outline
-
March 22nd, Friday (16:30-18:30)
Using pretrained word embeddings
- Introduction to the gensim library
- Common operations with word embeddings: lookup, similarity, NN retrieval
- Visualizing word embeddings: dimensionality reduction with PCA
- Intrisic evaluation of word embeddings: word similarity and word analogy benchmarks
Pretraining word embeddings
- Using gensim to pretrain word embeddings (Word2Vec style)
- Saving and loading embeddings
Extrinsic evaluation of word embeddings
- Using word2vec representations for spam classification
Resources