Survey
For a brief survey of skip-gram, CBOW, GloVe, etc., please refer to this.
Code
word2vec: TensorFlow
GloVe: C, TensorFlow
WikiCorpus
Download the WikiCorpus and process it with the shell script (e.g., to remove numbers, invalid characters, and URLs), producing a sequence of plain words.
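The kind of cleaning described above can be sketched in Python. This is not the repository's shell script, only a minimal illustration under a few assumptions: the text is English, everything is lowercased, and any character outside a-z counts as invalid. The name `clean_line` and both regular expressions are hypothetical and introduced here for illustration.

```python
import re
from typing import List

# Assumption: a URL is anything starting with http://, https://, or www.
# and running up to the next whitespace.
URL_RE = re.compile(r"(?:https?://|www\.)\S+")

# Assumption: only ASCII letters are kept; digits, punctuation, and other
# symbols are treated as invalid characters.
NON_LETTER_RE = re.compile(r"[^a-z\s]+")


def clean_line(line: str) -> List[str]:
    """Return the plain lowercase words in one line of raw corpus text."""
    line = line.lower()
    line = URL_RE.sub(" ", line)         # drop URLs first, before stripping symbols
    line = NON_LETTER_RE.sub(" ", line)  # drop numbers and invalid characters
    return line.split()                  # split on runs of whitespace


if __name__ == "__main__":
    sample = "In 2013, Mikolov et al. released it (see https://example.com/w2v)!"
    print(" ".join(clean_line(sample)))
```

Running it on the sample line prints `in mikolov et al released it see`. URLs are removed before the character filter so that fragments such as `https` or `com` do not survive as words.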
Resources
- English word vectors: https://github.com/3Top/word2vec-api
- Non-English word vectors: https://github.com/Kyubyong/wordvectors