As mentioned in the paper https://nlp.stanford.edu/pubs/glove.pdf, the authors learn two word vectors(one being word vectors W, and another context based vectors W~ ). Why two separate vectors were required and how they are being learned ?

More Ramkrushna Pradhan's questions See All
Similar questions and discussions