You may take a look at these manuscripts to understand these models.
https://arxiv.org/pdf/1301.3781v3.pdf
https://aclanthology.org/D14-1162.pdf
It's difficult to say which one performed well for text classification. I think it entirely depends on your dataset. Better explore each method and analyze which one performs well.
All mentioned methods are used for words, not for texts. Of course, you can calculate the sum of the individual word vectors in a text and then use a classifier.
Depending on the type of text and classification task, you might not need word embeddings (use TF/IDF) or get better results with finetuning a contextualized BERT model.