After extracting news articles from different news websites, I need to measure how similar they are to each other. What are the best techniques for finding document similarity? So far I'm planning to use Word2vec with cosine similarity: Word2vec to map words into a vector space, and then cosine similarity computed between the resulting vectors. But I'm not sure whether the two can be used together.
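
A minimal sketch of the idea, to make the question concrete. It assumes each article is represented by the average of its word vectors, which is only one common way to get from word vectors to a document vector and is not specified above. The `TOY_VECTORS` table is a hypothetical stand-in for a trained Word2vec model, and the helper names (`document_vector`, `cosine_similarity`, `similarity_matrix`) are made up for illustration.

```python
import math

# Hypothetical 3-dimensional word vectors standing in for a trained
# Word2vec model; real embeddings would have 100+ dimensions.
TOY_VECTORS = {
    "election": [0.9, 0.1, 0.0],
    "vote":     [0.8, 0.2, 0.1],
    "minister": [0.7, 0.3, 0.0],
    "football": [0.0, 0.9, 0.2],
    "match":    [0.1, 0.8, 0.3],
    "goal":     [0.0, 0.7, 0.4],
}


def document_vector(tokens, vectors):
    """Average the vectors of the tokens found in `vectors`.

    Returns None when no token is known (assumption: unknown words are skipped).
    """
    known = [vectors[t] for t in tokens if t in vectors]
    if not known:
        return None
    dim = len(known[0])
    return [sum(v[i] for v in known) / len(known) for i in range(dim)]


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def similarity_matrix(docs, vectors):
    """Pairwise cosine similarity between whitespace-tokenized documents."""
    doc_vecs = [document_vector(d.lower().split(), vectors) for d in docs]
    n = len(docs)
    matrix = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(n):
            if doc_vecs[i] is not None and doc_vecs[j] is not None:
                matrix[i][j] = cosine_similarity(doc_vecs[i], doc_vecs[j])
    return matrix


articles = [
    "minister wins election vote",
    "election vote minister",
    "football match goal",
]
sim = similarity_matrix(articles, TOY_VECTORS)
for row in sim:
    print([round(value, 3) for value in row])
```

In this sketch the first two articles contain the same known words, so their averaged vectors coincide and their similarity is highest, while the sports article scores much lower against both. Averaging discards word order and weights every word equally, so it is best read as a baseline rather than a recommended method.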