My task involves multilingual text (English and Hindi), so I need a common embedding that represents both languages.

I know there are methods for learning multilingual embeddings, such as MUSE (https://github.com/facebookresearch/MUSE), but these map the two languages' separate embeddings into a common vector space. The vectors for corresponding words end up similar, but not the same.
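
To make "similar, but not the same" concrete, here is a minimal sketch with made-up 3-dimensional vectors (not real MUSE output; the words, values, and the helper names `cosine` and `nearest` are all assumptions for illustration). It shows an English word and its Hindi translation sitting close together in a shared space while still being two distinct vectors:

```python
import math

def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)

# Hypothetical aligned embeddings; the values are invented for illustration.
# Hindi words are romanized here: "paani" = water, "kitaab" = book.
en = {"water": [0.90, 0.10, 0.20], "book": [0.10, 0.80, 0.30]}
hi = {"paani": [0.85, 0.15, 0.25], "kitaab": [0.15, 0.75, 0.35]}

def nearest(vec, vocab):
    """Return the word in vocab whose vector is most similar to vec."""
    return max(vocab, key=lambda w: cosine(vec, vocab[w]))

sim = cosine(en["water"], hi["paani"])
match = nearest(en["water"], hi)

print(f"cosine(water, paani) = {sim:.3f}")    # high, but below 1.0
print(f"nearest Hindi word to 'water': {match}")
print("identical vectors?", en["water"] == hi["paani"])
```

In this toy setup the translation pair can be retrieved by nearest-neighbour search, yet each language still keeps its own vector per word, which is the limitation I am asking about.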

So I would like to know whether there is a method or approach that learns a single embedding representing both languages, rather than two separate embeddings placed in a shared space.

Any leads would be greatly appreciated!
