Hello I am student working on NLP project. The top ranked words under a given topic number which is obtained from LDA (Latent Dirchlet allocation), I am trying to assign topic names to each topic number using wikipedia as a knowledge base.
Wikipedia category graph contains links between categories which are having some relationship but do not have an hierarchical structure. From this graph I removed the non hierarchical links to get a DAG (directed acyclic graph) as a consequence a given category can have one or more parent, after this I applied BFS like algorithm to get a taxonomy but this misses relevant hierarchical links.
Is there any factor which I consider to get more accurate and meaningful taxonomy.
Thank you in advance