Hello,

I wish to apply the NMF algorithm to an extremely sparse matrix (~15% non zero values) with the intent to cluster the data based on the similarity of their feature matrices. Because of the sparsity however, the feature matrices contain a large number of features in order to minimize the reconstruction error, resulting in a difficulty in calculating the similarity between them. How should I approach this problem?

Thanks

Similar questions and discussions