The problem arises when I tried to implement the NB algorithm on the MNIST dataset. Since because it has several attributes/features that are of the same value, how to incorporate this part of the distribution in NB. I looked up the implementations of NB on MNIST datasets but could not able to learn how they tackle this problem.