Suppose, we bicluster a dataset containing data in row and column fashion about an event. After biclustering, what can we say by interpreting the clustering result?
say you have a set of documents represented in a bag of words fashion
bi-clustering will end up with clusters of documents so that documents in a cluster are distributed in the same way across the clusters of words and that such distributions are different from one cluster to the other (and the same thing holds regarding the clusters of words and their distribution across clusters of documents)
you have built two clusterings (one for the documents and one for the words) which have maximum mutual information with each other