I have read in several lecture notes/slides of prominent geostatistics people that kriging is not affected by clusters in data. That kriging de-clusters. But before applying kriging we need to fit a model to the empirical variogram, which is calculated from the "data", that have clusters. So how do we calculate the empirical variogram in the presence of clustering?