Recently, I have been intrigued by the paper "Stable learning establishes some common ground between causal inference and machine learning", published in Nature Machine Intelligence. After reading it carefully, I ran into a question about the connection between model explainability and spurious correlation.

I notice that in the paper, after introducing the three key factors (stability, explainability and fairness) that ML researchers need to address, the authors further argue that spurious correlation is a key source of risk. I have figured out why spurious correlation can cause ML models to lack stability and fairness; however, it is still unclear to me why spurious correlation should also obstruct research on explainable AI. As far as I know, there are two main lines of research on XAI: explaining black-box models post hoc, and building inherently interpretable models. Is there a concrete explanation of why spurious correlations are such a troublemaker when trying to design good XAI methods?
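To make my concern concrete, here is a toy sketch I put together myself (entirely hypothetical, not from the paper). A classifier latches onto a spurious feature that is predictive only in the training data; a simple feature-importance "explanation" (coefficient magnitude) then faithfully reports that reliance, so the explanation describes the model rather than the true mechanism:

```python
# Toy illustration (my own hypothetical example): a classifier that latches
# onto a spurious feature, and a feature-importance explanation that
# faithfully reports it.
import numpy as np
from sklearn.linear_model import LogisticRegression

rng = np.random.default_rng(0)
n = 5000

# The true causal feature drives the label.
causal = rng.normal(size=n)
y = (causal + 0.5 * rng.normal(size=n) > 0).astype(int)

# A spurious feature: correlated with y in the training data only through
# collection bias (think image background, hospital ID, watermark).
spurious = y + 0.1 * rng.normal(size=n)

X_train = np.column_stack([causal, spurious])
model = LogisticRegression().fit(X_train, y)

# Coefficient magnitudes as a crude "explanation": they correctly show
# that the model relies mostly on the spurious feature ...
print("importance (causal, spurious):", np.abs(model.coef_[0]))

# ... yet in a test environment where the spurious correlation breaks,
# that reliance hurts accuracy, so the faithful explanation still tells
# us nothing reliable about the underlying mechanism.
spurious_test = rng.normal(size=n)  # correlation removed at test time
X_test = np.column_stack([causal, spurious_test])
print("test accuracy:", model.score(X_test, y))
```

If this intuition is right, the explanation is "faithful" to the model but misleading about the data-generating process, which seems to be one way spurious correlation undermines both lines of XAI research. Is this the connection the authors have in mind, or is there more to it?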
