Identifying Spurious Correlations and Correcting them with an Explanation-based Learning Article Swipe
Misgina Tsighe Hagos
,
Kathleen M. Curran
,
Brian Mac Namee
·
YOU?
·
· 2022
· Open Access
·
· DOI: https://doi.org/10.48550/arxiv.2211.08285
YOU?
·
· 2022
· Open Access
·
· DOI: https://doi.org/10.48550/arxiv.2211.08285
Identifying spurious correlations learned by a trained model is at the core of refining a trained model and building a trustworthy model. We present a simple method to identify spurious correlations that have been learned by a model trained for image classification problems. We apply image-level perturbations and monitor changes in certainties of predictions made using the trained model. We demonstrate this approach using an image classification dataset that contains images with synthetically generated spurious regions and show that the trained model was overdependent on spurious regions. Moreover, we remove the learned spurious correlations with an explanation based learning approach.
Related Topics To Compare & Contrast
Vs
Epistemology
Concepts
Spurious relationship
Artificial intelligence
Computer science
Trustworthiness
Image (mathematics)
Pattern recognition (psychology)
Machine learning
Simple (philosophy)
Epistemology
Philosophy
Computer security
Metadata
- Type
- preprint
- Language
- en
- Landing Page
- http://arxiv.org/abs/2211.08285
- https://arxiv.org/pdf/2211.08285
- OA Status
- green
- Cited By
- 5
- Related Works
- 10
- OpenAlex ID
- https://openalex.org/W4309213162
All OpenAlex metadata
Raw OpenAlex JSON
No additional metadata available.