SOTAVerified

Interpretability Techniques for Deep Learning

Papers

Showing 1–10 of 25 papers

| Title | Status | Hype |
| --- | --- | --- |
| Time series saliency maps: explaining models across multiple domains | Code | 1 |
| Dissecting and Mitigating Diffusion Bias via Mechanistic Interpretability | Code | 1 |
| IBO: Inpainting-Based Occlusion to Enhance Explainable Artificial Intelligence Evaluation in Histopathology | Code | 0 |
| Explainable Deep Learning: A Visual Analytics Approach with Transition Matrices | Code | 0 |
| CausalGym: Benchmarking causal interpretability methods on linguistic tasks | Code | 2 |
| Less is More: Fewer Interpretable Region via Submodular Subset Selection | Code | 2 |
| TraceFL: Interpretability-Driven Debugging in Federated Learning via Neuron Provenance | Code | 1 |
| Improving Interpretability via Regularization of Neural Activation Sensitivity | | 0 |
| Making Sense of Dependence: Efficient Black-box Explanations Using Dependence Measure | Code | 1 |
| Learning the Dynamics of Physical Systems from Sparse Observations with Finite Element Networks | Code | 1 |

Benchmark Results

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | DAS | Log odds-ratio (pythia-6.9b) | 9.95 | | Unverified |
| 2 | Linear probe | Log odds-ratio (pythia-6.9b) | 3.42 | | Unverified |
| 3 | Difference-in-means | Log odds-ratio (pythia-6.9b) | 2.91 | | Unverified |
| 4 | k-means | Log odds-ratio (pythia-6.9b) | 1.87 | | Unverified |
| 5 | PCA | Log odds-ratio (pythia-6.9b) | 1.81 | | Unverified |
| 6 | LDA | Log odds-ratio (pythia-6.9b) | 0.27 | | Unverified |
| 7 | Random | Log odds-ratio (pythia-6.9b) | 0.01 | | Unverified |
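The benchmark metric above is a log odds-ratio: roughly, how much an interpretability method's intervention shifts the model's odds toward the target label, so higher is better (a Random baseline scores near 0). As a minimal sketch of that idea, assuming the metric compares the target-label probability before and after an intervention (the helper below is hypothetical, not the benchmark's exact implementation):

```python
import math

def log_odds_ratio(p_base: float, p_intervened: float) -> float:
    """Log odds-ratio between two probabilities of the target label.

    Positive values mean the intervention shifted the model toward the
    target label. Hypothetical helper: the benchmark's exact definition
    (e.g. averaging over examples) may differ.
    """
    odds = lambda p: p / (1.0 - p)
    return math.log(odds(p_intervened)) - math.log(odds(p_base))

# An intervention that raises the target-label probability
# from 0.10 to 0.75 yields log(3 / (1/9)) = log(27) ≈ 3.30.
print(round(log_odds_ratio(0.10, 0.75), 2))  # → 3.3
```

Under this reading, a method like DAS scoring ~9.95 would shift the model's odds far more strongly than a linear probe (~3.42) or a randomly chosen direction (~0.01).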