SOTAVerified

Interpretability Techniques for Deep Learning

Papers

Showing 1–10 of 25 papers

| Title | Status | Hype |
| --- | --- | --- |
| CausalGym: Benchmarking causal interpretability methods on linguistic tasks | Code | 2 |
| Less is More: Fewer Interpretable Region via Submodular Subset Selection | Code | 2 |
| A Novel Deep Learning Model for Hotel Demand and Revenue Prediction amid COVID-19 | Code | 1 |
| Dissecting and Mitigating Diffusion Bias via Mechanistic Interpretability | Code | 1 |
| Axiomatic Attribution for Deep Networks | Code | 1 |
| DISSECT: Disentangled Simultaneous Explanations via Concept Traversals | Code | 1 |
| Grad-CAM: Visual Explanations from Deep Networks via Gradient-based Localization | Code | 1 |
| Exploration of Interpretability Techniques for Deep COVID-19 Classification using Chest X-ray Images | Code | 1 |
| A Unified Approach to Interpreting Model Predictions | Code | 1 |
| Learning the Dynamics of Physical Systems from Sparse Observations with Finite Element Networks | Code | 1 |

Benchmark Results

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | DAS | Log odds-ratio (pythia-6.9b) | 9.95 | | Unverified |
| 2 | Linear probe | Log odds-ratio (pythia-6.9b) | 3.42 | | Unverified |
| 3 | Difference-in-means | Log odds-ratio (pythia-6.9b) | 2.91 | | Unverified |
| 4 | k-means | Log odds-ratio (pythia-6.9b) | 1.87 | | Unverified |
| 5 | PCA | Log odds-ratio (pythia-6.9b) | 1.81 | | Unverified |
| 6 | LDA | Log odds-ratio (pythia-6.9b) | 0.27 | | Unverified |
| 7 | Random | Log odds-ratio (pythia-6.9b) | 0.01 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | RISE | Insertion AUC score | 0.57 | | Unverified |
| 2 | HSIC-Attribution | Insertion AUC score | 0.57 | | Unverified |
| 3 | Kernel SHAP | Insertion AUC score | 0.52 | | Unverified |
| 4 | LIME | Insertion AUC score | 0.52 | | Unverified |
| 5 | Saliency | Insertion AUC score | 0.46 | | Unverified |
| 6 | Grad-CAM | Insertion AUC score | 0.37 | | Unverified |
| 7 | Integrated Gradients | Insertion AUC score | 0.36 | | Unverified |