Fidelity Isn't Accuracy: When Linearly Decodable Functions Fail to Match the Ground Truth

2025-06-13 · Code Available

Jackson Eshbaugh

Abstract

Neural networks excel as function approximators, but their complexity often obscures the nature of the functions they learn. In this work, we propose the linearity score λ(f), a simple and interpretable diagnostic that quantifies how well a regression network's output can be mimicked by a linear model. Defined as the R^2 between the network's predictions and those of a trained linear surrogate, λ(f) offers insight into the linear decodability of the learned function. We evaluate this framework on both synthetic (y = x sin(x) + ε) and real-world datasets (Medical Insurance, Concrete, California Housing), using dataset-specific networks and surrogates. Our findings show that while high λ(f) scores indicate strong linear alignment, they do not necessarily imply predictive accuracy with respect to the ground truth. This underscores both the promise and the limitations of using linear surrogates to understand nonlinear model behavior, particularly in high-stakes regression tasks.
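
The score described in the abstract can be illustrated with a minimal sketch. This is not the paper's code: the helper names (`fit_line`, `r2_score`, `linearity_score`), the one-dimensional input, the in-sample evaluation, and the two stand-in "networks" are all assumptions made for illustration. It fits a least-squares line to a model's predictions and reports the R^2 of that line against the predictions (fidelity), separately from the R^2 of the predictions against the ground truth (accuracy).

```python
import math

def fit_line(xs, ys):
    """Closed-form 1-D least squares: returns (slope, intercept)."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x

def r2_score(targets, preds):
    """Coefficient of determination of preds with respect to targets."""
    mean_t = sum(targets) / len(targets)
    ss_res = sum((t - p) ** 2 for t, p in zip(targets, preds))
    ss_tot = sum((t - mean_t) ** 2 for t in targets)
    return 1.0 - ss_res / ss_tot

def linearity_score(xs, model_preds):
    """R^2 between a model's predictions and a linear surrogate fit to them.

    Assumption: the surrogate is fit and scored on the same points (in-sample).
    """
    slope, intercept = fit_line(xs, model_preds)
    surrogate_preds = [slope * x + intercept for x in xs]
    return r2_score(model_preds, surrogate_preds)

# Noise-free version of the synthetic target, on a symmetric grid (assumed range).
xs = [-10 + 20 * i / 200 for i in range(201)]
truth = [x * math.sin(x) for x in xs]

# Hypothetical stand-ins for trained networks (no real network is trained here):
accurate_model = truth                    # reproduces the target exactly
linear_model = [0.5 * x for x in xs]      # perfectly linear, but wrong

for name, preds in [("accurate", accurate_model), ("linear", linear_model)]:
    fidelity = linearity_score(xs, preds)
    accuracy = r2_score(truth, preds)
    print(f"{name}: linearity={fidelity:.3f}, accuracy={accuracy:.3f}")
```

The two stand-ins separate the two quantities: the model that matches x sin(x) is accurate but poorly mimicked by a line, while the purely linear model is mimicked perfectly yet fits the ground truth poorly.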

Benchmark Results

| Dataset | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| California Housing Prices | Mimic / Surrogate | R^2 score | 0.57 | | Unverified |
| California Housing Prices | Baseline Regression | R^2 score | 0.58 | | Unverified |
| California Housing Prices | Neural Network | R^2 score | 0.79 | | Unverified |
| Concrete Compressive Strength | Neural Network | R^2 score | 0.86 | | Unverified |
| Concrete Compressive Strength | Baseline Regression | R^2 score | 0.59 | | Unverified |
| Concrete Compressive Strength | Mimic / Surrogate | R^2 score | 0.58 | | Unverified |
| Medical Cost Personal Dataset | Mimic / Surrogate | R^2 score | 0.78 | | Unverified |
| Medical Cost Personal Dataset | Baseline Regression | R^2 score | 0.78 | | Unverified |
| Medical Cost Personal Dataset | Neural Network | R^2 score | 0.87 | | Unverified |
| Synthetic: y = x * sin x | Neural Network | R^2 score | 0.98 | | Unverified |
| Synthetic: y = x * sin x | Baseline Regression | R^2 score | -0.01 | | Unverified |
| Synthetic: y = x * sin x | Mimic / Surrogate | R^2 score | -0.01 | | Unverified |