Lean Formalization of Generalization Error Bound by Rademacher Complexity

2025-03-25

Sho Sonoda, Kazumi Kasaura, Yuma Mizuno, Kei Tsukamoto, Naoto Onda


Code

github.com/auto-res/lean-rademacher
Official · ★ 15

Abstract

We formalize the generalization error bound using Rademacher complexity in the Lean 4 theorem prover. Generalization error quantifies the gap between a learning machine's performance on given training data and its performance on unseen test data, and Rademacher complexity serves as an estimate of this error based on the complexity of the class of learning machines, that is, the hypothesis class. Unlike traditional methods such as PAC learning and VC dimension, Rademacher complexity is applicable across diverse machine learning scenarios, including deep learning and kernel methods. We formalize key concepts and theorems, including the empirical and population Rademacher complexities, and establish generalization error bounds through formal proofs of McDiarmid's inequality, Hoeffding's lemma, and symmetrization arguments.
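
For orientation, the quantities named in the abstract can be written in their usual textbook form. This is a sketch under stated assumptions, not the paper's formalized statement: the symbols (sample S = (x_1, ..., x_n) drawn i.i.d. from a distribution \mu, hypothesis class \mathcal{F}, Rademacher signs \sigma_i) and the assumption that every f in \mathcal{F} takes values in [0, 1] are chosen here for illustration, and the constants in the Lean development may differ.

```latex
% Empirical Rademacher complexity of a class F on a fixed sample S = (x_1, ..., x_n),
% where sigma_1, ..., sigma_n are independent and uniform on {-1, +1}.
\[
  \widehat{\mathfrak{R}}_S(\mathcal{F})
    = \mathbb{E}_{\sigma}\!\left[\, \sup_{f \in \mathcal{F}}
        \frac{1}{n} \sum_{i=1}^{n} \sigma_i f(x_i) \right]
\]

% Population Rademacher complexity: the average over i.i.d. samples S ~ mu^n.
\[
  \mathfrak{R}_n(\mathcal{F})
    = \mathbb{E}_{S \sim \mu^n}\!\left[\, \widehat{\mathfrak{R}}_S(\mathcal{F}) \right]
\]

% Textbook generalization bound (assumption: every f in F maps into [0, 1]).
% With probability at least 1 - delta over the draw of S:
\[
  \sup_{f \in \mathcal{F}}
    \left( \mathbb{E}_{x \sim \mu}[f(x)] - \frac{1}{n} \sum_{i=1}^{n} f(x_i) \right)
    \le 2\, \mathfrak{R}_n(\mathcal{F}) + \sqrt{\frac{\log(1/\delta)}{2n}}
\]
```

In this form, the factor 2 comes from the symmetrization argument, which bounds the expected supremum by twice the population Rademacher complexity. The square-root term comes from McDiarmid's inequality: changing one sample point moves the supremum by at most 1/n.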

Tasks

LEMMA, PAC learning
