NeuroCodeBench: a plain C neural network benchmark for software verification

2023-09-07Code Available0· sign in to hype

Edoardo Manino, Rafael Sá Menezes, Fedor Shmarov, Lucas C. Cordeiro

Code Available — Be the first to reproduce this paper.

Code

github.com/emanino/plain_c_nn_benchmark
OfficialIn papernone★ 5
github.com/Yiannis128/plain_c_nn_benchmark
none★ 0

Abstract

Safety-critical systems with neural network components require strong guarantees. While existing neural network verification techniques have shown great progress towards this goal, they cannot prove the absence of software faults in the network implementation. This paper presents NeuroCodeBench - a verification benchmark for neural network code written in plain C. It contains 32 neural networks with 607 safety properties divided into 6 categories: maths library, activation functions, error-correcting networks, transfer function approximation, probability density estimation and reinforcement learning. Our preliminary evaluation shows that state-of-the-art software verifiers struggle to provide correct verdicts, due to their incomplete support of the standard C mathematical library and the complexity of larger neural networks.

Tasks

Density Estimation

NeuroCodeBench: a plain C neural network benchmark for software verification

Code

Abstract

Tasks

Reproductions