SOTAVerified

Benchmarking

Papers

Showing 10811090 of 5548 papers

TitleStatusHype
SpikeSim: An end-to-end Compute-in-Memory Hardware Evaluation Tool for Benchmarking Spiking Neural NetworksCode1
ESB: A Benchmark For Multi-Domain End-to-End Speech RecognitionCode1
A Survey on Graph Counterfactual Explanations: Definitions, Methods, Evaluation, and Research ChallengesCode1
RMBench: Benchmarking Deep Reinforcement Learning for Robotic Manipulator ControlCode1
Graphs, Constraints, and Search for the Abstraction and Reasoning CorpusCode1
An Open-source Benchmark of Deep Learning Models for Audio-visual Apparent and Self-reported Personality RecognitionCode1
KPI-EDGAR: A Novel Dataset and Accompanying Metric for Relation Extraction from Financial DocumentsCode1
iDNA-ABF: multi-scale deep biological language learning model for the interpretable prediction of DNA methylationsCode1
CAB: Comprehensive Attention Benchmarking on Long Sequence ModelingCode1
WILD-SCAV: Benchmarking FPS Gaming AI on Unity3D-based EnvironmentsCode1
Show:102550
← PrevPage 109 of 555Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified