SOTAVerified

Benchmarking

Papers

Showing 3621–3630 of 5548 papers

Title | Status | Hype
Learning Quantum Processes with Quantum Statistical Queries | Code | 0
EditVal: Benchmarking Diffusion Based Text-Guided Image Editing Methods | | 0
Benchmarking and Improving Generator-Validator Consistency of Language Models | | 0
CoDBench: A Critical Evaluation of Data-driven Models for Continuous Dynamical Systems | | 0
A New Real-World Video Dataset for the Comparison of Defogging Algorithms | | 0
TRAM: Benchmarking Temporal Reasoning for Large Language Models | | 0
Adaptive Visual Scene Understanding: Incremental Scene Graph Generation | Code | 0
The Sparsity Roofline: Understanding the Hardware Limits of Sparse Neural Networks | | 0
Adaptive Control of an Inverted Pendulum by a Reinforcement Learning-based LQR Method | | 0
Benchmarking Collaborative Learning Methods Cost-Effectiveness for Prostate Segmentation | | 0
Page 363 of 555

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | GPT-4 Turbo | ACC | 0.56 | | Unverified