SOTAVerified

Benchmarking

Papers

Showing 781790 of 5548 papers

TitleStatusHype
Benchmarking Data-driven Surrogate Simulators for Artificial Electromagnetic MaterialsCode1
Leveraging Trust for Joint Multi-Objective and Multi-Fidelity OptimizationCode1
AirSim Drone Racing LabCode1
A SWAT-based Reinforcement Learning Framework for Crop ManagementCode1
Exploring QUIC Dynamics: A Large-Scale Dataset for Encrypted Traffic AnalysisCode1
Benchmarking Actor-Critic Deep Reinforcement Learning Algorithms for Robotics Control with Action ConstraintsCode1
Chaos as an interpretable benchmark for forecasting and data-driven modellingCode1
FaceBench: A Multi-View Multi-Level Facial Attribute VQA Dataset for Benchmarking Face Perception MLLMsCode1
Benchmarking and Survey of Explanation Methods for Black Box ModelsCode1
Towards Motion Forecasting with Real-World Perception Inputs: Are End-to-End Approaches Competitive?Code1
Show:102550
← PrevPage 79 of 555Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified