SOTAVerified

Benchmarking

Papers

Showing 1-10 of 5548 papers

Title | Status | Hype
Visual Place Recognition for Large-Scale UAV Applications |  | 0
Disentangling coincident cell events using deep transfer learning and compressive sensing |  | 0
Training Transformers with Enforced Lipschitz Constants |  | 0
MUPAX: Multidimensional Problem Agnostic eXplainable AI |  | 0
DVFL-Net: A Lightweight Distilled Video Focal Modulation Network for Spatio-Temporal Action Recognition | Code | 0
DCR: Quantifying Data Contamination in LLMs Evaluation | Code | 0
A Multi-View High-Resolution Foot-Ankle Complex Point Cloud Dataset During Gait for Occlusion-Robust 3D Completion |  | 0
FLsim: A Modular and Library-Agnostic Simulation Framework for Federated Learning | Code | 0
DrafterBench: Benchmarking Large Language Models for Tasks Automation in Civil Engineering | Code | 2
CodeJudgeBench: Benchmarking LLM-as-a-Judge for Coding Tasks |  | 0

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | GPT-4 Turbo | ACC | 0.56 |  | Unverified