SOTAVerified

Benchmarking

Papers

Showing 591–600 of 5548 papers

| Title | Status | Hype |
|---|---|---|
| BEACON: A Benchmark for Efficient and Accurate Counting of Subgraphs | | 0 |
| BoTTA: Benchmarking on-device Test Time Adaptation | | 0 |
| Foundation Models for Remote Sensing: An Analysis of MLLMs for Object Localization | | 0 |
| COUNTS: Benchmarking Object Detectors and Multimodal Large Language Models under Distribution Shifts | | 0 |
| LMFormer: Lane based Motion Prediction Transformer | | 0 |
| Benchmarking 3D Human Pose Estimation Models Under Occlusions | | 0 |
| CameraBench: Benchmarking Visual Reasoning in MLLMs via Photography | | 0 |
| TinyverseGP: Towards a Modular Cross-domain Benchmarking Framework for Genetic Programming | Code | 1 |
| Beyond Chains of Thought: Benchmarking Latent-Space Reasoning Abilities in Large Language Models | | 0 |
| Trade-offs in Privacy-Preserving Eye Tracking through Iris Obfuscation: A Benchmarking Study | Code | 0 |

Benchmark Results

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | GPT-4 Turbo | ACC | 0.56 | | Unverified |