SOTAVerified

Benchmarking

Papers

Showing 15711580 of 5548 papers

TitleStatusHype
shapiq: Shapley Interactions for Machine LearningCode4
The Labyrinth of Links: Navigating the Associative Maze of Multi-modal LLMs0
Deep learning for action spotting in association football videos0
Benchmarking Large Language Models for Conversational Question Answering in Multi-instructional Documents0
FMBench: Benchmarking Fairness in Multimodal Large Language Models on Medical Tasks0
CXPMRG-Bench: Pre-training and Benchmarking for X-ray Medical Report Generation on CheXpert Plus Dataset0
Exploring QUIC Dynamics: A Large-Scale Dataset for Encrypted Traffic AnalysisCode1
ImmersePro: End-to-End Stereo Video Synthesis Via Implicit Disparity LearningCode0
Benchmarking Adaptive Intelligence and Computer Vision on Human-Robot Collaboration0
Q-Bench-Video: Benchmarking the Video Quality Understanding of LMMs0
Show:102550
← PrevPage 158 of 555Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified