SOTAVerified

Benchmarking

Papers

Showing 26812690 of 5548 papers

TitleStatusHype
TuringQ: Benchmarking AI Comprehension in Theory of ComputationCode0
OmniPose6D: Towards Short-Term Object Pose Tracking in Dynamic Scenes from Monocular RGB0
Benchmarking Data Heterogeneity Evaluation Approaches for Personalized Federated LearningCode0
InAttention: Linear Context Scaling for Transformers0
Analysis of different disparity estimation techniques on aerial stereo image datasets0
HERM: Benchmarking and Enhancing Multimodal LLMs for Human-Centric Understanding0
M3Bench: Benchmarking Whole-body Motion Generation for Mobile Manipulation in 3D Scenes0
Active Evaluation Acquisition for Efficient LLM Benchmarking0
Benchmarking of a new data splitting method on volcanic eruption data0
Manual Verbalizer Enrichment for Few-Shot Text Classification0
Show:102550
← PrevPage 269 of 555Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified