SOTAVerified

Benchmarking

Papers

Showing 34413450 of 5548 papers

TitleStatusHype
Low-resource Neural Machine Translation: Benchmarking State-of-the-art Transformer for Wolof<->French0
LSTM-based Whisper Detection0
Benchmarking M6 Competitors: An Analysis of Financial Metrics and Discussion of Incentives0
LucidDreaming: Controllable Object-Centric 3D Generation0
Benchmarking LLMs on the Semantic Overlap Summarization Task0
LUND-PROBE -- LUND Prostate Radiotherapy Open Benchmarking and Evaluation dataset0
Benchmarking LLMs in Recommendation Tasks: A Comparative Evaluation with Conventional Recommenders0
Towards a Human-Centred Cognitive Model of Visuospatial Complexity in Everyday Driving0
Benchmarking LLMs in Political Content Text-Annotation: Proof-of-Concept with Toxicity and Incivility Data0
M3Bench: Benchmarking Whole-body Motion Generation for Mobile Manipulation in 3D Scenes0
Show:102550
← PrevPage 345 of 555Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified