SOTAVerified

Benchmarking

Papers

Showing 42714280 of 5548 papers

TitleStatusHype
VFHQ: A High-Quality Dataset and Benchmark for Video Face Super-Resolution0
Surface Reconstruction from Point Clouds: A Survey and a Benchmark0
Learn-to-Race Challenge 2022: Benchmarking Safe Learning and Cross-domain Generalisation in Autonomous Racing0
On Continual Model Refinement in Out-of-Distribution Data Streams0
Training Mixed-Domain Translation Models via Federated Learning0
MSAMSum: Towards Benchmarking Multi-lingual Dialogue SummarizationCode0
MMCoQA: Conversational Question Answering over Text, Tables, and ImagesCode0
Fantastic Questions and Where to Find Them: FairytaleQA – An Authentic Dataset for Narrative Comprehension0
To Find Waldo You Need Contextual Cues: Debiasing Who’s WaldoCode0
Benchmarking Post-Hoc Interpretability Approaches for Transformer-based Misogyny DetectionCode0
Show:102550
← PrevPage 428 of 555Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified