SOTAVerified

Benchmarking

Papers

Showing 39013910 of 5548 papers

TitleStatusHype
Learn-to-Race Challenge 2022: Benchmarking Safe Learning and Cross-domain Generalisation in Autonomous Racing0
Surface Reconstruction from Point Clouds: A Survey and a Benchmark0
Creating a Forensic Database of Shoeprints from Online Shoe Tread PhotosCode1
On Continual Model Refinement in Out-of-Distribution Data Streams0
Training Mixed-Domain Translation Models via Federated Learning0
To Find Waldo You Need Contextual Cues: Debiasing Who’s WaldoCode0
MMCoQA: Conversational Question Answering over Text, Tables, and ImagesCode0
MSAMSum: Towards Benchmarking Multi-lingual Dialogue SummarizationCode0
Fantastic Questions and Where to Find Them: FairytaleQA – An Authentic Dataset for Narrative Comprehension0
Benchmarking Post-Hoc Interpretability Approaches for Transformer-based Misogyny DetectionCode0
Show:102550
← PrevPage 391 of 555Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified