SOTAVerified

Benchmarking

Papers

Showing 1451-1460 of 5548 papers

Title | Status | Hype
Towards Motion Forecasting with Real-World Perception Inputs: Are End-to-End Approaches Competitive? | Code | 1
Online Learning with Optimism and Delay | Code | 1
D2S: Document-to-Slide Generation Via Query-Based Text Summarization | Code | 1
CounselBench: A Large-Scale Expert Evaluation and Adversarial Benchmark of Large Language Models in Mental Health Counseling | Code | 1
CosPGD: an efficient white-box adversarial attack for pixel-wise prediction tasks | Code | 1
Coursera Corpus Mining and Multistage Fine-Tuning for Improving Lectures Translation | Code | 1
CHOICE: Benchmarking the Remote Sensing Capabilities of Large Vision-Language Models | Code | 1
OpenABC-D: A Large-Scale Dataset For Machine Learning Guided Integrated Circuit Synthesis | Code | 1
CovDocker: Benchmarking Covalent Drug Design with Tasks, Datasets, and Solutions | Code | 1
Benchmarking Graph Neural Networks on Dynamic Link Prediction | Code | 1

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | GPT-4 Turbo | ACC | 0.56 | - | Unverified