SOTAVerified

Benchmarking

Papers

Showing 241250 of 5548 papers

TitleStatusHype
DreamBench++: A Human-Aligned Benchmark for Personalized Image GenerationCode2
State-specific protein-ligand complex structure prediction with a multi-scale deep generative modelCode2
Fast Vision Transformers with HiLo AttentionCode2
Desbordante: from benchmarking suite to high-performance science-intensive data profiler (preprint)Code2
A Content-Driven Micro-Video Recommendation Dataset at ScaleCode2
MMLongBench-Doc: Benchmarking Long-context Document Understanding with VisualizationsCode2
Deep Visual Geo-localization BenchmarkCode2
A large annotated medical image dataset for the development and evaluation of segmentation algorithmsCode2
Datasets and Benchmarks for Offline Safe Reinforcement LearningCode2
Decouple and Track: Benchmarking and Improving Video Diffusion Transformers for Motion TransferCode2
Show:102550
← PrevPage 25 of 555Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified