SOTAVerified

Benchmarking

Papers

Showing 281290 of 5548 papers

TitleStatusHype
Extended Agriculture-Vision: An Extension of a Large Aerial Image Dataset for Agricultural Pattern AnalysisCode2
State-specific protein-ligand complex structure prediction with a multi-scale deep generative modelCode2
LLM-Based Multi-Agent Systems are Scalable Graph Generative ModelsCode2
EasyTPP: Towards Open Benchmarking Temporal Point ProcessesCode2
DreamBench++: A Human-Aligned Benchmark for Personalized Image GenerationCode2
MultiPL-E: A Scalable and Extensible Approach to Benchmarking Neural Code GenerationCode2
Aria Digital Twin: A New Benchmark Dataset for Egocentric 3D Machine PerceptionCode2
Deep Visual Geo-localization BenchmarkCode2
Decouple and Track: Benchmarking and Improving Video Diffusion Transformers for Motion TransferCode2
A Survey on Multimodal Benchmarks: In the Era of Large AI ModelsCode2
Show:102550
← PrevPage 29 of 555Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified