SOTAVerified

Benchmarking

Papers

Showing 361–370 of 5548 papers

Title | Status | Hype
State-specific protein-ligand complex structure prediction with a multi-scale deep generative model | Code | 2
Building Normalizing Flows with Stochastic Interpolants | Code | 2
MultiPL-E: A Scalable and Extensible Approach to Benchmarking Neural Code Generation | Code | 2
Panoptic Scene Graph Generation | Code | 2
Why do tree-based models still outperform deep learning on tabular data? | Code | 2
VMAS: A Vectorized Multi-Agent Simulator for Collective Robot Learning | Code | 2
Understanding Performance of Long-Document Ranking Models through Comprehensive Evaluation and Leaderboarding | Code | 2
DaisyRec 2.0: Benchmarking Recommendation for Rigorous Evaluation | Code | 2
The ArtBench Dataset: Benchmarking Generative Models with Artworks | Code | 2
Challenges and Opportunities in Offline Reinforcement Learning from Visual Observations | Code | 2

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | GPT-4 Turbo | ACC | 0.56 | | Unverified