SOTAVerified

Benchmarking

Papers

Showing 2371–2380 of 5548 papers

Title | Status | Hype
Unifying Large Language Model and Deep Reinforcement Learning for Human-in-Loop Interactive Socially-aware Navigation | - | 0
Broadening the Scope of Neural Network Potentials through Direct Inclusion of Additional Molecular Attributes | - | 0
Subjective Quality Assessment of Compressed Tone-Mapped High Dynamic Range Videos | - | 0
Can 3D Vision-Language Models Truly Understand Natural Language? | Code | 1
Benchmarking Chinese Commonsense Reasoning of LLMs: From Chinese-Specifics to Reasoning-Memorization Correlations | Code | 1
RoDLA: Benchmarking the Robustness of Document Layout Analysis Models | Code | 1
ChatGPT Alternative Solutions: Large Language Models Survey | - | 0
DomainLab: A modular Python package for domain generalization in deep learning | Code | 1
Practical End-to-End Optical Music Recognition for Pianoform Music | Code | 1
MARTA: a model for the automatic phonemic grouping of the parkinsonian speech | Code | 0

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | GPT-4 Turbo | ACC | 0.56 | - | Unverified