SOTAVerified

Benchmarking

Papers

Showing 22512260 of 5548 papers

TitleStatusHype
Dynamic Neighborhood Construction for Structured Large Discrete Action SpacesCode0
Harmonization Benchmarking Tool for Neuroimaging DatasetsCode0
Benchmarking Multimodal RAG through a Chart-based Document Question-Answering Generation FrameworkCode0
Guidelines and Benchmarks for Deployment of Deep Learning Models on Smartphones as Real-Time AppsCode0
Benchmarking Multimodal CoT Reward Model Stepwise by Visual ProgramCode0
A Seq2Seq approach to Symbolic RegressionCode0
Grounding Synthetic Data Evaluations of Language Models in Unsupervised Document CorporaCode0
gym-gazebo2, a toolkit for reinforcement learning using ROS 2 and GazeboCode0
Harnessing Orthogonality to Train Low-Rank Neural NetworksCode0
Benchmarking Multilabel Topic Classification in the Kyrgyz LanguageCode0
Show:102550
← PrevPage 226 of 555Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified