SOTAVerified

Benchmarking

Papers

Showing 30413050 of 5548 papers

TitleStatusHype
Improving Items and Contexts Understanding with Descriptive Graph for Conversational Recommendation0
Improving Medical Image Classification with Label Noise Using Dual-uncertainty Estimation0
Improving Model Generalization: A Chinese Named Entity Recognition Case Study0
Improving Named Entity Linking Corpora Quality0
Improving plant disease classification by adaptive minimal ensembling0
The Paradox of Success in Evolutionary and Bioinspired Optimization: Revisiting Critical Issues, Key Studies, and Methodological Pathways0
Improving Reference-based Distinctive Image Captioning with Contrastive Rewards0
Improving seasonal forecast using probabilistic deep learning0
The ParClusterers Benchmark Suite (PCBS): A Fine-Grained Analysis of Scalable Graph Clustering0
Improving the Validity and Practical Usefulness of AI/ML Evaluations Using an Estimands Framework0
Show:102550
← PrevPage 305 of 555Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified