SOTAVerified

Benchmarking

Papers

Showing 40414050 of 5548 papers

TitleStatusHype
Dual Task Framework for Improving Persona-grounded Dialogue Dataset0
High Fidelity RF Clutter Modeling and Simulation0
Lightweight Jet Reconstruction and Identification as an Object Detection Task0
BIQ2021: A Large-Scale Blind Image Quality Assessment Database0
ECRECer: Enzyme Commission Number Recommendation and Benchmarking based on Multiagent Dual-core LearningCode1
Comparative Study Between Distance Measures On Supervised Optimum-Path Forest ClassificationCode0
What are the best systems? New perspectives on NLP BenchmarkingCode1
RECOVER: sequential model optimization platform for combination drug repurposing identifies novel synergistic compounds in vitroCode1
Theory-inspired Parameter Control Benchmarks for Dynamic Algorithm ConfigurationCode0
Benchmarking and Analyzing Point Cloud Classification under CorruptionsCode1
Show:102550
← PrevPage 405 of 555Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified