SOTAVerified

Benchmarking

Papers

Showing 691700 of 5548 papers

TitleStatusHype
Benchmarking LLMs' Swarm intelligenceCode1
DivScene: Benchmarking LVLMs for Object Navigation with Diverse Scenes and ObjectsCode1
Align and Distill: Unifying and Improving Domain Adaptive Object DetectionCode1
Deep learning model solves change point detection for multiple change typesCode1
Deep Learning-Based Synchronization for Uplink NB-IoTCode1
Automated Model Design and Benchmarking of 3D Deep Learning Models for COVID-19 Detection with Chest CT ScansCode1
Benchmarking Meaning Representations in Neural Semantic ParsingCode1
DocuMint: Docstring Generation for Python using Small Language ModelsCode1
Do LLMs Recognize Your Preferences? Evaluating Personalized Preference Following in LLMsCode1
A Comprehensive Study on Large-Scale Graph Training: Benchmarking and RethinkingCode1
Show:102550
← PrevPage 70 of 555Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified