SOTAVerified

Benchmarking

Papers

Showing 4421-4430 of 5548 papers

Title | Status | Hype
Knowledge-Driven Slot Constraints for Goal-Oriented Dialogue Systems | Code | 0
CEBench: A Benchmarking Toolkit for the Cost-Effectiveness of LLM Pipelines | Code | 0
Causality-enhanced Decision-Making for Autonomous Mobile Robots in Dynamic Environments | Code | 0
Capsule Vision 2024 Challenge: Multi-Class Abnormality Classification for Video Capsule Endoscopy | Code | 0
Language-based Image Colorization: A Benchmark and Beyond | Code | 0
TF1-EN-3M: Three Million Synthetic Moral Fables for Training Small, Open Language Models | Code | 0
BenchENAS: A Benchmarking Platform for Evolutionary Neural Architecture Search | Code | 0
Knowing-how & Knowing-that: A New Task for Machine Comprehension of User Manuals | Code | 0
TFW2V: An Enhanced Document Similarity Method for the Morphologically Rich Finnish Language | Code | 0
Can Tree Based Approaches Surpass Deep Learning in Anomaly Detection? A Benchmarking Study | Code | 0

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | GPT-4 Turbo | ACC | 0.56 | - | Unverified