SOTAVerified

Benchmarking

Papers

Showing 2961-2970 of 5548 papers

Title | Status | Hype
ECKGBench: Benchmarking Large Language Models in E-commerce Leveraging Knowledge Graph | | 0
EconGym: A Scalable AI Testbed with Diverse Economic Tasks | | 0
EconWebArena: Benchmarking Autonomous Agents on Economic Tasks in Realistic Web Environments | | 0
Edge-Cloud Collaborative Computing on Distributed Intelligence and Model Optimization: A Survey | | 0
Edge-First Language Model Inference: Models, Metrics, and Tradeoffs | | 0
EdgeMark: An Automation and Benchmarking System for Embedded Artificial Intelligence Tools | | 0
EditVal: Benchmarking Diffusion Based Text-Guided Image Editing Methods | | 0
EEGS: A Transparent Model of Emotions | | 0
EffCNet: An Efficient CondenseNet for Image Classification on NXP BlueBox | | 0
Effective Evaluation of Deep Active Learning on Image Classification Tasks | | 0

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | GPT-4 Turbo | ACC | 0.56 | | Unverified