SOTAVerified

Benchmarking

Papers

Showing 22612270 of 5548 papers

TitleStatusHype
Open Datasets for Satellite Radio Resource Control0
A User-Centric Multi-Intent Benchmark for Evaluating Large Language ModelsCode1
The Adversarial AI-Art: Understanding, Generation, Detection, and Benchmarking0
EnzChemRED, a rich enzyme chemistry relation extraction dataset0
Experimental Validation of Ultrasound Beamforming with End-to-End Deep Learning for Single Plane Wave ImagingCode1
TAVGBench: Benchmarking Text to Audible-Video GenerationCode1
TeamTrack: A Dataset for Multi-Sport Multi-Object Tracking in Full-pitch Videos0
In-situ process monitoring and adaptive quality enhancement in laser additive manufacturing: a critical review0
Authentic Emotion Mapping: Benchmarking Facial Expressions in Real NewsCode0
Bridging the Gap Between Theory and Practice: Benchmarking Transfer Evolutionary Optimization0
Show:102550
← PrevPage 227 of 555Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified