SOTAVerified

Benchmarking

Papers

Showing 2021-2030 of 5548 papers

| Title | Status | Hype |
| --- | --- | --- |
| DeepFake Doctor: Diagnosing and Treating Audio-Video Fake Detection | | 0 |
| Evaluation of Human-AI Teams for Learned and Rule-Based Agents in Hanabi | | 0 |
| CayleyPy RL: Pathfinding and Reinforcement Learning on Cayley Graphs | | 0 |
| Benchmarking and Evaluation of AI Models in Biology: Outcomes and Recommendations from the CZI Virtual Cells Workshop | | 0 |
| Deep Generative Models for Physiological Signals: A Systematic Literature Review | | 0 |
| Deep Hedging of Long-Term Financial Derivatives | | 0 |
| An EEG-based Stereoscopic Research to Reveal the Brain's Response to What Happens Before and After Watching 2D and 3D Movies | | 0 |
| Deep Imputation of Missing Values in Time Series Health Data: A Review with Benchmarking | | 0 |
| CausalRivers -- Scaling up benchmarking of causal discovery for real-world time-series | | 0 |
| Benchmarking and Error Diagnosis in Multi-Instance Pose Estimation | | 0 |

Benchmark Results

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | GPT-4 Turbo | ACC | 0.56 | | Unverified |