SOTAVerified

Benchmarking

Papers

Showing 3881-3890 of 5548 papers

Title | Status | Hype
Sarcasm in Sight and Sound: Benchmarking and Expansion to Improve Multimodal Sarcasm Detection |  | 0
SASSE: Scalable and Adaptable 6-DOF Pose Estimation |  | 0
SATBench: Benchmarking LLMs' Logical Reasoning via Automated Puzzle Generation from SAT Formulas |  | 0
SAWNet: A Spatially Aware Deep Neural Network for 3D Point Cloud Processing |  | 0
Scaffold Splits Overestimate Virtual Screening Performance |  | 0
Scalable and Customizable Benchmark Problems for Many-Objective Optimization |  | 0
Scalable and Hybrid Ensemble-Based Causality Discovery |  | 0
Scalable, Distributed AI Frameworks: Leveraging Cloud Computing for Enhanced Deep Learning Performance and Efficiency |  | 0
Scalable Psychological Momentum Forecasting in Esports |  | 0
Automated Coding of Communications in Collaborative Problem-solving Tasks Using ChatGPT |  | 0

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | GPT-4 Turbo | ACC | 0.56 |  | Unverified