Benchmarking

Papers

Showing 3881–3890 of 5548 papers

Title	Date	Tasks	Status	Hype
Sarcasm in Sight and Sound: Benchmarking and Expansion to Improve Multimodal Sarcasm Detection	Sep 29, 2023	Benchmarking, Diversity	Unverified	0
SASSE: Scalable and Adaptable 6-DOF Pose Estimation	Feb 5, 2019	Benchmarking, Pose Estimation	Unverified	0
SATBench: Benchmarking LLMs' Logical Reasoning via Automated Puzzle Generation from SAT Formulas	May 20, 2025	Benchmarking, Logical Reasoning	Unverified	0
SAWNet: A Spatially Aware Deep Neural Network for 3D Point Cloud Processing	May 18, 2019	Benchmarking, Scene Segmentation	Unverified	0
Scaffold Splits Overestimate Virtual Screening Performance	Jun 2, 2024	Benchmarking, Clustering	Unverified	0
Scalable and Customizable Benchmark Problems for Many-Objective Optimization	Jan 26, 2020	Benchmarking, Position	Unverified	0
Scalable and Hybrid Ensemble-Based Causality Discovery	Dec 24, 2020	Benchmarking, Distributed Computing	Unverified	0
Scalable, Distributed AI Frameworks: Leveraging Cloud Computing for Enhanced Deep Learning Performance and Efficiency	Apr 26, 2023	Benchmarking, Cloud Computing	Unverified	0
Scalable Psychological Momentum Forecasting in Esports	Jan 30, 2020	Benchmarking	Unverified	0
Automated Coding of Communications in Collaborative Problem-solving Tasks Using ChatGPT	Nov 15, 2024	Benchmarking	Unverified	0

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	GPT-4 Turbo	ACC	0.56	—	Unverified