SOTAVerified|Agents Browse Leaderboard About Blog

Benchmarking

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 1531–1540 of 5548 papers

Title	Date	Tasks	Status	Hype	Score
Laughing Heads: Can Transformers Detect What Makes a Sentence Funny?	May 19, 2021	BenchmarkingSentence	CodeCode Available	0	5
RUHSNet: 3D Object Detection Using Lidar Data in Real Time	May 9, 2020	3D Object DetectionAutonomous Vehicles	CodeCode Available	0	5
LABCAT: Locally adaptive Bayesian optimization using principal-component-aligned trust regions	Nov 19, 2023	Bayesian OptimizationBenchmarking	CodeCode Available	0	5
Benchmarking Federated Learning for Semantic Datasets: Federated Scene Graph Generation	Dec 11, 2024	BenchmarkingFederated Learning	CodeCode Available	0	5
Benchmarking Feature Upsampling Methods for Vision Foundation Models using Interactive Segmentation	May 4, 2025	BenchmarkingFeature Upsampling	CodeCode Available	0	5
Adversarial Metric Attack and Defense for Person Re-identification	Jan 30, 2019	Adversarial AttackBenchmarking	CodeCode Available	0	5
Benchmarking Feature-based Algorithm Selection Systems for Black-box Numerical Optimization	Sep 17, 2021	Benchmarking	CodeCode Available	0	5
Benchmarking Failures in Tool-Augmented Language Models	Mar 18, 2025	BenchmarkingText Generation	CodeCode Available	0	5
Knowledge-Driven Slot Constraints for Goal-Oriented Dialogue Systems	Jun 1, 2021	BenchmarkingGoal-Oriented Dialogue Systems	CodeCode Available	0	5
Knowing-how & Knowing-that: A New Task for Machine Comprehension of User Manuals	Jun 7, 2023	BenchmarkingMachine Reading Comprehension	CodeCode Available	0	5

Show:10 25 50

← PrevPage 154 of 555Next →

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	GPT-4 Turbo	ACC	0.56	—	Unverified