SOTAVerified|Agents Browse Leaderboard About Blog

Benchmarking

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 3921–3930 of 5548 papers

Title	Date	Tasks	Status	Hype	Score
On Continual Model Refinement in Out-of-Distribution Data Streams	May 4, 2022	BenchmarkingContinual Learning	—Unverified	0	0
Active Learning for Community Detection in Stochastic Block Models	May 8, 2016	Active LearningBenchmarking	—Unverified	0	0
On-Device Self-Supervised Learning of Low-Latency Monocular Depth from Only Events	Dec 9, 2024	BenchmarkingComputational Efficiency	—Unverified	0	0
Benchmarking Audio Visual Segmentation for Long-Untrimmed Videos	Jan 1, 2024	Benchmarking	—Unverified	0	0
On Distribution Grid Optimal Power Flow Development and Integration	Dec 9, 2022	Benchmarking	—Unverified	0	0
ONEBench to Test Them All: Sample-Level Benchmarking Over Open-Ended Capabilities	Dec 9, 2024	AllBenchmarking	—Unverified	0	0
One Label, One Billion Faces: Usage and Consistency of Racial Categories in Computer Vision	Feb 3, 2021	BenchmarkingFairness	—Unverified	0	0
Audio Turing Test: Benchmarking the Human-likeness of Large Language Model-based Text-to-Speech Systems in Chinese	May 16, 2025	BenchmarkingLanguage Modeling	—Unverified	0	0
One of these (Few) Things is Not Like the Others	May 22, 2020	BenchmarkingFew-Shot Learning	—Unverified	0	0
Benchmarking Audio Deepfake Detection Robustness in Real-world Communication Scenarios	Apr 16, 2025	Audio Deepfake DetectionBenchmarking	—Unverified	0	0

Show:10 25 50

← PrevPage 393 of 555Next →

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	GPT-4 Turbo	ACC	0.56	—	Unverified