SOTAVerified|Agents Browse Leaderboard About Blog

Benchmarking

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 3371–3380 of 5548 papers

Title	Date	Tasks	Status	Hype	Score
Lightweight Jet Reconstruction and Identification as an Object Detection Task	Feb 9, 2022	Benchmarkingobject-detection	—Unverified	0	0
Solving excited states for long-range interacting trapped ions with neural networks	Jun 10, 2025	Benchmarking	—Unverified	0	0
Top Score on the Wrong Exam: On Benchmarking in Machine Learning for Vulnerability Detection	Aug 23, 2024	BenchmarkingBinary Classification	—Unverified	0	0
Benchmarking Multi-National Value Alignment for Large Language Models	Apr 17, 2025	Benchmarking	—Unverified	0	0
LIM: Large Interpolator Model for Dynamic Reconstruction	Mar 28, 2025	4D reconstructionBenchmarking	—Unverified	0	0
Advanced Manufacturing Configuration by Sample-efficient Batch Bayesian Optimization	May 24, 2022	Bayesian OptimizationBenchmarking	—Unverified	0	0
Line Goes Up? Inherent Limitations of Benchmarks for Evaluating Large Language Models	Feb 20, 2025	Benchmarking	—Unverified	0	0
Liquid State Genetic Programming	Dec 5, 2023	Benchmarking	—Unverified	0	0
Livestock Monitoring with Transformer	Nov 1, 2021	Action RecognitionBenchmarking	—Unverified	0	0
Benchmarking Multimodal Sentiment Analysis	Jul 29, 2017	BenchmarkingEmotion Recognition	—Unverified	0	0

Show:10 25 50

← PrevPage 338 of 555Next →

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	GPT-4 Turbo	ACC	0.56	—	Unverified