SOTAVerified|Agents Browse Leaderboard About Blog

Benchmarking

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 3891–3900 of 5548 papers

Title	Date	Tasks	Status	Hype	Score
Benchmarking Causal Study to Interpret Large Language Models for Source Code	Aug 23, 2023	BenchmarkingCausal Inference	—Unverified	0	0
Object Detection based on LIDAR Temporal Pulses using Spiking Neural Networks	Oct 29, 2018	Autonomous DrivingBenchmarking	—Unverified	0	0
Benchmarking Burst Super-Resolution for Polarization Images: Noise Dataset and Analysis	Mar 24, 2025	BenchmarkingImage Reconstruction	—Unverified	0	0
Benchmarking Bonus-Based Exploration Methods on the Arcade Learning Environment	Aug 6, 2019	Atari GamesBenchmarking	—Unverified	0	0
Benchmarking BioRelEx for Entity Tagging and Relation Extraction	May 31, 2020	BenchmarkingRelation	—Unverified	0	0
Benchmarking Biopharmaceuticals Retrieval-Augmented Generation Evaluation	Apr 15, 2025	BenchmarkingQuestion Answering	—Unverified	0	0
OctoPath: An OcTree Based Self-Supervised Learning Approach to Local Trajectory Planning for Mobile Robots	Jun 2, 2021	BenchmarkingDecoder	—Unverified	0	0
Benchmarking Biomedical Nested NER and Relation Extraction Models	Oct 16, 2021	BenchmarkingNER	—Unverified	0	0
OCTrack: Benchmarking the Open-Corpus Multi-Object Tracking	Jul 19, 2024	BenchmarkingMulti-Object Tracking	—Unverified	0	0
Benchmarking Bias in Large Language Models during Role-Playing	Nov 1, 2024	BenchmarkingFairness	—Unverified	0	0

Show:10 25 50

← PrevPage 390 of 555Next →

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	GPT-4 Turbo	ACC	0.56	—	Unverified