Benchmarking

Papers

Showing 2121–2130 of 5548 papers

Title	Date	Tasks	Status	Hype
EVOPS Benchmark: Evaluation of Plane Segmentation from RGBD and LiDAR Data	Apr 12, 2022	Benchmarking, Segmentation	Unverified	0
EXACT: Towards a platform for empirically benchmarking Machine Learning model explanation methods	May 20, 2024	Benchmarking, Explainable artificial intelligence	Unverified	0
Explicitly Multi-Modal Benchmarks for Multi-Objective Optimization	Oct 7, 2021	Benchmarking	Unverified	0
CALF: Benchmarking Evaluation of LFQA Using Chinese Examinations	Oct 2, 2024	Benchmarking, Long Form Question Answering	Unverified	0
Benchmarking a foundation LLM on its ability to re-label structure names in accordance with the AAPM TG-263 report	Oct 5, 2023	Benchmarking	Unverified	0
CAFA-evaluator: A Python Tool for Benchmarking Ontological Classification Methods	Oct 10, 2023	Benchmarking, Prediction	Unverified	0
Analyzing Multilingual Competency of LLMs in Multi-Turn Instruction Following: A Case Study of Arabic	Oct 23, 2023	Benchmarking, Instruction Following	Unverified	0
Quantum Similarity Testing with Convolutional Neural Networks	Nov 3, 2022	Benchmarking	Unverified	0
Ev-Layout: A Large-scale Event-based Multi-modal Dataset for Indoor Layout Estimation and Tracking	Mar 11, 2025	Benchmarking	Unverified	0
Byzantine-Robust and Communication-Efficient Distributed Learning via Compressed Momentum Filtering	Sep 13, 2024	Benchmarking, Binary Classification	Unverified	0


Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	GPT-4 Turbo	ACC	0.56	—	Unverified