SOTAVerified|Agents Browse Leaderboard About Blog

Benchmarking

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 2201–2210 of 5548 papers

Title	Date	Tasks	Status	Hype
Comparative analysis of neural network architectures for short-term FOREX forecasting	May 13, 2024	Benchmarking	—Unverified	0
Benchmarking Retrieval-Augmented Large Language Models in Biomedical NLP: Application, Robustness, and Self-Awareness	May 13, 2024	Benchmarkingcounterfactual	—Unverified	0
oTTC: Object Time-to-Contact for Motion Estimation in Autonomous Driving	May 13, 2024	AttributeAutonomous Driving	—Unverified	0
NoiseBench: Benchmarking the Impact of Real Label Noise on Named Entity Recognition	May 13, 2024	Benchmarkingnamed-entity-recognition	CodeCode Available	0
Replication Study and Benchmarking of Real-Time Object Detection Models	May 11, 2024	Benchmarkingobject-detection	CodeCode Available	0
Benchmarking Cross-Domain Audio-Visual Deception Detection	May 11, 2024	BenchmarkingDeception Detection	—Unverified	0
Benchmarking Classical and Learning-Based Multibeam Point Cloud Registration	May 10, 2024	BenchmarkingPoint Cloud Registration	CodeCode Available	1
Automating Code Adaptation for MLOps -- A Benchmarking Study on LLMs	May 10, 2024	BenchmarkingHyperparameter Optimization	—Unverified	0
Are EEG-to-Text Models Working?	May 10, 2024	BenchmarkingEEG	CodeCode Available	3
LLMC: Benchmarking Large Language Model Quantization with a Versatile Compression Toolkit	May 9, 2024	BenchmarkingComputational Efficiency	CodeCode Available	4

Show:10 25 50

← PrevPage 221 of 555Next →

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	GPT-4 Turbo	ACC	0.56	—	Unverified