SOTAVerified

Benchmarking

Papers

Showing 38913900 of 5548 papers

TitleStatusHype
Benchmarking Causal Study to Interpret Large Language Models for Source Code0
Object Detection based on LIDAR Temporal Pulses using Spiking Neural Networks0
Benchmarking Burst Super-Resolution for Polarization Images: Noise Dataset and Analysis0
Benchmarking Bonus-Based Exploration Methods on the Arcade Learning Environment0
Benchmarking BioRelEx for Entity Tagging and Relation Extraction0
Benchmarking Biopharmaceuticals Retrieval-Augmented Generation Evaluation0
OctoPath: An OcTree Based Self-Supervised Learning Approach to Local Trajectory Planning for Mobile Robots0
Benchmarking Biomedical Nested NER and Relation Extraction Models0
OCTrack: Benchmarking the Open-Corpus Multi-Object Tracking0
Benchmarking Bias in Large Language Models during Role-Playing0
Show:102550
← PrevPage 390 of 555Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified