SOTAVerified

Benchmarking

Papers

Showing 2201–2210 of 5548 papers

Title | Status | Hype
Comparative analysis of neural network architectures for short-term FOREX forecasting | | 0
Benchmarking Retrieval-Augmented Large Language Models in Biomedical NLP: Application, Robustness, and Self-Awareness | | 0
oTTC: Object Time-to-Contact for Motion Estimation in Autonomous Driving | | 0
NoiseBench: Benchmarking the Impact of Real Label Noise on Named Entity Recognition | Code | 0
Replication Study and Benchmarking of Real-Time Object Detection Models | Code | 0
Benchmarking Cross-Domain Audio-Visual Deception Detection | | 0
Benchmarking Classical and Learning-Based Multibeam Point Cloud Registration | Code | 1
Automating Code Adaptation for MLOps -- A Benchmarking Study on LLMs | | 0
Are EEG-to-Text Models Working? | Code | 3
LLMC: Benchmarking Large Language Model Quantization with a Versatile Compression Toolkit | Code | 4

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | GPT-4 Turbo | ACC | 0.56 | | Unverified