SOTAVerified

Benchmarking

Papers

Showing 22212230 of 5548 papers

TitleStatusHype
Revisiting a Pain in the Neck: Semantic Phrase Processing Benchmark for Language ModelsCode0
iSEARLE: Improving Textual Inversion for Zero-Shot Composed Image RetrievalCode2
Performance Evaluation of Real-Time Object Detection for Electric ScootersCode0
PhilHumans: Benchmarking Machine Learning for Personal Health0
Position: Quo Vadis, Unsupervised Time Series Anomaly Detection?Code1
Systematic Review: Anomaly Detection in Connected and Autonomous Vehicles0
A Normative Framework for Benchmarking Consumer Fairness in Large Language Model Recommender System0
Single and Multi-Hop Question-Answering Datasets for Reticular Chemistry with GPT-4-TurboCode0
Toward end-to-end interpretable convolutional neural networks for waveform signals0
CityLearn v2: Energy-flexible, resilient, occupant-centric, and carbon-aware management of grid-interactive communities0
Show:102550
← PrevPage 223 of 555Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified