SOTAVerified

Benchmarking

Papers

Showing 25812590 of 5548 papers

TitleStatusHype
Forecasting time series with constraintsCode0
Delving into Instance-Dependent Label Noise in Graph Data: A Comprehensive Study and BenchmarkCode0
Benchmarking Hierarchical Script KnowledgeCode0
FORLORN: A Framework for Comparing Offline Methods and Reinforcement Learning for Optimization of RAN ParametersCode0
Delta-Influence: Unlearning Poisons via Influence FunctionsCode0
Forecasting Across Time Series Databases using Recurrent Neural Networks on Groups of Similar Series: A Clustering ApproachCode0
Aesthetic Image Captioning From Weakly-Labelled PhotographsCode0
Defense-friendly Images in Adversarial Attacks: Dataset and Metrics for Perturbation DifficultyCode0
fMRI-S4: learning short- and long-range dynamic fMRI dependencies using 1D Convolutions and State Space ModelsCode0
DefAn: Definitive Answer Dataset for LLMs Hallucination EvaluationCode0
Show:102550
← PrevPage 259 of 555Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified