SOTAVerified

Benchmarking

Papers

Showing 4651–4660 of 5548 papers

Title | Status | Hype
Mamba-Based Ensemble learning for White Blood Cell Classification | Code | 0
Better Late Than Never: Formulating and Benchmarking Recommendation Editing | Code | 0
Better force fields start with better data -- A data set of cation dipeptide interactions | Code | 0
MANTRA: The Manifold Triangulations Assemblage | Code | 0
BeSt-LeS: Benchmarking Stroke Lesion Segmentation using Deep Supervision | Code | 0
debiaSAE: Benchmarking and Mitigating Vision-Language Model Bias | Code | 0
VizSeq: A Visual Analysis Toolkit for Text Generation Tasks | Code | 0
PATH: A Discrete-sequence Dataset for Evaluating Online Unsupervised Anomaly Detection Approaches for Multivariate Time Series | Code | 0
Hi Guys or Hi Folks? Benchmarking Gender-Neutral Machine Translation with the GeNTE Corpus | Code | 0
Margin-bounded Confidence Scores for Out-of-Distribution Detection | Code | 0

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | GPT-4 Turbo | ACC | 0.56 | - | Unverified