SOTAVerified

Benchmarking

Papers

Showing 1641–1650 of 5548 papers

Title | Status | Hype
A Neuro-Symbolic Framework for Sequence Classification with Relational and Temporal Knowledge | Code | 0
Knowing-how & Knowing-that: A New Task for Machine Comprehension of User Manuals | Code | 0
LANTERN: A Machine Learning Framework for Lipid Nanoparticle Transfection Efficiency Prediction | Code | 0
Benchmarking AutoML algorithms on a collection of synthetic classification problems | Code | 0
A Neuromorphic Dataset for Object Segmentation in Indoor Cluttered Environment | Code | 0
Is Your Model Fairly Certain? Uncertainty-Aware Fairness Evaluation for LLMs | Code | 0
A Neural-embedded Choice Model: TasteNet-MNL Modeling Taste Heterogeneity with Flexibility and Interpretability | Code | 0
Ab Initio Nonparametric Variable Selection for Scalable Symbolic Regression with Large p | Code | 0
DyKnow: Dynamically Verifying Time-Sensitive Factual Knowledge in LLMs | Code | 0
Benchmarking a transformer-FREE model for ad-hoc retrieval | Code | 0

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | GPT-4 Turbo | ACC | 0.56 | - | Unverified