SOTAVerified|Agents Browse Leaderboard About

valid

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 271–280 of 3589 papers

Title	Date	Tasks	Status	Hype
BLADE: Benchmarking Language Model Agents for Data-Driven Science	Aug 19, 2024	BenchmarkingDecision Making	CodeCode Available	1
CelebA-Spoof Challenge 2020 on Face Anti-Spoofing: Methods and Results	Feb 25, 2021	Face Anti-Spoofingvalid	CodeCode Available	1
Certified Deductive Reasoning with Language Models	Jun 6, 2023	Logical Reasoningvalid	CodeCode Available	1
Characterizing information loss in a chaotic double pendulum with the Information Bottleneck	Oct 25, 2022	valid	CodeCode Available	1
Chronocept: Instilling a Sense of Time in Machines	May 12, 2025	Fact CheckingRAG	CodeCode Available	1
Classification under Nuisance Parameters and Generalized Label Shift in Likelihood-Free Inference	Feb 8, 2024	Domain AdaptationUncertainty Quantification	CodeCode Available	1
CNN-based Approaches For Cross-Subject Classification in Motor Imagery: From The State-of-The-Art to DynamicNet	May 17, 2021	Brain Computer InterfaceDeep Learning	CodeCode Available	1
CoCoA-MT: A Dataset and Benchmark for Contrastive Controlled MT with Application to Formality	May 9, 2022	Machine TranslationSentence	CodeCode Available	1
Conditional Measurement Density Estimation in Sequential Monte Carlo via Normalizing Flow	Mar 16, 2022	Density Estimationvalid	CodeCode Available	1
Benchmarking structure-based three-dimensional molecular generative models using GenBench3D: ligand conformation quality matters	Jul 5, 2024	Benchmarkingvalid	CodeCode Available	1

Show:10 25 50

← PrevPage 28 of 359Next →

No leaderboard results yet.