SOTAVerified|Agents Browse Leaderboard About

valid

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 841–850 of 3589 papers

Title	Date	Tasks	Status	Hype	Score
CoSQA+: Pioneering the Multi-Choice Code Search Benchmark with Test-Driven Agents	Jun 17, 2024	Code GenerationCode Search	CodeCode Available	0	5
Increasing Probability Mass on Answer Choices Does Not Always Improve Accuracy	May 24, 2023	In-Context LearningMultiple-choice	CodeCode Available	0	5
EquivaMap: Leveraging LLMs for Automatic Equivalence Checking of Optimization Formulations	Feb 20, 2025	Combinatorial Optimizationvalid	CodeCode Available	0	5
A PAC-Bayes Analysis of Adversarial Robustness	Feb 19, 2021	Adversarial RobustnessGeneralization Bounds	CodeCode Available	0	5
Model Generalization: A Sharpness Aware Optimization Perspective	Aug 14, 2022	modelvalid	CodeCode Available	0	5
Enhancing reliability in prediction intervals using point forecasters: Heteroscedastic Quantile Regression and Width-Adaptive Conformal Inference	Jun 21, 2024	PredictionPrediction Intervals	CodeCode Available	0	5
Endogenous Macrodynamics in Algorithmic Recourse	Aug 16, 2023	counterfactualvalid	CodeCode Available	0	5
Instrumental Variable Estimation for Compositional Treatments	Jun 21, 2021	Diversityvalid	CodeCode Available	0	5
Employing self-supervised learning models for cross-linguistic child speech maturity classification	Jun 10, 2025	Self-Supervised Learningvalid	CodeCode Available	0	5
EmRel: Joint Representation of Entities and Embedded Relations for Multi-triple Extraction	Jul 1, 2022	Document-level Relation ExtractionJoint Entity and Relation Extraction	CodeCode Available	0	5

Show:10 25 50

← PrevPage 85 of 359Next →

No leaderboard results yet.