SOTAVerified|Agents Browse Leaderboard About

valid

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 201–210 of 3589 papers

Title	Date	Tasks	Status	Hype
Order Doesn't Matter, But Reasoning Does: Training LLMs with Order-Centric Augmentation	Feb 27, 2025	Data AugmentationLogical Reasoning	—Unverified	0
Talking to the brain: Using Large Language Models as Proxies to Model Brain Semantic Representation	Feb 26, 2025	Question Answeringvalid	—Unverified	0
Overcoming Dependent Censoring in the Evaluation of Survival Models	Feb 26, 2025	Survival Analysisvalid	CodeCode Available	0
Universality of conformal prediction under the assumption of randomness	Feb 26, 2025	Conformal PredictionPrediction	—Unverified	0
Shh, don't say that! Domain Certification in LLMs	Feb 26, 2025	valid	—Unverified	0
Uncertainty Quantification for LLM-Based Survey Simulations	Feb 25, 2025	SurveyUncertainty Quantification	—Unverified	0
Beyond In-Distribution Success: Scaling Curves of CoT Granularity for Language Model Generalization	Feb 25, 2025	Language ModelingLanguage Modelling	CodeCode Available	0
Data-Driven Input-Output Control Barrier Functions	Feb 24, 2025	State Estimationvalid	—Unverified	0
Quantifying Logical Consistency in Transformers via Query-Key Alignment	Feb 24, 2025	Logical Reasoningvalid	—Unverified	0
REGen: A Reliable Evaluation Framework for Generative Event Argument Extraction	Feb 24, 2025	Event Argument Extractionvalid	—Unverified	0

Show:10 25 50

← PrevPage 21 of 359Next →

No leaderboard results yet.