SOTAVerified

valid

Papers

Showing 201225 of 3589 papers

TitleStatusHype
Order Doesn't Matter, But Reasoning Does: Training LLMs with Order-Centric Augmentation0
Overcoming Dependent Censoring in the Evaluation of Survival ModelsCode0
Universality of conformal prediction under the assumption of randomness0
Talking to the brain: Using Large Language Models as Proxies to Model Brain Semantic Representation0
Shh, don't say that! Domain Certification in LLMs0
Uncertainty Quantification for LLM-Based Survey Simulations0
Beyond In-Distribution Success: Scaling Curves of CoT Granularity for Language Model GeneralizationCode0
Data-Driven Input-Output Control Barrier Functions0
Quantifying Logical Consistency in Transformers via Query-Key Alignment0
REGen: A Reliable Evaluation Framework for Generative Event Argument Extraction0
Your Assumed DAG is Wrong and Here's How To Deal With ItCode0
Auto-Bench: An Automated Benchmark for Scientific Discovery in LLMs0
Pricing Valid Cuts for Price-Match Equilibria0
EquivaMap: Leveraging LLMs for Automatic Equivalence Checking of Optimization FormulationsCode0
Towards a Perspectivist Turn in Argument Quality AssessmentCode0
Explainable Distributed Constraint Optimization Problems0
Conformal Prediction under Levy-Prokhorov Distribution Shifts: Robustness to Local and Global PerturbationsCode0
Generalization error bound for denoising score matching under relaxed manifold assumption0
What are Models Thinking about? Understanding Large Language Model Hallucinations "Psychology" through Model Inner State Analysis0
Likelihood-Ratio Regularized Quantile Regression: Adapting Conformal Prediction to High-Dimensional Covariate Shifts0
GiFT: Gibbs Fine-Tuning for Code GenerationCode0
Deep Incomplete Multi-view Learning via Cyclic Permutation of VAEs0
The Relationship between No-Regret Learning and Online Conformal Prediction0
A new and flexible class of sharp asymptotic time-uniform confidence sequences0
Self-Normalized Inference in (Quantile, Expected Shortfall) Regressions for Time Series0
Show:102550
← PrevPage 9 of 144Next →

No leaderboard results yet.