SOTAVerified

valid

Papers

Showing 626–650 of 3589 papers

Title | Status | Hype
Talking to the brain: Using Large Language Models as Proxies to Model Brain Semantic Representation |  | 0
Overcoming Dependent Censoring in the Evaluation of Survival Models | Code | 0
Uncertainty Quantification for LLM-Based Survey Simulations |  | 0
Beyond In-Distribution Success: Scaling Curves of CoT Granularity for Language Model Generalization | Code | 0
Data-Driven Input-Output Control Barrier Functions |  | 0
REGen: A Reliable Evaluation Framework for Generative Event Argument Extraction |  | 0
Quantifying Logical Consistency in Transformers via Query-Key Alignment |  | 0
Your Assumed DAG is Wrong and Here's How To Deal With It | Code | 0
Auto-Bench: An Automated Benchmark for Scientific Discovery in LLMs |  | 0
Pricing Valid Cuts for Price-Match Equilibria |  | 0
EquivaMap: Leveraging LLMs for Automatic Equivalence Checking of Optimization Formulations | Code | 0
Towards a Perspectivist Turn in Argument Quality Assessment | Code | 0
Explainable Distributed Constraint Optimization Problems |  | 0
What are Models Thinking about? Understanding Large Language Model Hallucinations "Psychology" through Model Inner State Analysis |  | 0
Conformal Prediction under Levy-Prokhorov Distribution Shifts: Robustness to Local and Global Perturbations | Code | 0
Generalization error bound for denoising score matching under relaxed manifold assumption |  | 0
Likelihood-Ratio Regularized Quantile Regression: Adapting Conformal Prediction to High-Dimensional Covariate Shifts |  | 0
GiFT: Gibbs Fine-Tuning for Code Generation | Code | 0
Deep Incomplete Multi-view Learning via Cyclic Permutation of VAEs |  | 0
The Relationship between No-Regret Learning and Online Conformal Prediction |  | 0
A new and flexible class of sharp asymptotic time-uniform confidence sequences |  | 0
Self-Normalized Inference in (Quantile, Expected Shortfall) Regressions for Time Series |  | 0
CRANE: Reasoning with constrained LLM generation |  | 0
Multi-Objective Planning with Contextual Lexicographic Reward Preferences |  | 0
Generalizability through Explainability: Countering Overfitting with Counterfactual Examples |  | 0
Page 26 of 144

No leaderboard results yet.