| Your Assumed DAG is Wrong and Here's How To Deal With It | Feb 24, 2025 | Causal Discovery, valid | Code Available | 0 |
| Auto-Bench: An Automated Benchmark for Scientific Discovery in LLMs | Feb 21, 2025 | scientific discovery, valid | —Unverified | 0 |
| Pricing Valid Cuts for Price-Match Equilibria | Feb 21, 2025 | valid | —Unverified | 0 |
| Towards a Perspectivist Turn in Argument Quality Assessment | Feb 20, 2025 | valid | Code Available | 0 |
| EquivaMap: Leveraging LLMs for Automatic Equivalence Checking of Optimization Formulations | Feb 20, 2025 | Combinatorial Optimization, valid | Code Available | 0 |
| Explainable Distributed Constraint Optimization Problems | Feb 19, 2025 | valid | —Unverified | 0 |
| Conformal Prediction under Levy-Prokhorov Distribution Shifts: Robustness to Local and Global Perturbations | Feb 19, 2025 | Conformal Prediction, Prediction | Code Available | 0 |
| Generalization error bound for denoising score matching under relaxed manifold assumption | Feb 19, 2025 | Denoising, valid | —Unverified | 0 |
| What are Models Thinking about? Understanding Large Language Model Hallucinations "Psychology" through Model Inner State Analysis | Feb 19, 2025 | Hallucination, Language Modeling | —Unverified | 0 |
| Likelihood-Ratio Regularized Quantile Regression: Adapting Conformal Prediction to High-Dimensional Covariate Shifts | Feb 18, 2025 | Conformal Prediction, image-classification | —Unverified | 0 |