| MedScore: Factuality Evaluation of Free-Form Medical Answers | May 24, 2025 | Form, Hallucination | Code Available | 0 |
| Crowdsourcing StoryLines: Harnessing the Crowd for Causal Relation Annotation | Aug 1, 2018 | Articles, Relation | Code Available | 0 |
| Sample Compression Unleashed: New Generalization Bounds for Real Valued Losses | Sep 26, 2024 | Generalization Bounds, valid | Code Available | 0 |
| Active, anytime-valid risk controlling prediction sets | Jun 15, 2024 | Prediction, valid | Code Available | 0 |
| Sampling with Mirrored Stein Operators | Jun 23, 2021 | valid | Code Available | 0 |
| Mesh-Informed Reduced Order Models for Aneurysm Rupture Risk Prediction | Oct 4, 2024 | Decision Making, valid | Code Available | 0 |
| A High-dimensional Convergence Theorem for U-statistics with Applications to Kernel-based Testing | Feb 11, 2023 | valid | Code Available | 0 |
| A Critical Analysis of Classifier Selection in Learned Bloom Filters | Nov 28, 2022 | valid | Code Available | 0 |
| Physics-Aware Combinatorial Assembly Sequence Planning using Data-free Action Masking | Aug 19, 2024 | Deep Reinforcement Learning, Object | Code Available | 0 |
| Physics-driven Fire Modeling from Multi-view Images | Apr 14, 2018 | Physical Simulations, valid | Code Available | 0 |
| Challenges in Markov chain Monte Carlo for Bayesian neural networks | Oct 15, 2019 | Bayesian Inference, valid | Code Available | 0 |
| AugSumm: towards generalizable speech summarization using synthetic labels from large language model | Jan 10, 2024 | Language Modeling, Language Modelling | Code Available | 0 |
| Cross-validation Confidence Intervals for Test Error | Jul 24, 2020 | valid | Code Available | 0 |
| Metric-Guided Conformal Bounds for Probabilistic Image Reconstruction | Apr 23, 2024 | Computed Tomography (CT), Conformal Prediction | Code Available | 0 |