| Order Doesn't Matter, But Reasoning Does: Training LLMs with Order-Centric Augmentation | Feb 27, 2025 | Data AugmentationLogical Reasoning | —Unverified | 0 |
| Talking to the brain: Using Large Language Models as Proxies to Model Brain Semantic Representation | Feb 26, 2025 | Question Answeringvalid | —Unverified | 0 |
| Overcoming Dependent Censoring in the Evaluation of Survival Models | Feb 26, 2025 | Survival Analysisvalid | CodeCode Available | 0 |
| Universality of conformal prediction under the assumption of randomness | Feb 26, 2025 | Conformal PredictionPrediction | —Unverified | 0 |
| Shh, don't say that! Domain Certification in LLMs | Feb 26, 2025 | valid | —Unverified | 0 |
| Uncertainty Quantification for LLM-Based Survey Simulations | Feb 25, 2025 | SurveyUncertainty Quantification | —Unverified | 0 |
| Beyond In-Distribution Success: Scaling Curves of CoT Granularity for Language Model Generalization | Feb 25, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Data-Driven Input-Output Control Barrier Functions | Feb 24, 2025 | State Estimationvalid | —Unverified | 0 |
| Quantifying Logical Consistency in Transformers via Query-Key Alignment | Feb 24, 2025 | Logical Reasoningvalid | —Unverified | 0 |
| REGen: A Reliable Evaluation Framework for Generative Event Argument Extraction | Feb 24, 2025 | Event Argument Extractionvalid | —Unverified | 0 |