| Verifiable Format Control for Large Language Model Generations | Feb 6, 2025 | BenchmarkingInstruction Following | —Unverified | 0 |
| WaferLLM: Large Language Model Inference at Wafer Scale | Feb 6, 2025 | GPULanguage Modeling | CodeCode Available | 2 |
| Robust Probabilistic Model Checking with Continuous Reward Domains | Feb 6, 2025 | Distributional Reinforcement Learningmodel | —Unverified | 0 |
| Do Large Language Model Benchmarks Test Reliability? | Feb 5, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| DiffListener: Discrete Diffusion Model for Listener Generation | Feb 5, 2025 | model | —Unverified | 0 |
| Large Language Model Guided Self-Debugging Code Generation | Feb 5, 2025 | Code GenerationComputational Efficiency | —Unverified | 0 |
| Path Planning for Masked Diffusion Model Sampling | Feb 5, 2025 | Code GenerationIn-Context Learning | —Unverified | 0 |
| Reusing Embeddings: Reproducible Reward Model Research in Large Language Model Alignment without GPUs | Feb 4, 2025 | Code GenerationLanguage Modeling | CodeCode Available | 2 |
| Unpaired Deblurring via Decoupled Diffusion Model | Feb 3, 2025 | DeblurringImage Deblurring | —Unverified | 0 |
| Eliciting Language Model Behaviors with Investigator Agents | Feb 3, 2025 | Bayesian InferenceHallucination | —Unverified | 0 |