| Collision- and Reachability-Aware Multi-Robot Control with Grounded LLM Planners | May 26, 2025 | MuJoCovalid | —Unverified | 0 |
| Optimal Conformal Prediction under Epistemic Uncertainty | May 25, 2025 | Conformal PredictionPrediction | CodeCode Available | 0 |
| NTIRE 2025 Challenge on Video Quality Enhancement for Video Conferencing: Datasets, Methods and Results | May 25, 2025 | validVideo Quality Assessment | CodeCode Available | 0 |
| Efficient Long CoT Reasoning in Small Language Models | May 24, 2025 | Mathematical Reasoningvalid | —Unverified | 0 |
| MedScore: Factuality Evaluation of Free-Form Medical Answers | May 24, 2025 | FormHallucination | CodeCode Available | 0 |
| Flexible MOF Generation with Torsion-Aware Flow Matching | May 23, 2025 | valid | —Unverified | 0 |
| Efficient Adaptive Experimentation with Non-Compliance | May 23, 2025 | valid | CodeCode Available | 0 |
| Effects of auditory distance cues and reverberation on spatial perception and listening strategies | May 23, 2025 | valid | CodeCode Available | 0 |
| Applications of Modular Co-Design for De Novo 3D Molecule Generation | May 23, 2025 | 3D Molecule GenerationDenoising | —Unverified | 0 |
| Anytime-valid, Bayes-assisted,Prediction-Powered Inference | May 23, 2025 | Predictionvalid | —Unverified | 0 |
| Graph Style Transfer for Counterfactual Explainability | May 23, 2025 | counterfactualCounterfactual Explanation | CodeCode Available | 0 |
| MuseRAG: Idea Originality Scoring At Scale | May 22, 2025 | RAGRetrieval-augmented Generation | CodeCode Available | 0 |
| Statistical Inference for Online Algorithms | May 22, 2025 | valid | CodeCode Available | 0 |
| A collaborative constrained graph diffusion model for the generation of realistic synthetic molecules | May 22, 2025 | valid | CodeCode Available | 0 |
| Statistical Test for Saliency Maps of Graph Neural Networks via Selective Inference | May 22, 2025 | valid | —Unverified | 0 |
| Loss-Guided Auxiliary Agents for Overcoming Mode Collapse in GFlowNets | May 21, 2025 | Diversityvalid | —Unverified | 0 |
| Are Vision-Language Models Safe in the Wild? A Meme-Based Benchmark Study | May 21, 2025 | valid | —Unverified | 0 |
| Improving LLM First-Token Predictions in Multiple-Choice Question Answering via Prefilling Attack | May 21, 2025 | Multiple-choiceMultiple Choice Question Answering (MCQA) | —Unverified | 0 |
| Projection-Based Correction for Enhancing Deep Inverse Networks | May 21, 2025 | valid | —Unverified | 0 |
| Valid Post-Contextual Bandit Inference | May 20, 2025 | Translationvalid | —Unverified | 0 |
| Learning to Insert for Constructive Neural Vehicle Routing Solver | May 20, 2025 | Model OptimizationPosition | —Unverified | 0 |
| Temporal Alignment of Time Sensitive Facts with Activation Engineering | May 20, 2025 | valid | —Unverified | 0 |
| A Comprehensive Benchmarking Platform for Deep Generative Models in Molecular Design | May 19, 2025 | BenchmarkingDrug Discovery | —Unverified | 0 |
| NTIRE 2025 Challenge on Efficient Burst HDR and Restoration: Datasets, Methods, and Results | May 17, 2025 | valid | —Unverified | 0 |
| Coherent Language Reconstruction from Brain Recordings with Flexible Multi-Modal Input Stimuli | May 15, 2025 | valid | —Unverified | 0 |