| Statistical Inference for Online Algorithms | May 22, 2025 | valid | CodeCode Available | 0 |
| MuseRAG: Idea Originality Scoring At Scale | May 22, 2025 | RAGRetrieval-augmented Generation | CodeCode Available | 0 |
| A collaborative constrained graph diffusion model for the generation of realistic synthetic molecules | May 22, 2025 | valid | CodeCode Available | 0 |
| Statistical Test for Saliency Maps of Graph Neural Networks via Selective Inference | May 22, 2025 | valid | —Unverified | 0 |
| Improving LLM First-Token Predictions in Multiple-Choice Question Answering via Prefilling Attack | May 21, 2025 | Multiple-choiceMultiple Choice Question Answering (MCQA) | —Unverified | 0 |
| Are Vision-Language Models Safe in the Wild? A Meme-Based Benchmark Study | May 21, 2025 | valid | —Unverified | 0 |
| Projection-Based Correction for Enhancing Deep Inverse Networks | May 21, 2025 | valid | —Unverified | 0 |
| ModelingAgent: Bridging LLMs and Mathematical Modeling for Real-World Challenges | May 21, 2025 | Mathvalid | CodeCode Available | 1 |
| Loss-Guided Auxiliary Agents for Overcoming Mode Collapse in GFlowNets | May 21, 2025 | Diversityvalid | —Unverified | 0 |
| Temporal Alignment of Time Sensitive Facts with Activation Engineering | May 20, 2025 | valid | —Unverified | 0 |