| PRS-Med: Position Reasoning Segmentation with Vision-Language Model in Medical Imaging | May 17, 2025 | Image SegmentationLanguage Modeling | —Unverified | 0 |
| Demystifying and Enhancing the Efficiency of Large Language Model Based Search Agents | May 17, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| SOCIA: An End-to-End Agentic Framework for Automated Cyber-Physical-Social Simulator Generation | May 17, 2025 | Code GenerationLanguage Modeling | —Unverified | 0 |
| CorBenchX: Large-Scale Chest X-Ray Error Dataset and Vision-Language Model Benchmark for Report Error Correction | May 17, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Reasoning Large Language Model Errors Arise from Hallucinating Critical Problem Features | May 17, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Noise Injection Systemically Degrades Large Language Model Safety Guardrails | May 16, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| An agentic system with reinforcement-learned subsystem improvements for parsing form-like documents | May 16, 2025 | FormLanguage Modeling | CodeCode Available | 0 |
| Token-Level Uncertainty Estimation for Large Language Model Reasoning | May 16, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| THELMA: Task Based Holistic Evaluation of Large Language Model Applications-RAG Question Answering | May 16, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Towards Cultural Bridge by Bahnaric-Vietnamese Translation Using Transfer Learning of Sequence-To-Sequence Pre-training Language Model | May 16, 2025 | Data AugmentationLanguage Modeling | —Unverified | 0 |