| MovSAM: A Single-image Moving Object Segmentation Framework Based on Deep Thinking | Apr 9, 2025 | Autonomous Driving, Language Modeling | Code Available | 0 |
| PAYADOR: A Minimalist Approach to Grounding Language Models on Structured Data for Interactive Storytelling and Role-playing Games | Apr 9, 2025 | Language Modeling, Language Modelling | Code Available | 0 |
| A Multi-Phase Analysis of Blood Culture Stewardship: Machine Learning Prediction, Expert Recommendation Assessment, and LLM Automation | Apr 9, 2025 | Diagnostic, Language Modeling | Unverified | 0 |
| Face-LLaVA: Facial Expression and Attribute Understanding through Instruction Tuning | Apr 9, 2025 | Action Unit Detection, Age Estimation | Unverified | 0 |
| Q-Agent: Quality-Driven Chain-of-Thought Image Restoration Agent through Robust Multimodal Large Language Model | Apr 9, 2025 | Image Quality Assessment, Image Restoration | Unverified | 0 |
| SafeChat: A Framework for Building Trustworthy Collaborative Assistants and a Case Study of its Usefulness | Apr 8, 2025 | Chatbot, Extractive Summarization | Code Available | 0 |
| ARLO: A Tailorable Approach for Transforming Natural Language Software Requirements into Architecture using LLMs | Apr 8, 2025 | Large Language Model | Unverified | 0 |
| Are Generative AI Agents Effective Personalized Financial Advisors? | Apr 8, 2025 | Large Language Model | Code Available | 0 |
| InstructMPC: A Human-LLM-in-the-Loop Framework for Context-Aware Control | Apr 8, 2025 | energy management, Language Modeling | Unverified | 0 |
| DoCIA: An Online Document-Level Context Incorporation Agent for Speech Translation | Apr 7, 2025 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 0 |