| video-SALMONN 2: Captioning-Enhanced Audio-Visual Large Language Models | Jun 18, 2025 | Audio captioningLarge Language Model | CodeCode Available | 2 |
| deepSURF: Detecting Memory Safety Vulnerabilities in Rust Through Fuzzing LLM-Augmented Harnesses | Jun 18, 2025 | Large Language Model | —Unverified | 0 |
| SonicVerse: Multi-Task Learning for Music Feature-Informed Captioning | Jun 18, 2025 | Caption GenerationDescriptive | CodeCode Available | 2 |
| LLM Agent for Hyper-Parameter Optimization | Jun 18, 2025 | Large Language Model | —Unverified | 0 |
| DisProtEdit: Exploring Disentangled Representations for Multi-Attribute Protein Editing | Jun 17, 2025 | AttributeDisentanglement | —Unverified | 0 |
| Thinking in Directivity: Speech Large Language Model for Multi-Talker Directional Speech Recognition | Jun 17, 2025 | Data AugmentationLanguage Modeling | —Unverified | 0 |
| FEAST: A Flexible Mealtime-Assistance System Towards In-the-Wild Personalization | Jun 17, 2025 | Large Language Model | —Unverified | 0 |
| Utility-Driven Speculative Decoding for Mixture-of-Experts | Jun 17, 2025 | GPULarge Language Model | —Unverified | 0 |
| ASCD: Attention-Steerable Contrastive Decoding for Reducing Hallucination in MLLM | Jun 17, 2025 | HallucinationLanguage Modeling | —Unverified | 0 |
| Don't Make It Up: Preserving Ignorance Awareness in LLM Fine-Tuning | Jun 17, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Unified Software Engineering agent as AI Software Engineer | Jun 17, 2025 | Large Language Model | —Unverified | 0 |
| Ring-lite: Scalable Reasoning via C3PO-Stabilized Reinforcement Learning for LLMs | Jun 17, 2025 | Data IntegrationLarge Language Model | —Unverified | 0 |
| From What to Respond to When to Respond: Timely Response Generation for Open-domain Dialogue Agents | Jun 17, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| RMIT-ADM+S at the SIGIR 2025 LiveRAG Challenge | Jun 17, 2025 | Answer GenerationLanguage Modeling | CodeCode Available | 1 |
| Bi-directional Context-Enhanced Speech Large Language Models for Multilingual Conversational ASR | Jun 16, 2025 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| CFBenchmark-MM: Chinese Financial Assistant Benchmark for Multimodal Large Language Model | Jun 16, 2025 | Decision MakingFinancial Analysis | —Unverified | 0 |
| Balancing Knowledge Delivery and Emotional Comfort in Healthcare Conversational Systems | Jun 16, 2025 | Large Language Model | —Unverified | 0 |
| Calibrated Predictive Lower Bounds on Time-to-Unsafe-Sampling in LLMs | Jun 16, 2025 | Conformal PredictionLarge Language Model | —Unverified | 0 |
| ProfiLLM: An LLM-Based Framework for Implicit Profiling of Chatbot Users | Jun 16, 2025 | ChatbotLarge Language Model | —Unverified | 0 |
| EmoNews: A Spoken Dialogue System for Expressive News Conversations | Jun 16, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| ASMR: Augmenting Life Scenario using Large Generative Models for Robotic Action Reflection | Jun 16, 2025 | Data AugmentationLarge Language Model | —Unverified | 0 |
| Stream-Omni: Simultaneous Multimodal Interactions with Large Language-Vision-Speech Model | Jun 16, 2025 | Large Language Modelmultimodal interaction | CodeCode Available | 5 |
| VIS-Shepherd: Constructing Critic for LLM-based Data Visualization Generation | Jun 16, 2025 | Data VisualizationLanguage Modeling | CodeCode Available | 0 |
| SciSage: A Multi-Agent Framework for High-Quality Scientific Survey Generation | Jun 15, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| The Foundation Cracks: A Comprehensive Study on Bugs and Testing Practices in LLM Libraries | Jun 14, 2025 | Bug fixingInference Optimization | —Unverified | 0 |