| LLMs in Coding and their Impact on the Commercial Software Engineering Landscape | Jun 19, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| AgentGroupChat-V2: Divide-and-Conquer Is What LLM-Based Multi-Agent System Need | Jun 18, 2025 | GSM8KHumanEval | CodeCode Available | 0 |
| Make Your AUV Adaptive: An Environment-Aware Reinforcement Learning Framework For Underwater Tasks | Jun 18, 2025 | Decision MakingLanguage Modeling | —Unverified | 0 |
| deepSURF: Detecting Memory Safety Vulnerabilities in Rust Through Fuzzing LLM-Augmented Harnesses | Jun 18, 2025 | Large Language Model | —Unverified | 0 |
| LLM Agent for Hyper-Parameter Optimization | Jun 18, 2025 | Large Language Model | —Unverified | 0 |
| RAS-Eval: A Comprehensive Benchmark for Security Evaluation of LLM Agents in Real-World Environments | Jun 18, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| ASCD: Attention-Steerable Contrastive Decoding for Reducing Hallucination in MLLM | Jun 17, 2025 | HallucinationLanguage Modeling | —Unverified | 0 |
| Ring-lite: Scalable Reasoning via C3PO-Stabilized Reinforcement Learning for LLMs | Jun 17, 2025 | Data IntegrationLarge Language Model | —Unverified | 0 |
| Utility-Driven Speculative Decoding for Mixture-of-Experts | Jun 17, 2025 | GPULarge Language Model | —Unverified | 0 |
| Thinking in Directivity: Speech Large Language Model for Multi-Talker Directional Speech Recognition | Jun 17, 2025 | Data AugmentationLanguage Modeling | —Unverified | 0 |
| From What to Respond to When to Respond: Timely Response Generation for Open-domain Dialogue Agents | Jun 17, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Don't Make It Up: Preserving Ignorance Awareness in LLM Fine-Tuning | Jun 17, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| DisProtEdit: Exploring Disentangled Representations for Multi-Attribute Protein Editing | Jun 17, 2025 | AttributeDisentanglement | —Unverified | 0 |
| FEAST: A Flexible Mealtime-Assistance System Towards In-the-Wild Personalization | Jun 17, 2025 | Large Language Model | —Unverified | 0 |
| Unified Software Engineering agent as AI Software Engineer | Jun 17, 2025 | Large Language Model | —Unverified | 0 |
| CFBenchmark-MM: Chinese Financial Assistant Benchmark for Multimodal Large Language Model | Jun 16, 2025 | Decision MakingFinancial Analysis | —Unverified | 0 |
| Calibrated Predictive Lower Bounds on Time-to-Unsafe-Sampling in LLMs | Jun 16, 2025 | Conformal PredictionLarge Language Model | —Unverified | 0 |
| Bi-directional Context-Enhanced Speech Large Language Models for Multilingual Conversational ASR | Jun 16, 2025 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Balancing Knowledge Delivery and Emotional Comfort in Healthcare Conversational Systems | Jun 16, 2025 | Large Language Model | —Unverified | 0 |
| VIS-Shepherd: Constructing Critic for LLM-based Data Visualization Generation | Jun 16, 2025 | Data VisualizationLanguage Modeling | CodeCode Available | 0 |
| EmoNews: A Spoken Dialogue System for Expressive News Conversations | Jun 16, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| ProfiLLM: An LLM-Based Framework for Implicit Profiling of Chatbot Users | Jun 16, 2025 | ChatbotLarge Language Model | —Unverified | 0 |
| ASMR: Augmenting Life Scenario using Large Generative Models for Robotic Action Reflection | Jun 16, 2025 | Data AugmentationLarge Language Model | —Unverified | 0 |
| SciSage: A Multi-Agent Framework for High-Quality Scientific Survey Generation | Jun 15, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Information Suppression in Large Language Models: Auditing, Quantifying, and Characterizing Censorship in DeepSeek | Jun 14, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |