| Chapter-Llama: Efficient Chaptering in Hour-Long Videos with LLMs | Mar 31, 2025 | Large Language ModelVideo Chaptering | CodeCode Available | 2 |
| Agents Under Siege: Breaking Pragmatic Multi-Agent LLM Systems with Optimized Prompt Attacks | Mar 31, 2025 | Adversarial AttackLarge Language Model | —Unverified | 0 |
| DrunkAgent: Stealthy Memory Corruption in LLM-Powered Recommender Agents | Mar 31, 2025 | Collaborative FilteringLarge Language Model | —Unverified | 0 |
| Aud-Sur: An Audio Analyzer Assistant for Audio Surveillance Applications | Mar 31, 2025 | Information RetrievalLanguage Modeling | —Unverified | 0 |
| TeleAntiFraud-28k: An Audio-Text Slow-Thinking Dataset for Telecom Fraud Detection | Mar 31, 2025 | Fraud DetectionLarge Language Model | CodeCode Available | 2 |
| Orchestrate Multimodal Data with Batch Post-Balancing to Accelerate Multimodal Large Language Model Training | Mar 31, 2025 | GPULanguage Modeling | —Unverified | 0 |
| Rethinking Key-Value Cache Compression Techniques for Large Language Model Serving | Mar 31, 2025 | Computational EfficiencyLanguage Modeling | CodeCode Available | 1 |
| DeepDubber-V1: Towards High Quality and Dialogue, Narration, Monologue Adaptive Movie Dubbing Via Multi-Modal Chain-of-Thoughts Reasoning Guidance | Mar 31, 2025 | Large Language Model | —Unverified | 0 |
| PromptDistill: Query-based Selective Token Retention in Intermediate Layers for Efficient Large Language Model Inference | Mar 30, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Whisper-LM: Improving ASR Models with Language Models for Low-Resource Languages | Mar 30, 2025 | Automatic Speech RecognitionLanguage Modeling | CodeCode Available | 1 |