| Optimizing Estonian TV Subtitles with Semi-supervised Learning and LLMs | Jan 9, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| UAV-VLA: Vision-Language-Action System for Large Scale Aerial Mission Generation | Jan 9, 2025 | Decision MakingLanguage Modeling | CodeCode Available | 2 |
| TreeKV: Smooth Key-Value Cache Compression with Tree Structures | Jan 9, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Centurio: On Drivers of Multilingual Ability of Large Vision-Language Model | Jan 9, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Using LLMs to Infer Non-Binary COVID-19 Sentiments of Chinese Micro-bloggers | Jan 9, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| LLaVA-Octopus: Unlocking Instruction-Driven Adaptive Projector Fusion for Video Understanding | Jan 9, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| A Text-Based Knowledge-Embedded Soft Sensing Modeling Approach for General Industrial Process Tasks Based on Large Language Model | Jan 9, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| End-to-End Bangla AI for Solving Math Olympiad Problem Benchmark: Leveraging Large Language Model Using Integrated Approach | Jan 8, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Robotic Programmer: Video Instructed Policy Code Generation for Robotic Manipulation | Jan 8, 2025 | Code GenerationLanguage Modeling | —Unverified | 0 |
| OpenOmni: Large Language Models Pivot Zero-shot Omnimodal Alignment across Language with Real-time Self-Aware Emotional Speech Synthesis | Jan 8, 2025 | DecoderEmotional Speech Synthesis | CodeCode Available | 2 |