| Shushing! Let's Imagine an Authentic Speech from the Silent Video | Mar 19, 2025 | cross-modal alignmentLanguage Modeling | —Unverified | 0 |
| RWKV-7 "Goose" with Expressive Dynamic State Evolution | Mar 18, 2025 | In-Context LearningLanguage Modeling | CodeCode Available | 9 |
| MoK-RAG: Mixture of Knowledge Paths Enhanced Retrieval-Augmented Generation for Embodied AI Environments | Mar 18, 2025 | Decision MakingLanguage Modeling | —Unverified | 0 |
| Good/Evil Reputation Judgment of Celebrities by LLMs via Retrieval Augmented Generation | Mar 18, 2025 | ArticlesLanguage Modeling | —Unverified | 0 |
| Towards a Barrier-free GeoQA Portal: Natural Language Interaction with Geospatial Data Using Multi-Agent LLMs and Semantic Search | Mar 18, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Enabling Inclusive Systematic Reviews: Incorporating Preprint Articles with Large Language Model-Driven Evaluations | Mar 18, 2025 | ArticlesLanguage Modeling | —Unverified | 0 |
| Tiled Flash Linear Attention: More Efficient Linear RNN and xLSTM Kernels | Mar 18, 2025 | GPULanguage Modeling | CodeCode Available | 2 |
| Layer-wise Adaptive Gradient Norm Penalizing Method for Efficient and Accurate Deep Learning | Mar 18, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| SpaceVLLM: Endowing Multimodal Large Language Model with Spatio-Temporal Video Grounding Capability | Mar 18, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| ChatBEV: A Visual Language Model that Understands BEV Maps | Mar 18, 2025 | Autonomous DrivingLanguage Modeling | —Unverified | 0 |
| The Empty Chair: Using LLMs to Raise Missing Perspectives in Policy Deliberations | Mar 18, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| VARP: Reinforcement Learning from Vision-Language Model Feedback with Agent Regularized Preferences | Mar 18, 2025 | continuous-controlContinuous Control | —Unverified | 0 |
| KVShare: An LLM Service System with Efficient and Effective Multi-Tenant KV Cache Reuse | Mar 17, 2025 | DiversityLanguage Modeling | —Unverified | 0 |
| HiDe-LLaVA: Hierarchical Decoupling for Continual Instruction Tuning of Multimodal Large Language Model | Mar 17, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| PANDORA: Diffusion Policy Learning for Dexterous Robotic Piano Playing | Mar 17, 2025 | DenoisingLanguage Modeling | —Unverified | 0 |
| Analytic Subspace Routing: How Recursive Least Squares Works in Continual Learning of Large Language Model | Mar 17, 2025 | Continual LearningLanguage Modeling | —Unverified | 0 |
| Agents Play Thousands of 3D Video Games | Mar 17, 2025 | FPS GamesLanguage Modeling | —Unverified | 0 |
| HybridGen: VLM-Guided Hybrid Planning for Scalable Data Generation of Imitation Learning | Mar 17, 2025 | Imitation LearningLanguage Modeling | —Unverified | 0 |
| High-entropy Advantage in Neural Networks' Generalizability | Mar 17, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Valid Text-to-SQL Generation with Unification-based DeepStochLog | Mar 17, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| MaTVLM: Hybrid Mamba-Transformer for Efficient Vision-Language Modeling | Mar 17, 2025 | GPULanguage Modeling | CodeCode Available | 2 |
| GeoRSMLLM: A Multimodal Large Language Model for Vision-Language Tasks in Geoscience and Remote Sensing | Mar 16, 2025 | Change DetectionImage Captioning | —Unverified | 0 |
| Does Your Vision-Language Model Get Lost in the Long Video Sampling Dilemma? | Mar 16, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| State Fourier Diffusion Language Model (SFDLM): A Scalable, Novel Iterative Approach to Language Modeling | Mar 16, 2025 | DenoisingLanguage Modeling | —Unverified | 0 |
| SVD-LLM V2: Optimizing Singular Value Truncation for Large Language Model Compression | Mar 16, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 3 |
| A Survey on the Optimization of Large Language Model-based Agents | Mar 16, 2025 | Decision MakingLanguage Modeling | CodeCode Available | 3 |
| LLM-Mediated Guidance of MARL Systems | Mar 16, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Applications of Large Language Model Reasoning in Feature Generation | Mar 15, 2025 | Computational EfficiencyDomain Adaptation | —Unverified | 0 |
| Maritime Mission Planning for Unmanned Surface Vessel using Large Language Model | Mar 15, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Unified Modeling Language Code Generation from Diagram Images Using Multimodal Large Language Models | Mar 15, 2025 | Code GenerationLanguage Modeling | CodeCode Available | 0 |
| Research on Large Language Model Cross-Cloud Privacy Protection and Collaborative Training based on Federated Learning | Mar 15, 2025 | Cloud ComputingFederated Learning | —Unverified | 0 |
| Tailor: An Integrated Text-Driven CG-Ready Human and Garment Generation System | Mar 15, 2025 | DiversityLanguage Modeling | —Unverified | 0 |
| Interpretation Gaps in LLM-Assisted Comprehension of Privacy Documents | Mar 15, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Test-Time Training Provably Improves Transformers as In-context Learners | Mar 14, 2025 | In-Context LearningLanguage Modeling | —Unverified | 0 |
| Large language model-powered AI systems achieve self-replication with no human intervention | Mar 14, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| CoLLMLight: Cooperative Large Language Model Agents for Network-Wide Traffic Signal Control | Mar 14, 2025 | Computational EfficiencyLanguage Modeling | CodeCode Available | 1 |
| Don't Forget It! Conditional Sparse Autoencoder Clamping Works for Unlearning | Mar 14, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| BriLLM: Brain-inspired Large Language Model | Mar 14, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| SmolDocling: An ultra-compact vision-language model for end-to-end multi-modal document conversion | Mar 14, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 15 |
| TigerLLM -- A Family of Bangla Large Language Models | Mar 14, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Towards Extreme Pruning of LLMs with Plug-and-Play Mixed Sparsity | Mar 14, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Text Compression for Efficient Language Generation | Mar 14, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Reasoning-Grounded Natural Language Explanations for Language Models | Mar 14, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Rule-Guided Feedback: Enhancing Reasoning by Enforcing Rule Adherence in Large Language Models | Mar 14, 2025 | Checkmate In OneGSM8K | —Unverified | 0 |
| Empowering Time Series Analysis with Synthetic Data: A Survey and Outlook in the Era of Foundation Models | Mar 14, 2025 | DiversityLanguage Modeling | —Unverified | 0 |
| LLM Agents for Education: Advances and Applications | Mar 14, 2025 | FairnessHallucination | —Unverified | 0 |
| Open3DVQA: A Benchmark for Comprehensive Spatial Reasoning with Multimodal Large Language Model in Open Space | Mar 14, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Generative Modeling for Mathematical Discovery | Mar 14, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| Potential of large language model-powered nudges for promoting daily water and energy conservation | Mar 14, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| MMS-LLaMA: Efficient LLM-based Audio-Visual Speech Recognition with Minimal Multimodal Speech Tokens | Mar 14, 2025 | Audio-Visual Speech RecognitionComputational Efficiency | CodeCode Available | 1 |