| Auditing Prompt Caching in Language Model APIs | Feb 11, 2025 | DecoderLanguage Modeling | CodeCode Available | 0 |
| Implicit Language Models are RNNs: Balancing Parallelization and Expressivity | Feb 10, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| AppVLM: A Lightweight Vision Language Model for Online App Control | Feb 10, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Steel-LLM:From Scratch to Open Source -- A Personal Journey in Building a Chinese-Centric LLM | Feb 10, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 4 |
| K-ON: Stacking Knowledge On the Head Layer of Large Language Model | Feb 10, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| ReasonFlux: Hierarchical LLM Reasoning via Scaling Thought Templates | Feb 10, 2025 | Hierarchical Reinforcement LearningLanguage Modeling | CodeCode Available | 4 |
| Recent Advances in Discrete Speech Tokens: A Review | Feb 10, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Structural Reformation of Large Language Model Neuron Encapsulation for Divergent Information Aggregation | Feb 10, 2025 | Decision MakingLanguage Modeling | —Unverified | 0 |
| RALLRec: Improving Retrieval Augmented Large Language Model Recommendation with Representation Learning | Feb 10, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Jakiro: Boosting Speculative Decoding with Decoupled Multi-Head via MoE | Feb 10, 2025 | DiversityLanguage Modeling | CodeCode Available | 1 |
| Rationalization Models for Text-to-SQL | Feb 10, 2025 | Knowledge DistillationLanguage Modeling | —Unverified | 0 |
| μnit Scaling: Simple and Scalable FP8 LLM Training | Feb 9, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| HSI: Head-Specific Intervention Can Induce Misaligned AI Coordination in Large Language Models | Feb 9, 2025 | Answer GenerationLanguage Modeling | CodeCode Available | 0 |
| Investigating Compositional Reasoning in Time Series Foundation Models | Feb 9, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Digital Twin Buildings: 3D Modeling, GIS Integration, and Visual Descriptions Using Gaussian Splatting, ChatGPT/Deepseek, and Google Maps Platform | Feb 9, 2025 | Decision MakingLanguage Modeling | —Unverified | 0 |
| Effective Black-Box Multi-Faceted Attacks Breach Vision Large Language Model Guardrails | Feb 9, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Enabling Autoregressive Models to Fill In Masked Tokens | Feb 9, 2025 | DecoderLanguage Modeling | —Unverified | 0 |
| Uni-Retrieval: A Multi-Style Retrieval Framework for STEM's Education | Feb 9, 2025 | Image RetrievalLanguage Modeling | —Unverified | 0 |
| Certifying Language Model Robustness with Fuzzed Randomized Smoothing: An Efficient Defense Against Backdoor Attacks | Feb 9, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| ScaffoldGPT: A Scaffold-based GPT Model for Drug Optimization | Feb 9, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| DexVLA: Vision-Language Model with Plug-In Diffusion Expert for General Robot Control | Feb 9, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| RECOVER: Designing a Large Language Model-based Remote Patient Monitoring System for Postoperative Gastrointestinal Cancer Care | Feb 9, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| UniCMs: A Unified Consistency Model For Efficient Multimodal Generation and Understanding | Feb 8, 2025 | DenoisingImage Generation | CodeCode Available | 1 |
| IndexTTS: An Industrial-Level Controllable and Efficient Zero-Shot Text-To-Speech System | Feb 8, 2025 | DecoderLanguage Modeling | CodeCode Available | 11 |
| Mix Data or Merge Models? Balancing the Helpfulness, Honesty, and Harmlessness of Large Language Model via Model Merging | Feb 8, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |