| Rethinking Homogeneity of Vision and Text Tokens in Large Vision-and-Language Models | Feb 4, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| MPIC: Position-Independent Multimodal Context Caching System for Efficient MLLM Serving | Feb 4, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| CITER: Collaborative Inference for Efficient Large Language Model Decoding with Token-Level Routing | Feb 4, 2025 | Collaborative InferenceLanguage Modeling | CodeCode Available | 1 |
| ComplexDec: A Domain-robust High-fidelity Neural Audio Codec with Complex Spectrum Modeling | Feb 4, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| EditIQ: Automated Cinematic Editing of Static Wide-Angle Videos via Dialogue Interpretation and Saliency Cues | Feb 4, 2025 | Dialogue InterpretationDialogue Understanding | —Unverified | 0 |
| Knowledge Synthesis of Photosynthesis Research Using a Large Language Model | Feb 3, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Eliciting Language Model Behaviors with Investigator Agents | Feb 3, 2025 | Bayesian InferenceHallucination | —Unverified | 0 |
| InfoBridge: Mutual Information estimation via Bridge Matching | Feb 3, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Scaling Embedding Layers in Language Models | Feb 3, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Learning to Learn Weight Generation via Local Consistency Diffusion | Feb 3, 2025 | Domain GeneralizationFew-Shot Learning | —Unverified | 0 |