| Multi-Layer Visual Feature Fusion in Multimodal LLMs: Methods, Analysis, and Best Practices | Mar 8, 2025 | Language Modeling | Code Available | 1 |
| From Captions to Rewards (CAREVL): Leveraging Large Language Model Experts for Enhanced Reward Modeling in Large Vision-Language Models | Mar 8, 2025 | Image Captioning, Language Modeling | Unverified | 0 |
| Next Token Is Enough: Realistic Image Quality and Aesthetic Scoring with Multimodal Large Language Model | Mar 8, 2025 | Image Quality Assessment, Language Modeling | Code Available | 2 |
| Phraselette: A Poet's Procedural Palette | Mar 8, 2025 | Language Modeling | Unverified | 0 |
| SpecServe: Efficient and SLO-Aware Large Language Model Serving with Adaptive Speculative Decoding | Mar 7, 2025 | Language Modeling | Unverified | 0 |
| Revitalizing Saturated Benchmarks: A Weighted Metric Approach for Differentiating Large Language Model Performance | Mar 7, 2025 | ARC, Language Modeling | Unverified | 0 |
| Is Your Video Language Model a Reliable Judge? | Mar 7, 2025 | Language Modeling | Unverified | 0 |
| Frequency Autoregressive Image Generation with Continuous Tokens | Mar 7, 2025 | Image Generation, Language Modeling | Unverified | 0 |
| R1-Omni: Explainable Omni-Multimodal Emotion Recognition with Reinforcement Learning | Mar 7, 2025 | Emotion Recognition, Language Modeling | Code Available | 5 |
| A Survey of Large Language Model Empowered Agents for Recommendation and Search: Towards Next-Generation Information Retrieval | Mar 7, 2025 | Information Retrieval, Language Modeling | Code Available | 2 |