| TiC-LM: A Web-Scale Benchmark for Time-Continual LLM Pretraining | Apr 2, 2025 | Continual Learning, Continual Pretraining | Code Available | 1 |
| Rethinking Key-Value Cache Compression Techniques for Large Language Model Serving | Mar 31, 2025 | Computational Efficiency, Language Modeling | Code Available | 1 |
| CrowdVLM-R1: Expanding R1 Ability to Vision Language Model for Crowd Counting using Fuzzy Group Relative Policy Reward | Mar 31, 2025 | Crowd Counting, Language Modeling | Code Available | 1 |
| Whisper-LM: Improving ASR Models with Language Models for Low-Resource Languages | Mar 30, 2025 | Automatic Speech Recognition, Language Modeling | Code Available | 1 |
| Imagine All The Relevance: Scenario-Profiled Indexing with Knowledge Expansion for Dense Retrieval | Mar 29, 2025 | All, Language Modeling | Code Available | 1 |
| BOLT: Boost Large Vision-Language Model Without Training for Long-form Video Understanding | Mar 27, 2025 | Form, Language Modeling | Code Available | 1 |
| OpenHuEval: Evaluating Large Language Model on Hungarian Specifics | Mar 27, 2025 | Language Modeling, Language Modelling | Code Available | 1 |
| CAFe: Unifying Representation and Generation with Contrastive-Autoregressive Finetuning | Mar 25, 2025 | Hallucination, Language Modeling | Code Available | 1 |
| LogQuant: Log-Distributed 2-Bit Quantization of KV Cache with Superior Accuracy Preservation | Mar 25, 2025 | Code Completion, Language Modeling | Code Available | 1 |
| CoLLM: A Large Language Model for Composed Image Retrieval | Mar 25, 2025 | Image Retrieval, Language Modeling | Code Available | 1 |