| Ocean-OCR: Towards General OCR Application via a Vision-Language Model | Jan 26, 2025 | document understanding, Language Modeling | Code Available | 1 |
| ARWKV: Pretrain is not what we need, an RNN-Attention-Based Language Model Born from Transformer | Jan 26, 2025 | Language Modeling, Language Modelling | Code Available | 1 |
| RealCritic: Towards Effectiveness-Driven Evaluation of Language Model Critiques | Jan 24, 2025 | Language Modeling, Language Modelling | Code Available | 1 |
| DRESSing Up LLM: Efficient Stylized Question-Answering via Style Subspace Editing | Jan 24, 2025 | Language Modeling, Language Modelling | Code Available | 1 |
| Enhancing Biomedical Relation Extraction with Directionality | Jan 23, 2025 | Benchmarking, Document-level Relation Extraction | Code Available | 1 |
| PAINT: Paying Attention to INformed Tokens to Mitigate Hallucination in Large Vision-Language Model | Jan 21, 2025 | Hallucination, Image Captioning | Code Available | 1 |
| Glinthawk: A Two-Tiered Architecture for Offline LLM Inference | Jan 20, 2025 | CPU, Language Modeling | Code Available | 1 |
| EndoChat: Grounded Multimodal Large Language Model for Endoscopic Surgery | Jan 20, 2025 | Language Modeling, Language Modelling | Code Available | 1 |
| AdaptiveLog: An Adaptive Log Analysis Framework with the Collaboration of Large and Small Language Model | Jan 19, 2025 | In-Context Learning, Language Modeling | Code Available | 1 |
| LAVCap: LLM-based Audio-Visual Captioning using Optimal Transport | Jan 16, 2025 | AudioCaps, Audio captioning | Code Available | 1 |