| LecEval: An Automated Metric for Multimodal Knowledge Acquisition in Multimedia Learning | May 4, 2025 | Language Modeling, Language Modelling | Code Available | 0 |
| What do Language Model Probabilities Represent? From Distribution Estimation to Response Prediction | May 4, 2025 | In-Context Learning, Language Modeling | —Unverified | 0 |
| R-Bench: Graduate-level Multi-disciplinary Benchmarks for LLM & MLLM Complex Reasoning Evaluation | May 4, 2025 | Language Model Evaluation, Language Modeling | —Unverified | 0 |
| Vision and Intention Boost Large Language Model in Long-Term Action Anticipation | May 3, 2025 | Action Anticipation, In-Context Learning | —Unverified | 0 |
| Accelerating Large Language Model Reasoning via Speculative Search | May 3, 2025 | Language Modeling, Language Modelling | —Unverified | 0 |
| Intra-Layer Recurrence in Transformers for Language Modeling | May 3, 2025 | Language Modeling, Language Modelling | Code Available | 0 |
| Any-to-Any Vision-Language Model for Multimodal X-ray Imaging and Radiological Report Generation | May 2, 2025 | Language Modeling, Language Modelling | —Unverified | 0 |
| Document Retrieval Augmented Fine-Tuning (DRAFT) for safety-critical software assessments | May 2, 2025 | Dataset Generation, Language Modeling | —Unverified | 0 |
| CodeSSM: Towards State Space Models for Code Understanding | May 2, 2025 | Clone Detection, Language Modeling | —Unverified | 0 |
| FlowDubber: Movie Dubbing with LLM-based Semantic-aware Learning and Flow Matching based Voice Enhancing | May 2, 2025 | Language Modeling, Language Modelling | —Unverified | 0 |