| VideoPoet: A Large Language Model for Zero-Shot Video Generation | Dec 21, 2023 | DecoderLanguage Modeling | —Unverified | 0 | 0 |
| Video Summarization with Large Language Models | Apr 15, 2025 | Large Language ModelVideo Summarization | —Unverified | 0 | 0 |
| Video-VoT-R1: An efficient video inference model integrating image packing and AoE architecture | Mar 20, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 | 0 |
| ViLLM-Eval: A Comprehensive Evaluation Suite for Vietnamese Large Language Models | Apr 17, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 | 0 |
| Vi-Mistral-X: Building a Vietnamese Language Model with Advanced Continual Pre-training | Mar 20, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 | 0 |
| VinaLLaMA: LLaMA-based Vietnamese Foundation Model | Dec 18, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 | 0 |
| Vintern-1B: An Efficient Multimodal Large Language Model for Vietnamese | Aug 22, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 | 0 |
| ViPer: Visual Personalization of Generative Models via Individual Preference Learning | Jul 24, 2024 | Image GenerationLanguage Modeling | —Unverified | 0 | 0 |
| VisCon-100K: Leveraging Contextual Web Data for Fine-tuning Vision Language Models | Feb 14, 2025 | Image CaptioningLarge Language Model | —Unverified | 0 | 0 |
| Vision and Intention Boost Large Language Model in Long-Term Action Anticipation | May 3, 2025 | Action AnticipationIn-Context Learning | —Unverified | 0 | 0 |