| Visual grounding for desktop graphical user interfaces | May 5, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Visual Grounding Strategies for Text-Only Natural Language Processing | Mar 25, 2021 | Image RetrievalLanguage Modeling | —Unverified | 0 |
| Visualizing and Explaining Language Models | Apr 30, 2022 | Deep LearningLanguage Modeling | —Unverified | 0 |
| Visualizing and Understanding the Effectiveness of BERT | Aug 15, 2019 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Visualizing attention zones in machine reading comprehension models | Oct 28, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Visualizing Linguistic Shift | Nov 20, 2016 | Document ClassificationLanguage Modeling | —Unverified | 0 |
| Visualizing the Content of a Children's Story in a Virtual World: Lessons Learned | Nov 1, 2016 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Visualizing the Relationship Between Encoded Linguistic Information and Task Performance | Mar 29, 2022 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Visual Language Modeling on CNN Image Representations | Nov 9, 2015 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Visual-Language Model Knowledge Distillation Method for Image Quality Assessment | Jul 21, 2025 | Image Quality AssessmentKnowledge Distillation | —Unverified | 0 |
| Semantically-Prompted Language Models Improve Visual Descriptions | Jun 5, 2023 | ClassificationDescriptive | —Unverified | 0 |
| Like a Baby: Visually Situated Neural Language Acquisition | May 29, 2018 | Language AcquisitionLanguage Modeling | —Unverified | 0 |
| VisualPredicator: Learning Abstract World Models with Neuro-Symbolic Predicates for Robot Planning | Oct 30, 2024 | Hierarchical Reinforcement LearningLanguage Modeling | —Unverified | 0 |
| Visual Program Distillation: Distilling Tools and Programmatic Reasoning into Vision-Language Models | Dec 5, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Visual Question Answering Instruction: Unlocking Multimodal Large Language Model To Domain-Specific Visual Multitasks | Feb 13, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Visual Speech Language Models | Sep 14, 2018 | Language ModelingLanguage Modelling | —Unverified | 0 |
| ViT3D Alignment of LLaMA3: 3D Medical Image Report Generation | Oct 11, 2024 | DiagnosticLanguage Modeling | —Unverified | 0 |
| ViTOC: Vision Transformer and Object-aware Captioner | Nov 9, 2024 | DiversityImage Captioning | —Unverified | 0 |
| VITRO: Vocabulary Inversion for Time-series Representation Optimization | Dec 23, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| VL-BEiT: Generative Vision-Language Pretraining | Jun 2, 2022 | image-classificationImage Classification | —Unverified | 0 |
| VL-Cache: Sparsity and Modality-Aware KV Cache Compression for Vision-Language Model Inference Acceleration | Oct 29, 2024 | GPULanguage Modeling | —Unverified | 0 |
| VLC Fusion: Vision-Language Conditioned Sensor Fusion for Robust Object Detection | May 19, 2025 | Autonomous DrivingLanguage Modeling | —Unverified | 0 |
| VLLFL: A Vision-Language Model Based Lightweight Federated Learning Framework for Smart Agriculture | Apr 17, 2025 | Federated LearningLanguage Modeling | —Unverified | 0 |
| VLM2Vec: Training Vision-Language Models for Massive Multimodal Embedding Tasks | Oct 7, 2024 | Information RetrievalLanguage Modeling | —Unverified | 0 |
| VLM-AD: End-to-End Autonomous Driving through Vision-Language Model Supervision | Dec 19, 2024 | Autonomous DrivingLanguage Modeling | —Unverified | 0 |
| VLMAE: Vision-Language Masked Autoencoder | Aug 19, 2022 | Image-text RetrievalLanguage Modeling | —Unverified | 0 |
| VL-Mamba: Exploring State Space Models for Multimodal Learning | Mar 20, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| VLMaterial: Procedural Material Generation with Large Vision-Language Models | Jan 27, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| VLM-HOI: Vision Language Models for Interpretable Human-Object Interaction Analysis | Nov 27, 2024 | Human-Object Interaction DetectionImage-text matching | —Unverified | 0 |
| VLM-KD: Knowledge Distillation from VLM for Long-Tail Visual Recognition | Aug 29, 2024 | Knowledge DistillationLanguage Modeling | —Unverified | 0 |
| VLM-PL: Advanced Pseudo Labeling Approach for Class Incremental Object Detection via Vision-Language Model | Mar 8, 2024 | Class-Incremental Object DetectionIncremental Learning | —Unverified | 0 |
| VLM-RRT: Vision Language Model Guided RRT Search for Autonomous UAV Navigation | May 29, 2025 | Disaster ResponseLanguage Modeling | —Unverified | 0 |
| VLM See, Robot Do: Human Demo Video to Robot Action Plan via Vision Language Model | Oct 11, 2024 | Common Sense ReasoningLanguage Modeling | —Unverified | 0 |
| VLM: Task-agnostic Video-Language Model Pre-training for Video Understanding | May 20, 2021 | Action SegmentationLanguage Modeling | —Unverified | 0 |
| VLN-Video: Utilizing Driving Videos for Outdoor Vision-and-Language Navigation | Feb 5, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| EVJVQA Challenge: Multilingual Visual Question Answering | Feb 23, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| VMAD: Visual-enhanced Multimodal Large Language Model for Zero-Shot Anomaly Detection | Sep 30, 2024 | Anomaly DetectionLanguage Modeling | —Unverified | 0 |
| Vocabulary Attack to Hijack Large Language Model Applications | Apr 3, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| VocalAgent: Large Language Models for Vocal Health Diagnostics with Safety-Aware Evaluation | May 19, 2025 | DiagnosticLanguage Modeling | —Unverified | 0 |
| VolDoGer: LLM-assisted Datasets for Domain Generalization in Vision-Language Tasks | Jul 29, 2024 | Deep LearningDomain Generalization | —Unverified | 0 |
| VoroNav: Voronoi-based Zero-shot Object Navigation with Large Language Model | Jan 5, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| VoxRep: Enhancing 3D Spatial Understanding in 2D Vision-Language Models via Voxel Representation | Mar 27, 2025 | Autonomous NavigationLanguage Modeling | —Unverified | 0 |
| Voxtlm: unified decoder-only models for consolidating speech recognition/synthesis and speech/text continuation tasks | Sep 14, 2023 | DecoderLanguage Modeling | —Unverified | 0 |
| VQ-Logits: Compressing the Output Bottleneck of Large Language Models via Vector Quantized Logits | May 15, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| VQ-T: RNN Transducers using Vector-Quantized Prediction Network States | Aug 3, 2022 | Language ModelingLanguage Modelling | —Unverified | 0 |
| VR-GPT: Visual Language Model for Intelligent Virtual Reality Applications | May 19, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| VSA4VQA: Scaling a Vector Symbolic Architecture to Visual Question Answering on Natural Images | May 6, 2024 | AttributeLanguage Modeling | —Unverified | 0 |
| VScan: Rethinking Visual Token Reduction for Efficient Large Vision-Language Models | May 28, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| VSLLaVA: a pipeline of large multimodal foundation model for industrial vibration signal analysis | Sep 3, 2024 | Fault DiagnosisLanguage Modeling | —Unverified | 0 |
| VTechAGP: An Academic-to-General-Audience Text Paraphrase Dataset and Benchmark Models | Nov 7, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |