| ViP-CNN: Visual Phrase Guided Convolutional Neural Network | Feb 23, 2017 | Descriptive, Image Captioning | —Unverified | 0 |
| Visual Relationship Detection Using Part-and-Sum Transformers with Composite Queries | May 5, 2021 | Human-Object Interaction Detection, Object | —Unverified | 0 |
| Visually Similar Products Retrieval for Shopsy | Oct 10, 2022 | Attribute, Image Compression | —Unverified | 0 |
| Visual Relationship Detection with Low Rank Non-Negative Tensor Decomposition | Nov 22, 2019 | Form, Relationship Detection | —Unverified | 0 |
| Visual-Semantic Matching by Exploring High-Order Attention and Distraction | Jun 1, 2020 | Attribute, Graph Attention | —Unverified | 0 |
| VividFace: A Diffusion-Based Hybrid Framework for High-Fidelity Video Face Swapping | Dec 15, 2024 | 3D Reconstruction, Attribute | —Unverified | 0 |
| VLM-HOI: Vision Language Models for Interpretable Human-Object Interaction Analysis | Nov 27, 2024 | Human-Object Interaction Detection, Image-text matching | —Unverified | 0 |
| Voice-Face Cross-modal Matching and Retrieval: A Benchmark | Nov 21, 2019 | Retrieval, Triplet | —Unverified | 0 |
| Watch Where You Head: A View-biased Domain Gap in Gait Recognition and Unsupervised Adaptation | Jul 13, 2023 | Domain Adaptation, Gait Recognition | —Unverified | 0 |
| Waveform Driven Plasticity in BiFeO3 Memristive Devices: Model and Implementation | Dec 1, 2012 | Triplet | —Unverified | 0 |