| VisuLogic: A Benchmark for Evaluating Visual Reasoning in Multi-modal Large Language Models | Apr 21, 2025 | Attribute, Visual Reasoning | —Unverified | 0 |
| VividFace: A Diffusion-Based Hybrid Framework for High-Fidelity Video Face Swapping | Dec 15, 2024 | 3D Reconstruction, Attribute | —Unverified | 0 |
| V-LASIK: Consistent Glasses-Removal from Videos Using Synthetic Data | Jun 20, 2024 | Attribute, Video Editing | —Unverified | 0 |
| VLForgery Face Triad: Detection, Localization and Attribution via Multimodal Large Language Models | Mar 8, 2025 | Attribute, DeepFake Detection | —Unverified | 0 |
| VLM-Guard: Safeguarding Vision-Language Models via Fulfilling Safety Alignment Gap | Feb 14, 2025 | Attribute, Safety Alignment | —Unverified | 0 |
| Voice Attribute Editing with Text Prompt | Apr 13, 2024 | Attribute | —Unverified | 0 |
| VoiceShop: A Unified Speech-to-Speech Framework for Identity-Preserving Zero-Shot Voice Editing | Apr 10, 2024 | Attribute | —Unverified | 0 |
| VoiSeR: A New Benchmark for Voice-Based Search Refinement | Apr 1, 2021 | Attribute, Conversational Search | —Unverified | 0 |
| Volumetric 3D Point Cloud Attribute Compression: Learned polynomial bilateral filter for prediction | Nov 22, 2023 | Attribute, Decoder | —Unverified | 0 |
| Volumetric Attribute Compression for 3D Point Clouds using Feedforward Network with Geometric Attention | Apr 1, 2023 | 3D geometry, Attribute | —Unverified | 0 |