| Visual Whole-Body Control for Legged Loco-Manipulation | Mar 25, 2024 | Position | —Unverified | 0 | 0 |
| ViTaPEs: Visuotactile Position Encodings for Cross-Modal Alignment in Multimodal Transformers | May 26, 2025 | cross-modal alignmentPosition | —Unverified | 0 | 0 |
| ViT-LSLA: Vision Transformer with Light Self-Limited-Attention | Oct 31, 2022 | Position | —Unverified | 0 | 0 |
| VM-BHINet:Vision Mamba Bimanual Hand Interaction Network for 3D Interacting Hand Mesh Recovery From a Single RGB Image | Apr 20, 2025 | 3D Interacting Hand Pose EstimationComputational Efficiency | —Unverified | 0 | 0 |
| Volumetric Supervised Contrastive Learning for Seismic Semantic Segmentation | Jun 16, 2022 | Contrastive LearningPosition | —Unverified | 0 | 0 |
| VQ3D: Learning a 3D-Aware Generative Model on ImageNet | Feb 14, 2023 | DecoderNeRF | —Unverified | 0 | 0 |
| VR IQA NET: Deep Virtual Reality Image Quality Assessment using Adversarial Learning | Apr 11, 2018 | Image Quality AssessmentPosition | —Unverified | 0 | 0 |
| Waveguide Division Multiple Access for Pinching-Antenna Systems (PASS) | Feb 25, 2025 | Position | —Unverified | 0 | 0 |
| Wavelet-based Positional Representation for Long Context | Feb 4, 2025 | Position | —Unverified | 0 | 0 |
| Wayfinding and cognitive maps for pedestrian models | Feb 5, 2016 | Position | —Unverified | 0 | 0 |