| Video based real-time positional tracker | Sep 17, 2020 | Position | —Unverified | 0 |
| VideoGen: Generative Modeling of Videos using VQ-VAE and Transformers | Jan 1, 2021 | PositionVideo Generation | —Unverified | 0 |
| Video Jigsaw: Unsupervised Learning of Spatiotemporal Context for Video Action Recognition | Aug 22, 2018 | Action RecognitionActivity Recognition | —Unverified | 0 |
| Video Motion Capture from the Part Confidence Maps of Multi-Camera Images by Spatiotemporal Filtering Using the Human Skeletal Model | Dec 9, 2019 | 3D ReconstructionPosition | —Unverified | 0 |
| View-Invariant Localization using Semantic Objects in Changing Environments | Sep 28, 2022 | Position | —Unverified | 0 |
| Visible light communication-based monitoring for indoor environments using unsupervised learning | Jan 20, 2021 | ObjectPosition | —Unverified | 0 |
| Vision-Assisted Digital Twin Creation for mmWave Beam Management | Jan 31, 2024 | ManagementPosition | —Unverified | 0 |
| Vision-Based Robust Lane Detection and Tracking under Different Challenging Environmental Conditions | Oct 19, 2022 | Lane DetectionPosition | —Unverified | 0 |
| Vision-Based Proprioceptive Sensing for Soft Inflatable Actuators | Sep 19, 2019 | Position | —Unverified | 0 |
| Vision-Based Safety System for Barrierless Human-Robot Collaboration | Aug 3, 2022 | Position | —Unverified | 0 |
| Vision-based Target Pose Estimation with Multiple Markers for the Perching of UAVs | Apr 25, 2023 | Pose EstimationPosition | —Unverified | 0 |
| Vision-Based Terrain Relative Navigation on High-Altitude Balloon and Sub-Orbital Rocket | Feb 16, 2023 | Position | —Unverified | 0 |
| Vision-based Vehicle Re-identification in Bridge Scenario using Flock Similarity | Mar 12, 2024 | License Plate RecognitionPosition | —Unverified | 0 |
| VISTA-LLAMA: Reducing Hallucination in Video Language Models via Equal Distance to Visual Tokens | Jan 1, 2024 | HallucinationPosition | —Unverified | 0 |
| Vista-LLaMA: Reliable Video Narrator via Equal Distance to Visual Tokens | Dec 12, 2023 | HallucinationPosition | —Unverified | 0 |
| Visual-based Positioning and Pose Estimation | Apr 20, 2022 | Pose EstimationPosition | —Unverified | 0 |
| Visual In-Context Learning for Large Vision-Language Models | Feb 18, 2024 | In-Context LearningPosition | —Unverified | 0 |
| Visual Layout Composer: Image-Vector Dual Diffusion Model for Design Layout Generation | Jan 1, 2024 | Layout DesignLayout Generation | —Unverified | 0 |
| Visual Modulation of Human Responses to Support Surface Translation | Mar 5, 2021 | PositionTranslation | —Unverified | 0 |
| Visual Odometry for RGB-D Cameras | Mar 28, 2022 | PositionVisual Odometry | —Unverified | 0 |
| Visual Place Recognition | Nov 26, 2022 | PositionVisual Place Recognition | —Unverified | 0 |
| Visual-tactile Fusion for Transparent Object Grasping in Complex Backgrounds | Nov 30, 2022 | ClassificationDataset Generation | —Unverified | 0 |
| Visual Tracking Using Pertinent Patch Selection and Masking | Jun 1, 2014 | PositionVisual Tracking | —Unverified | 0 |
| Visual Whole-Body Control for Legged Loco-Manipulation | Mar 25, 2024 | Position | —Unverified | 0 |
| ViTaPEs: Visuotactile Position Encodings for Cross-Modal Alignment in Multimodal Transformers | May 26, 2025 | cross-modal alignmentPosition | —Unverified | 0 |
| ViT-LSLA: Vision Transformer with Light Self-Limited-Attention | Oct 31, 2022 | Position | —Unverified | 0 |
| VM-BHINet:Vision Mamba Bimanual Hand Interaction Network for 3D Interacting Hand Mesh Recovery From a Single RGB Image | Apr 20, 2025 | 3D Interacting Hand Pose EstimationComputational Efficiency | —Unverified | 0 |
| Volumetric Supervised Contrastive Learning for Seismic Semantic Segmentation | Jun 16, 2022 | Contrastive LearningPosition | —Unverified | 0 |
| VQ3D: Learning a 3D-Aware Generative Model on ImageNet | Feb 14, 2023 | DecoderNeRF | —Unverified | 0 |
| VR IQA NET: Deep Virtual Reality Image Quality Assessment using Adversarial Learning | Apr 11, 2018 | Image Quality AssessmentPosition | —Unverified | 0 |
| Waveguide Division Multiple Access for Pinching-Antenna Systems (PASS) | Feb 25, 2025 | Position | —Unverified | 0 |
| Wavelet-based Positional Representation for Long Context | Feb 4, 2025 | Position | —Unverified | 0 |
| Wayfinding and cognitive maps for pedestrian models | Feb 5, 2016 | Position | —Unverified | 0 |
| Weak contraction mapping and optimization | May 1, 2019 | Position | —Unverified | 0 |
| Weakly Aligned Feature Fusion for Multimodal Object Detection | Apr 21, 2022 | Objectobject-detection | —Unverified | 0 |
| Weakly Supervised Disentangled Representation for Goal-conditioned Reinforcement Learning | Feb 28, 2022 | Positionreinforcement-learning | —Unverified | 0 |
| Weighted position value for Network games | Aug 7, 2023 | Position | —Unverified | 0 |
| Weighted Unsupervised Learning for 3D Object Detection | Feb 18, 2016 | 3D Object DetectionClustering | —Unverified | 0 |
| We need to talk about random seeds | Nov 16, 2021 | Position | —Unverified | 0 |
| What About Applied Fairness? | Jun 13, 2018 | FairnessPosition | —Unverified | 0 |
| What Are the Invariant Occlusive Components of Image Patches? A Probabilistic Generative Approach | Dec 1, 2013 | Position | —Unverified | 0 |
| What can Neural Referential Form Selectors Learn? | Aug 15, 2021 | FormPosition | —Unverified | 0 |
| What do you mean, BERT? Assessing BERT as a Distributional Semantics Model | Nov 13, 2019 | PositionSentence | —Unverified | 0 |
| What drives a goalkeepers' decisions? | Nov 1, 2022 | Position | —Unverified | 0 |
| What is Multimodality? | Mar 10, 2021 | BIG-bench Machine LearningPosition | —Unverified | 0 |
| What is the social benefit of hate speech detection research? A Systematic Review | Sep 26, 2024 | Hate Speech DetectionPosition | —Unverified | 0 |
| What is YOLOv5: A deep look into the internal features of the popular object detector | Jul 30, 2024 | Objectobject-detection | —Unverified | 0 |
| What Makes a Good Dataset for Symbol Description Reading? | Apr 17, 2023 | document understandingMath | —Unverified | 0 |
| What makes a good pause? Investigating the turn-holding effects of fillers | May 3, 2023 | Position | —Unverified | 0 |
| What Makes Popular Culture Popular? Product Features and Optimal Differentiation in Music | Sep 6, 2017 | Cultural Vocal Bursts Intensity PredictionPosition | —Unverified | 0 |