| VALL-T: Decoder-Only Generative Transducer for Robust and Decoding-Controllable Text-to-Speech | Jan 25, 2024 | DecoderHallucination | —Unverified | 0 | 0 |
| Variable Compliance Control for Robotic Peg-in-Hole Assembly: A Deep Reinforcement Learning Approach | Aug 24, 2020 | Deep Reinforcement LearningPosition | —Unverified | 0 | 0 |
| Variable-Speed Teaching-Playback as Real-World Data Augmentation for Imitation Learning | Dec 4, 2024 | Data AugmentationImitation Learning | —Unverified | 0 | 0 |
| Variable Stiffness for Robust Locomotion through Reinforcement Learning | Feb 13, 2025 | Positionreinforcement-learning | —Unverified | 0 | 0 |
| Goal-Conditioned Variational Autoencoder Trajectory Primitives with Continuous and Discrete Latent Codes | Dec 9, 2019 | Data AugmentationDecoder | —Unverified | 0 | 0 |
| Variational Information Bottleneck Model for Accurate Indoor Position Recognition | Jan 26, 2021 | DecoderPosition | —Unverified | 0 | 0 |
| Variational Tracking and Prediction with Generative Disentangled State-Space Models | Oct 14, 2019 | Bayesian InferencePosition | —Unverified | 0 | 0 |
| VECTORIZATION METHODS IN RECOMMENDER SYSTEM | Sep 27, 2018 | Collaborative FilteringPosition | —Unverified | 0 | 0 |
| (Vector) Space is Not the Final Frontier: Product Search as Program Synthesis | Apr 22, 2023 | Information RetrievalPosition | —Unverified | 0 | 0 |
| Vehicle Local Position Estimation System | Mar 23, 2015 | object-detectionObject Detection | —Unverified | 0 | 0 |
| Vehicle Speed Detecting App | Feb 17, 2017 | Position | —Unverified | 0 | 0 |
| Velocity integration in a multilayer neural field model of spatial working memory | Jan 16, 2017 | Position | —Unverified | 0 | 0 |
| Versatile optimization-based speed-up method for autofocusing in digital holographic microscopy | May 17, 2023 | Position | —Unverified | 0 | 0 |
| Vestibular Drop Attacks and Meniere's Disease as Results of Otolithic Membrane Damage -- A Numerical Model | Jul 3, 2021 | Position | —Unverified | 0 | 0 |
| VF-NeRF: Viewshed Fields for Rigid NeRF Registration | Apr 4, 2024 | NeRFPosition | —Unverified | 0 | 0 |
| VFRTok: Variable Frame Rates Video Tokenizer with Duration-Proportional Information Assumption | May 17, 2025 | DecoderPosition | —Unverified | 0 | 0 |
| Vibration Compensation of Delta 3D Printer with Position-varying Dynamics using Filtered B-Splines | Sep 14, 2022 | Position | —Unverified | 0 | 0 |
| Video based real-time positional tracker | Sep 17, 2020 | Position | —Unverified | 0 | 0 |
| VideoGen: Generative Modeling of Videos using VQ-VAE and Transformers | Jan 1, 2021 | PositionVideo Generation | —Unverified | 0 | 0 |
| Video Jigsaw: Unsupervised Learning of Spatiotemporal Context for Video Action Recognition | Aug 22, 2018 | Action RecognitionActivity Recognition | —Unverified | 0 | 0 |
| Video Motion Capture from the Part Confidence Maps of Multi-Camera Images by Spatiotemporal Filtering Using the Human Skeletal Model | Dec 9, 2019 | 3D ReconstructionPosition | —Unverified | 0 | 0 |
| View-Invariant Localization using Semantic Objects in Changing Environments | Sep 28, 2022 | Position | —Unverified | 0 | 0 |
| Visible light communication-based monitoring for indoor environments using unsupervised learning | Jan 20, 2021 | ObjectPosition | —Unverified | 0 | 0 |
| Vision-Assisted Digital Twin Creation for mmWave Beam Management | Jan 31, 2024 | ManagementPosition | —Unverified | 0 | 0 |
| Vision-Based Robust Lane Detection and Tracking under Different Challenging Environmental Conditions | Oct 19, 2022 | Lane DetectionPosition | —Unverified | 0 | 0 |
| Vision-Based Proprioceptive Sensing for Soft Inflatable Actuators | Sep 19, 2019 | Position | —Unverified | 0 | 0 |
| Vision-Based Safety System for Barrierless Human-Robot Collaboration | Aug 3, 2022 | Position | —Unverified | 0 | 0 |
| Vision-based Target Pose Estimation with Multiple Markers for the Perching of UAVs | Apr 25, 2023 | Pose EstimationPosition | —Unverified | 0 | 0 |
| Vision-Based Terrain Relative Navigation on High-Altitude Balloon and Sub-Orbital Rocket | Feb 16, 2023 | Position | —Unverified | 0 | 0 |
| Vision-based Vehicle Re-identification in Bridge Scenario using Flock Similarity | Mar 12, 2024 | License Plate RecognitionPosition | —Unverified | 0 | 0 |
| VISTA-LLAMA: Reducing Hallucination in Video Language Models via Equal Distance to Visual Tokens | Jan 1, 2024 | HallucinationPosition | —Unverified | 0 | 0 |
| Vista-LLaMA: Reliable Video Narrator via Equal Distance to Visual Tokens | Dec 12, 2023 | HallucinationPosition | —Unverified | 0 | 0 |
| Visual-based Positioning and Pose Estimation | Apr 20, 2022 | Pose EstimationPosition | —Unverified | 0 | 0 |
| Visual In-Context Learning for Large Vision-Language Models | Feb 18, 2024 | In-Context LearningPosition | —Unverified | 0 | 0 |
| Visual Layout Composer: Image-Vector Dual Diffusion Model for Design Layout Generation | Jan 1, 2024 | Layout DesignLayout Generation | —Unverified | 0 | 0 |
| Visual Modulation of Human Responses to Support Surface Translation | Mar 5, 2021 | PositionTranslation | —Unverified | 0 | 0 |
| Visual Odometry for RGB-D Cameras | Mar 28, 2022 | PositionVisual Odometry | —Unverified | 0 | 0 |
| Visual Place Recognition | Nov 26, 2022 | PositionVisual Place Recognition | —Unverified | 0 | 0 |
| Visual-tactile Fusion for Transparent Object Grasping in Complex Backgrounds | Nov 30, 2022 | ClassificationDataset Generation | —Unverified | 0 | 0 |
| Visual Tracking Using Pertinent Patch Selection and Masking | Jun 1, 2014 | PositionVisual Tracking | —Unverified | 0 | 0 |
| Visual Whole-Body Control for Legged Loco-Manipulation | Mar 25, 2024 | Position | —Unverified | 0 | 0 |
| ViTaPEs: Visuotactile Position Encodings for Cross-Modal Alignment in Multimodal Transformers | May 26, 2025 | cross-modal alignmentPosition | —Unverified | 0 | 0 |
| ViT-LSLA: Vision Transformer with Light Self-Limited-Attention | Oct 31, 2022 | Position | —Unverified | 0 | 0 |
| VM-BHINet:Vision Mamba Bimanual Hand Interaction Network for 3D Interacting Hand Mesh Recovery From a Single RGB Image | Apr 20, 2025 | 3D Interacting Hand Pose EstimationComputational Efficiency | —Unverified | 0 | 0 |
| Volumetric Supervised Contrastive Learning for Seismic Semantic Segmentation | Jun 16, 2022 | Contrastive LearningPosition | —Unverified | 0 | 0 |
| VQ3D: Learning a 3D-Aware Generative Model on ImageNet | Feb 14, 2023 | DecoderNeRF | —Unverified | 0 | 0 |
| VR IQA NET: Deep Virtual Reality Image Quality Assessment using Adversarial Learning | Apr 11, 2018 | Image Quality AssessmentPosition | —Unverified | 0 | 0 |
| Waveguide Division Multiple Access for Pinching-Antenna Systems (PASS) | Feb 25, 2025 | Position | —Unverified | 0 | 0 |
| Wavelet-based Positional Representation for Long Context | Feb 4, 2025 | Position | —Unverified | 0 | 0 |
| Wayfinding and cognitive maps for pedestrian models | Feb 5, 2016 | Position | —Unverified | 0 | 0 |