| Visatronic: A Multimodal Decoder-Only Model for Speech Synthesis | Nov 26, 2024 | Decodermultimodal generation | —Unverified | 0 | 0 |
| VisLanding: Monocular 3D Perception for UAV Safe Landing via Depth-Normal Synergy | Jun 17, 2025 | Decision MakingSemantic Segmentation | —Unverified | 0 | 0 |
| Visual Image Reconstruction from Brain Activity via Latent Representation | May 13, 2025 | Early ClassificationImage Reconstruction | —Unverified | 0 | 0 |
| ViTaPEs: Visuotactile Position Encodings for Cross-Modal Alignment in Multimodal Transformers | May 26, 2025 | cross-modal alignmentPosition | —Unverified | 0 | 0 |
| VQ-AR: Vector Quantized Autoregressive Probabilistic Time Series Forecasting | May 31, 2022 | Decision MakingInductive Bias | —Unverified | 0 | 0 |
| WeLM: A Well-Read Pre-trained Language Model for Chinese | Sep 21, 2022 | Language ModelingLanguage Modelling | —Unverified | 0 | 0 |
| What Matters for Model Merging at Scale? | Oct 4, 2024 | modelTask Arithmetic | —Unverified | 0 | 0 |
| What Matters to You? Towards Visual Representation Alignment for Robot Learning | Oct 11, 2023 | Zero-shot Generalization | —Unverified | 0 | 0 |
| WHISTRESS: Enriching Transcriptions with Sentence Stress Detection | May 25, 2025 | SentenceZero-shot Generalization | —Unverified | 0 | 0 |
| WiFo: Wireless Foundation Model for Channel Prediction | Dec 12, 2024 | modelMulti-Task Learning | —Unverified | 0 | 0 |
| Words in Motion: Extracting Interpretable Control Vectors for Motion Transformers | Jun 17, 2024 | Motion ForecastingZero-shot Generalization | —Unverified | 0 | 0 |
| WRT-SAM: Foundation Model-Driven Segmentation for Generalized Weld Radiographic Testing | Feb 17, 2025 | Anomaly DetectionImage Segmentation | —Unverified | 0 | 0 |
| ZeroPrompt: Scaling Prompt-Based Pretraining to 1,000 Tasks Improves Zero-Shot Generalization | Jan 18, 2022 | Zero-shot GeneralizationZero-Shot Learning | —Unverified | 0 | 0 |
| Zero-shot Audio Source Separation through Query-based Learningfrom Weakly-labeled Data | Dec 15, 2021 | Audio Source SeparationEvent Detection | —Unverified | 0 | 0 |
| Zero-shot Domain Generalization of Foundational Models for 3D Medical Image Segmentation: An Experimental Study | Mar 28, 2025 | Domain GeneralizationImage Segmentation | —Unverified | 0 | 0 |
| Zero-Shot Generalization for Blockage Localization in mmWave Communication | Dec 18, 2024 | Self-Supervised LearningZero-shot Generalization | —Unverified | 0 | 0 |
| Zero-shot Generalization in Dialog State Tracking through Generative Question Answering | Jan 20, 2021 | dialog state trackingDomain Adaptation | —Unverified | 0 | 0 |
| Zero-Shot Generalization of Vision-Based RL Without Data Augmentation | Oct 9, 2024 | Data AugmentationDisentanglement | —Unverified | 0 | 0 |
| Zero-Shot Monocular Scene Flow Estimation in the Wild | Jan 17, 2025 | Depth EstimationPrediction | —Unverified | 0 | 0 |
| Zero-Shot Object-Centric Representation Learning | Aug 17, 2024 | ObjectObject Discovery | —Unverified | 0 | 0 |
| Zero-Shot Trajectory Planning for Signal Temporal Logic Tasks | Jan 23, 2025 | Trajectory PlanningZero-shot Generalization | —Unverified | 0 | 0 |
| ZeroVO: Visual Odometry with Minimal Assumptions | Jun 9, 2025 | Autonomous DrivingCamera Calibration | —Unverified | 0 | 0 |