| Transferable and Distributed User Association Policies for 5G and Beyond Networks | Jun 4, 2021 | ManagementMulti-agent Reinforcement Learning | —Unverified | 0 |
| Quantifying uncertainty in lung cancer segmentation with foundation models applied to mixed-domain datasets | Mar 19, 2024 | Computed Tomography (CT)Segmentation | —Unverified | 0 |
| Turbocharging Solution Concepts: Solving NEs, CEs and CCEs with Neural Equilibrium Solvers | Oct 17, 2022 | Zero-shot Generalization | —Unverified | 0 |
| Unifying Few- and Zero-Shot Egocentric Action Recognition | May 27, 2020 | Action RecognitionBenchmarking | —Unverified | 0 |
| Self-Supervised Monocular 4D Scene Reconstruction for Egocentric Videos | Nov 14, 2024 | 4D reconstructionSelf-Supervised Learning | —Unverified | 0 |
| UniIR: Training and Benchmarking Universal Multimodal Information Retrievers | Nov 28, 2023 | BenchmarkingInformation Retrieval | —Unverified | 0 |
| Unpaired Object-Level SAR-to-Optical Image Translation for Aircraft with Keypoints-Guided Diffusion Models | Mar 25, 2025 | TranslationZero-shot Generalization | —Unverified | 0 |
| Unsupervised Discovery of Object-Centric Neural Fields | Feb 12, 2024 | ObjectObject Discovery | —Unverified | 0 |
| Unsupervised Prompt Tuning for Text-Driven Object Detection | Jan 1, 2023 | Data AugmentationObject | —Unverified | 0 |
| UTSD: Unified Time Series Diffusion Model | Dec 4, 2024 | Denoisingmodel | —Unverified | 0 |
| Value Function Spaces: Skill-Centric State Abstractions for Long-Horizon Reasoning | Nov 4, 2021 | Hierarchical Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Video Event Reasoning and Prediction by Fusing World Knowledge from LLMs with Vision Foundation Models | Jul 8, 2025 | Future predictionLarge Language Model | —Unverified | 0 |
| Visatronic: A Multimodal Decoder-Only Model for Speech Synthesis | Nov 26, 2024 | Decodermultimodal generation | —Unverified | 0 |
| VisLanding: Monocular 3D Perception for UAV Safe Landing via Depth-Normal Synergy | Jun 17, 2025 | Decision MakingSemantic Segmentation | —Unverified | 0 |
| Visual Image Reconstruction from Brain Activity via Latent Representation | May 13, 2025 | Early ClassificationImage Reconstruction | —Unverified | 0 |
| ViTaPEs: Visuotactile Position Encodings for Cross-Modal Alignment in Multimodal Transformers | May 26, 2025 | cross-modal alignmentPosition | —Unverified | 0 |
| VQ-AR: Vector Quantized Autoregressive Probabilistic Time Series Forecasting | May 31, 2022 | Decision MakingInductive Bias | —Unverified | 0 |
| WeLM: A Well-Read Pre-trained Language Model for Chinese | Sep 21, 2022 | Language ModelingLanguage Modelling | —Unverified | 0 |
| What Matters for Model Merging at Scale? | Oct 4, 2024 | modelTask Arithmetic | —Unverified | 0 |
| What Matters to You? Towards Visual Representation Alignment for Robot Learning | Oct 11, 2023 | Zero-shot Generalization | —Unverified | 0 |
| WHISTRESS: Enriching Transcriptions with Sentence Stress Detection | May 25, 2025 | SentenceZero-shot Generalization | —Unverified | 0 |
| WiFo: Wireless Foundation Model for Channel Prediction | Dec 12, 2024 | modelMulti-Task Learning | —Unverified | 0 |
| Words in Motion: Extracting Interpretable Control Vectors for Motion Transformers | Jun 17, 2024 | Motion ForecastingZero-shot Generalization | —Unverified | 0 |
| WRT-SAM: Foundation Model-Driven Segmentation for Generalized Weld Radiographic Testing | Feb 17, 2025 | Anomaly DetectionImage Segmentation | —Unverified | 0 |
| ZeroPrompt: Scaling Prompt-Based Pretraining to 1,000 Tasks Improves Zero-Shot Generalization | Jan 18, 2022 | Zero-shot GeneralizationZero-Shot Learning | —Unverified | 0 |