Video as the New Language for Real-World Decision Making Feb 27, 2024 Decision Making In-Context Learning
— Unverified 00 Video-Foley: Two-Stage Video-To-Sound Generation via Temporal Event Condition For Foley Sound Aug 21, 2024 Audio Generation Audio Synthesis
— Unverified 00 Video Jigsaw: Unsupervised Learning of Spatiotemporal Context for Video Action Recognition Aug 22, 2018 Action Recognition Activity Recognition
— Unverified 00 Video Representation Learning by Recognizing Temporal Transformations Jul 21, 2020 Action Recognition Representation Learning
— Unverified 00 Video Transformers: A Survey Jan 16, 2022 Action Classification Self-Supervised Learning
— Unverified 00 Video Understanding as Machine Translation Jun 12, 2020 Machine Translation Metric Learning
— Unverified 00 Video-XL-Pro: Reconstructive Token Compression for Extremely Long Video Understanding Mar 24, 2025 8k GPU
— Unverified 00 VieSum: How Robust Are Transformer-based Models on Vietnamese Summarization? Oct 8, 2021 Abstractive Text Summarization Decoder
— Unverified 00 VietASR: Achieving Industry-level Vietnamese ASR with 50-hour labeled data and Large-Scale Speech Pretraining May 23, 2025 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 00 ViewMix: Augmentation for Robust Representation in Self-Supervised Learning Sep 6, 2023 Representation Learning Self-Supervised Learning
— Unverified 00 ViewNet: Unsupervised Viewpoint Estimation from Conditional Generation Dec 1, 2022 Image Reconstruction Self-Supervised Learning
— Unverified 00 Views Can Be Deceiving: Improved SSL Through Feature Space Augmentation May 28, 2024 Representation Learning Self-Supervised Learning
— Unverified 00 VIGraph: Generative Self-supervised Learning for Class-Imbalanced Node Classification Nov 2, 2023 Contrastive Learning Node Classification
— Unverified 00 Vi-MIX FOR SELF-SUPERVISED VIDEO REPRESENTATION Sep 29, 2021 Action Recognition Representation Learning
— Unverified 00 Virtual Node Generation for Node Classification in Sparsely-Labeled Graphs Sep 12, 2024 Graph Learning Meta-Learning
— Unverified 00 Visible and infrared self-supervised fusion trained on a single example Jul 9, 2023 object-detection Object Detection
— Unverified 00 Vision-Language Modeling with Regularized Spatial Transformer Networks for All Weather Crosswind Landing of Aircraft May 9, 2024 All Language Modeling
— Unverified 00 Vision Learners Meet Web Image-Text Pairs Jan 17, 2023 Benchmarking Self-Supervised Learning
— Unverified 00 Vision Models Are More Robust And Fair When Pretrained On Uncurated Images Without Supervision Feb 16, 2022 Action Classification Action Recognition
— Unverified 00 Vision Transformers: State of the Art and Research Challenges Jul 7, 2022 3D Reconstruction Image Segmentation
— Unverified 00 Visual Lexicon: Rich Image Features in Language Space Dec 9, 2024 Image Generation Image Reconstruction
— Unverified 00 Visually Guided Self Supervised Learning of Speech Representations Jan 13, 2020 Emotion Recognition Representation Learning
— Unverified 00 Visual Representation Learning with Stochastic Frame Prediction Jun 11, 2024 Decoder Pose Tracking
— Unverified 00 Visuomotor Control in Multi-Object Scenes Using Object-Aware Representations May 12, 2022 Object Object Localization
— Unverified 00 ViTAR: Vision Transformer with Any Resolution Mar 27, 2024 Self-Supervised Learning Semantic Segmentation
— Unverified 00 ViT-TTS: Visual Text-to-Speech with Scalable Diffusion Transformer May 22, 2023 Decoder Denoising
— Unverified 00 VLMs-Guided Representation Distillation for Efficient Vision-Based Reinforcement Learning Jan 1, 2025 Decision Making reinforcement-learning
— Unverified 00 VOODOO 3D: Volumetric Portrait Disentanglement for One-Shot 3D Head Reenactment Dec 7, 2023 Disentanglement Self-Supervised Learning
— Unverified 00 VRMM: A Volumetric Relightable Morphable Head Model Feb 6, 2024 3D Face Reconstruction Face Reconstruction
— Unverified 00 Watching Too Much Television is Good: Self-Supervised Audio-Visual Representation Learning from Movies and TV Shows Jun 16, 2021 Contrastive Learning Representation Learning
— Unverified 00 Wav2code: Restore Clean Speech Representations via Codebook Lookup for Noise-Robust ASR Apr 11, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 00 Wav2Vec-Aug: Improved self-supervised training with limited data Jun 27, 2022 Data Augmentation Self-Supervised Learning
— Unverified 00 Wav2vec-C: A Self-supervised Model for Speech Representation Learning Mar 9, 2021 Quantization Representation Learning
— Unverified 00 Wav2vec-Switch: Contrastive Learning from Original-noisy Speech Pairs for Robust Speech Recognition Oct 11, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 00 Wavelet-Driven Masked Image Modeling: A Path to Efficient Visual Representation Mar 2, 2025 Representation Learning Self-Supervised Learning
— Unverified 00 WavFT: Acoustic model finetuning with labelled and unlabelled data Apr 1, 2022 Self-Supervised Learning
— Unverified 00 Weakly Augmented Variational Autoencoder in Time Series Anomaly Detection Jan 7, 2024 Anomaly Detection Self-Supervised Learning
— Unverified 00 Weakly Supervised 3D Human Pose and Shape Reconstruction with Normalizing Flows Mar 23, 2020 3D human pose and shape estimation 3D Human Pose Estimation
— Unverified 00 Weakly Supervised Class-Agnostic Motion Prediction for Autonomous Driving Jan 1, 2023 Autonomous Driving motion prediction
— Unverified 00 Weakly-Supervised Speech Pre-training: A Case Study on Target Speech Recognition May 25, 2023 Denoising Self-Supervised Learning
— Unverified 00 Weakly-Supervised Surgical Phase Recognition Oct 26, 2023 Few-Shot Learning Self-Supervised Learning
— Unverified 00 WeakSTIL: Weak whole-slide image level stromal tumor infiltrating lymphocyte scores are all you need Sep 13, 2021 All Decision Making
— Unverified 00 Wearable Accelerometer Foundation Models for Health via Knowledge Distillation Dec 15, 2024 Activity Recognition cross-modal alignment
— Unverified 00 Wearable-Based Real-time Freezing of Gait Detection in Parkinson's Disease Using Self-Supervised Learning Oct 8, 2024 Self-Supervised Learning
— Unverified 00 Wearable data from subjects playing Super Mario, sitting university exams, or performing physical exercise help detect acute mood episodes via self-supervised learning Nov 7, 2023 Body Detection Emotion Recognition
— Unverified 00 WeedCLR: Weed Contrastive Learning through Visual Representations with Class-Optimized Loss in Long-Tailed Datasets Oct 19, 2023 Contrastive Learning image-classification
— Unverified 00 WeedNet: A Foundation Model-Based Global-to-Local AI Approach for Real-Time Weed Species Identification and Classification May 25, 2025 Self-Supervised Learning
— Unverified 00 Weighted Ensemble Self-Supervised Learning Nov 18, 2022 Diversity Self-Supervised Learning
— Unverified 00 WeLM: A Well-Read Pre-trained Language Model for Chinese Sep 21, 2022 Language Modeling Language Modelling
— Unverified 00 WERank: Towards Rank Degradation Prevention for Self-Supervised Learning Using Weight Regularization Feb 14, 2024 Data Augmentation Self-Supervised Learning