| Book2Movie: Aligning Video Scenes With Book Chapters | Jun 1, 2015 | Video Alignment | —Unverified | 0 | 0 |
| ContentCTR: Frame-level Live Streaming Click-Through Rate Prediction with Multimodal Transformer | Jun 26, 2023 | Click-Through Rate PredictionDynamic Time Warping | —Unverified | 0 | 0 |
| DAPE: Dual-Stage Parameter-Efficient Fine-Tuning for Consistent Video Editing with Diffusion Models | May 11, 2025 | parameter-efficient fine-tuningVideo Alignment | —Unverified | 0 | 0 |
| DyST-XL: Dynamic Layout Planning and Content Control for Compositional Text-to-Video Generation | Apr 21, 2025 | AttributeDenoising | —Unverified | 0 | 0 |
| FastVideoEdit: Leveraging Consistency Models for Efficient Text-to-Video Editing | Mar 10, 2024 | Image GenerationText-to-Video Editing | —Unverified | 0 | 0 |
| Frequency-aware Event-based Video Deblurring for Real-World Motion Blur | Jan 1, 2024 | DeblurringVideo Alignment | —Unverified | 0 | 0 |
| Improving Dynamic Object Interactions in Text-to-Video Generation with AI Feedback | Dec 3, 2024 | ObjectOffline RL | —Unverified | 0 | 0 |
| Koala-36M: A Large-scale Video Dataset Improving Consistency between Fine-grained Conditions and Video Content | Oct 10, 2024 | Video AlignmentVideo Generation | —Unverified | 0 | 0 |
| Learning by Aligning 2D Skeleton Sequences and Multi-Modality Fusion | May 31, 2023 | RetrievalSelf-Supervised Learning | —Unverified | 0 | 0 |
| Learning by Aligning Videos in Time | Mar 31, 2021 | Representation LearningRetrieval | —Unverified | 0 | 0 |
| Learning Robust Video Synchronization without Annotations | Oct 19, 2016 | Video AlignmentVideo Synchronization | —Unverified | 0 | 0 |
| Learning to Align Images using Weak Geometric Supervision | Aug 4, 2018 | Video Alignment | —Unverified | 0 | 0 |
| Learning to Ground Instructional Articles in Videos through Narrations | Jun 6, 2023 | ArticlesVideo Alignment | —Unverified | 0 | 0 |
| Learning to Localize Actions in Instructional Videos with LLM-Based Multi-Pathway Text-Video Alignment | Sep 22, 2024 | Contrastive Learningcross-modal alignment | —Unverified | 0 | 0 |
| Learning to Predict Activity Progress by Self-Supervised Video Alignment | Jan 1, 2024 | Representation LearningVideo Alignment | —Unverified | 0 | 0 |
| STELLA: Continual Audio-Video Pre-training with Spatio-Temporal Localized Alignment | Oct 12, 2023 | Continual LearningRepresentation Learning | —Unverified | 0 | 0 |
| Normalized Human Pose Features for Human Action Video Alignment | Jan 1, 2021 | Action RecognitionMetric Learning | —Unverified | 0 | 0 |
| PIDRo: Parallel Isomeric Attention with Dynamic Routing for Text-Video Retrieval | Jan 1, 2023 | Representation LearningRetrieval | —Unverified | 0 | 0 |
| Road Detection via On--line Label Transfer | Dec 10, 2014 | Pedestrian Detectionvalid | —Unverified | 0 | 0 |
| Scaling Up Video Summarization Pretraining with Large Language Models | Apr 4, 2024 | Video AlignmentVideo Summarization | —Unverified | 0 | 0 |
| Self-supervised and Weakly Supervised Contrastive Learning for Frame-wise Action Representations | Dec 6, 2022 | Action ClassificationContrastive Learning | —Unverified | 0 | 0 |
| Semantic GUI Scene Learning and Video Alignment for Detecting Duplicate Video-based Bug Reports | Jul 11, 2024 | Video Alignment | —Unverified | 0 | 0 |
| Shot-by-Shot Movie Version Comparison | Dec 1, 2018 | Video Alignment | —Unverified | 0 | 0 |
| Shuffle and Learn: Unsupervised Learning using Temporal Order Verification | Mar 28, 2016 | Action RecognitionPose Estimation | —Unverified | 0 | 0 |
| Smooth-Foley: Creating Continuous Sound for Video-to-Audio Generation Under Semantic Guidance | Dec 24, 2024 | Audio GenerationVideo Alignment | —Unverified | 0 | 0 |