| Towards A Better Metric for Text-to-Video Generation | Jan 15, 2024 | Mixture-of-ExpertsText-to-Video Generation | —Unverified | 0 |
| Learning to Predict Activity Progress by Self-Supervised Video Alignment | Jan 1, 2024 | Representation LearningVideo Alignment | —Unverified | 0 |
| Frequency-aware Event-based Video Deblurring for Real-World Motion Blur | Jan 1, 2024 | DeblurringVideo Alignment | —Unverified | 0 |
| STELLA: Continual Audio-Video Pre-training with Spatio-Temporal Localized Alignment | Oct 12, 2023 | Continual LearningRepresentation Learning | —Unverified | 0 |
| Audio-Enhanced Text-to-Video Retrieval using Text-Conditioned Feature Alignment | Jul 24, 2023 | RetrievalText to Video Retrieval | —Unverified | 0 |
| A Solution to CVPR'2023 AQTC Challenge: Video Alignment for Multi-Step Inference | Jun 26, 2023 | Video Alignment | CodeCode Available | 0 |
| ContentCTR: Frame-level Live Streaming Click-Through Rate Prediction with Multimodal Transformer | Jun 26, 2023 | Click-Through Rate PredictionDynamic Time Warping | —Unverified | 0 |
| Learning to Ground Instructional Articles in Videos through Narrations | Jun 6, 2023 | ArticlesVideo Alignment | —Unverified | 0 |
| Learning by Aligning 2D Skeleton Sequences and Multi-Modality Fusion | May 31, 2023 | RetrievalSelf-Supervised Learning | —Unverified | 0 |
| Edit As You Wish: Video Caption Editing with Multi-grained User Control | May 15, 2023 | AttributePosition | CodeCode Available | 0 |
| Video alignment using unsupervised learning of local and global features | Apr 13, 2023 | Dynamic Time WarpingHuman Detection | —Unverified | 0 |
| Aligning Step-by-Step Instructional Diagrams to Video Demonstrations | Mar 24, 2023 | Contrastive LearningImage Retrieval | CodeCode Available | 0 |
| VADER: Video Alignment Differencing and Retrieval | Mar 23, 2023 | MisinformationRetrieval | —Unverified | 0 |
| Weakly-supervised Representation Learning for Video Alignment and Analysis | Feb 8, 2023 | Representation LearningVideo Alignment | —Unverified | 0 |
| PIDRo: Parallel Isomeric Attention with Dynamic Routing for Text-Video Retrieval | Jan 1, 2023 | Representation LearningRetrieval | —Unverified | 0 |
| Self-supervised and Weakly Supervised Contrastive Learning for Frame-wise Action Representations | Dec 6, 2022 | Action ClassificationContrastive Learning | —Unverified | 0 |
| Learning by Aligning Videos in Time | Mar 31, 2021 | Representation LearningRetrieval | —Unverified | 0 |
| Normalized Human Pose Features for Human Action Video Alignment | Jan 1, 2021 | Action RecognitionMetric Learning | —Unverified | 0 |
| View-Invariant, Occlusion-Robust Probabilistic Embedding for Human Pose | Oct 23, 2020 | 3D Pose EstimationAction Recognition | CodeCode Available | 0 |
| View-Invariant Probabilistic Embedding for Human Pose | Dec 2, 2019 | Action RecognitionPose Retrieval | CodeCode Available | 0 |
| Adversarial Skill Networks: Unsupervised Robot Skill Learning from Video | Oct 21, 2019 | continuous-controlContinuous Control | CodeCode Available | 0 |
| Temporal Cycle-Consistency Learning | Apr 16, 2019 | Anomaly DetectionRepresentation Learning | CodeCode Available | 0 |
| Shot-by-Shot Movie Version Comparison | Dec 1, 2018 | Video Alignment | —Unverified | 0 |
| Teaching Machines to Understand Baseball Games: Large-Scale Baseball Video Database for Multiple Video Understanding Tasks | Sep 1, 2018 | Video AlignmentVideo Recognition | —Unverified | 0 |
| Dynamic Temporal Alignment of Speech to Lips | Aug 19, 2018 | Constrained Lip-synchronizationVideo Alignment | CodeCode Available | 0 |