| Learning to Ground Instructional Articles in Videos through Narrations | Jun 6, 2023 | Articles, Video Alignment | Unverified | 0 |
| Learning by Aligning 2D Skeleton Sequences and Multi-Modality Fusion | May 31, 2023 | Retrieval, Self-Supervised Learning | Unverified | 0 |
| Swap Attention in Spatiotemporal Diffusions for Text-to-Video Generation | May 18, 2023 | Image Generation, Text to Image Generation | Code Available | 1 |
| Edit As You Wish: Video Caption Editing with Multi-grained User Control | May 15, 2023 | Attribute, Position | Code Available | 0 |
| Video alignment using unsupervised learning of local and global features | Apr 13, 2023 | Dynamic Time Warping, Human Detection | Unverified | 0 |
| Zero-Shot Video Editing Using Off-The-Shelf Image Diffusion Models | Mar 30, 2023 | Video Alignment, Video Editing | Code Available | 2 |
| Aligning Step-by-Step Instructional Diagrams to Video Demonstrations | Mar 24, 2023 | Contrastive Learning, Image Retrieval | Code Available | 0 |
| VADER: Video Alignment Differencing and Retrieval | Mar 23, 2023 | Misinformation, Retrieval | Unverified | 0 |
| Weakly Supervised Video Representation Learning with Unaligned Text for Sequential Videos | Mar 22, 2023 | Representation Learning, Sentence | Code Available | 1 |
| Weakly-supervised Representation Learning for Video Alignment and Analysis | Feb 8, 2023 | Representation Learning, Video Alignment | Unverified | 0 |
| PIDRo: Parallel Isomeric Attention with Dynamic Routing for Text-Video Retrieval | Jan 1, 2023 | Representation Learning, Retrieval | Unverified | 0 |
| Self-supervised and Weakly Supervised Contrastive Learning for Frame-wise Action Representations | Dec 6, 2022 | Action Classification, Contrastive Learning | Unverified | 0 |
| Learning a Grammar Inducer from Massive Uncurated Instructional Videos | Oct 22, 2022 | Language Acquisition, Video Alignment | Code Available | 1 |
| Learning Viewpoint-Agnostic Visual Representations by Recovering Tokens in 3D Space | Jun 23, 2022 | Action Recognition, Image Classification | Code Available | 1 |
| Frame-wise Action Representations for Long Videos via Sequence Contrastive Learning | Mar 28, 2022 | Action Classification, Contrastive Learning | Code Available | 1 |
| Learning by Aligning Videos in Time | Mar 31, 2021 | Representation Learning, Retrieval | Unverified | 0 |
| Normalized Human Pose Features for Human Action Video Alignment | Jan 1, 2021 | Action Recognition, Metric Learning | Unverified | 0 |
| View-Invariant, Occlusion-Robust Probabilistic Embedding for Human Pose | Oct 23, 2020 | 3D Pose Estimation, Action Recognition | Code Available | 0 |
| View-Invariant Probabilistic Embedding for Human Pose | Dec 2, 2019 | Action Recognition, Pose Retrieval | Code Available | 0 |
| Adversarial Skill Networks: Unsupervised Robot Skill Learning from Video | Oct 21, 2019 | Continuous Control | Code Available | 0 |
| Temporal Cycle-Consistency Learning | Apr 16, 2019 | Anomaly Detection, Representation Learning | Code Available | 0 |
| Shot-by-Shot Movie Version Comparison | Dec 1, 2018 | Video Alignment | Unverified | 0 |
| Teaching Machines to Understand Baseball Games: Large-Scale Baseball Video Database for Multiple Video Understanding Tasks | Sep 1, 2018 | Video Alignment, Video Recognition | Unverified | 0 |
| Dynamic Temporal Alignment of Speech to Lips | Aug 19, 2018 | Constrained Lip-synchronization, Video Alignment | Code Available | 0 |
| Learning to Align Images using Weak Geometric Supervision | Aug 4, 2018 | Video Alignment | Unverified | 0 |