| Towards A Better Metric for Text-to-Video Generation | Jan 15, 2024 | Mixture-of-Experts, Text-to-Video Generation | Unverified | 0 |
| Learning to Predict Activity Progress by Self-Supervised Video Alignment | Jan 1, 2024 | Representation Learning, Video Alignment | Unverified | 0 |
| Frequency-aware Event-based Video Deblurring for Real-World Motion Blur | Jan 1, 2024 | Deblurring, Video Alignment | Unverified | 0 |
| STELLA: Continual Audio-Video Pre-training with Spatio-Temporal Localized Alignment | Oct 12, 2023 | Continual Learning, Representation Learning | Unverified | 0 |
| Audio-Enhanced Text-to-Video Retrieval using Text-Conditioned Feature Alignment | Jul 24, 2023 | Retrieval, Text to Video Retrieval | Unverified | 0 |
| A Solution to CVPR'2023 AQTC Challenge: Video Alignment for Multi-Step Inference | Jun 26, 2023 | Video Alignment | Code Available | 0 |
| ContentCTR: Frame-level Live Streaming Click-Through Rate Prediction with Multimodal Transformer | Jun 26, 2023 | Click-Through Rate Prediction, Dynamic Time Warping | Unverified | 0 |
| Learning to Ground Instructional Articles in Videos through Narrations | Jun 6, 2023 | Articles, Video Alignment | Unverified | 0 |
| Learning by Aligning 2D Skeleton Sequences and Multi-Modality Fusion | May 31, 2023 | Retrieval, Self-Supervised Learning | Unverified | 0 |
| Edit As You Wish: Video Caption Editing with Multi-grained User Control | May 15, 2023 | Attribute, Position | Code Available | 0 |