| AIGCBench: Comprehensive Evaluation of Image-to-Video Content Generated by AI | Jan 3, 2024 | Video AlignmentVideo Generation | CodeCode Available | 2 |
| Frequency-aware Event-based Video Deblurring for Real-World Motion Blur | Jan 1, 2024 | DeblurringVideo Alignment | —Unverified | 0 |
| Learning to Predict Activity Progress by Self-Supervised Video Alignment | Jan 1, 2024 | Representation LearningVideo Alignment | —Unverified | 0 |
| EvalCrafter: Benchmarking and Evaluating Large Video Generation Models | Oct 17, 2023 | BenchmarkingLanguage Modelling | CodeCode Available | 1 |
| STELLA: Continual Audio-Video Pre-training with Spatio-Temporal Localized Alignment | Oct 12, 2023 | Continual LearningRepresentation Learning | —Unverified | 0 |
| Show-1: Marrying Pixel and Latent Diffusion Models for Text-to-Video Generation | Sep 27, 2023 | GPUText-to-Video Generation | CodeCode Available | 3 |
| Audio-Enhanced Text-to-Video Retrieval using Text-Conditioned Feature Alignment | Jul 24, 2023 | RetrievalText to Video Retrieval | —Unverified | 0 |
| A Solution to CVPR'2023 AQTC Challenge: Video Alignment for Multi-Step Inference | Jun 26, 2023 | Video Alignment | CodeCode Available | 0 |
| ContentCTR: Frame-level Live Streaming Click-Through Rate Prediction with Multimodal Transformer | Jun 26, 2023 | Click-Through Rate PredictionDynamic Time Warping | —Unverified | 0 |
| Seeing the Pose in the Pixels: Learning Pose-Aware Representations in Vision Transformers | Jun 15, 2023 | Action ClassificationAction Recognition | CodeCode Available | 1 |