SOTAVerified

Video Alignment

Papers

Showing 5183 of 83 papers

TitleStatusHype
Book2Movie: Aligning Video Scenes With Book Chapters0
ContentCTR: Frame-level Live Streaming Click-Through Rate Prediction with Multimodal Transformer0
DAPE: Dual-Stage Parameter-Efficient Fine-Tuning for Consistent Video Editing with Diffusion Models0
DyST-XL: Dynamic Layout Planning and Content Control for Compositional Text-to-Video Generation0
FastVideoEdit: Leveraging Consistency Models for Efficient Text-to-Video Editing0
Frequency-aware Event-based Video Deblurring for Real-World Motion Blur0
Improving Dynamic Object Interactions in Text-to-Video Generation with AI Feedback0
Koala-36M: A Large-scale Video Dataset Improving Consistency between Fine-grained Conditions and Video Content0
Learning by Aligning 2D Skeleton Sequences and Multi-Modality Fusion0
Learning by Aligning Videos in Time0
Learning Robust Video Synchronization without Annotations0
Learning to Align Images using Weak Geometric Supervision0
Learning to Ground Instructional Articles in Videos through Narrations0
Learning to Localize Actions in Instructional Videos with LLM-Based Multi-Pathway Text-Video Alignment0
Learning to Predict Activity Progress by Self-Supervised Video Alignment0
STELLA: Continual Audio-Video Pre-training with Spatio-Temporal Localized Alignment0
Normalized Human Pose Features for Human Action Video Alignment0
PIDRo: Parallel Isomeric Attention with Dynamic Routing for Text-Video Retrieval0
Road Detection via On--line Label Transfer0
Scaling Up Video Summarization Pretraining with Large Language Models0
Self-supervised and Weakly Supervised Contrastive Learning for Frame-wise Action Representations0
Semantic GUI Scene Learning and Video Alignment for Detecting Duplicate Video-based Bug Reports0
Shot-by-Shot Movie Version Comparison0
Shuffle and Learn: Unsupervised Learning using Temporal Order Verification0
Smooth-Foley: Creating Continuous Sound for Video-to-Audio Generation Under Semantic Guidance0
Sync from the Sea: Retrieving Alignable Videos from Large-Scale Datasets0
Teaching Machines to Understand Baseball Games: Large-Scale Baseball Video Database for Multiple Video Understanding Tasks0
The Effects of Short Video-Sharing Services on Video Copy Detection0
Towards A Better Metric for Text-to-Video Generation0
ACCURATE METHOD OF TEMPORAL-SHIFT ESTIMATION FOR 3D VIDEO0
Video alignment using unsupervised learning of local and global features0
VideoRepair: Improving Text-to-Video Generation via Misalignment Evaluation and Localized Refinement0
Weakly-supervised Representation Learning for Video Alignment and Analysis0
Show:102550
← PrevPage 2 of 2Next →

No leaderboard results yet.