SOTAVerified

Video Alignment

Papers

Showing 26–50 of 83 papers

| Title | Status | Hype |
| Learning Viewpoint-Agnostic Visual Representations by Recovering Tokens in 3D Space | Code | 1 |
| Frame-wise Action Representations for Long Videos via Sequence Contrastive Learning | Code | 1 |
| Time-Contrastive Networks: Self-Supervised Learning from Video | Code | 1 |
| Audio-Sync Video Generation with Multi-Stream Temporal Control | | 0 |
| DAPE: Dual-Stage Parameter-Efficient Fine-Tuning for Consistent Video Editing with Diffusion Models | | 0 |
| DyST-XL: Dynamic Layout Planning and Content Control for Compositional Text-to-Video Generation | | 0 |
| Deep Understanding of Sign Language for Sign to Subtitle Alignment | Code | 0 |
| Sound Bridge: Associating Egocentric and Exocentric Videos via Audio Cues | Code | 0 |
| Smooth-Foley: Creating Continuous Sound for Video-to-Audio Generation Under Semantic Guidance | | 0 |
| Improving Dynamic Object Interactions in Text-to-Video Generation with AI Feedback | | 0 |
| VideoRepair: Improving Text-to-Video Generation via Misalignment Evaluation and Localized Refinement | | 0 |
| Neuro-Symbolic Evaluation of Text-to-Video Models using Formal Verification | Code | 0 |
| Koala-36M: A Large-scale Video Dataset Improving Consistency between Fine-grained Conditions and Video Content | | 0 |
| Learning to Localize Actions in Instructional Videos with LLM-Based Multi-Pathway Text-Video Alignment | | 0 |
| Self-Supervised Contrastive Learning for Videos using Differentiable Local Alignment | Code | 0 |
| Sync from the Sea: Retrieving Alignable Videos from Large-Scale Datasets | | 0 |
| Benchmarking Multi-dimensional AIGC Video Quality Assessment: A Dataset and Unified Model | Code | 0 |
| A Comprehensive Review of Few-shot Action Recognition | | 0 |
| Semantic GUI Scene Learning and Video Alignment for Detecting Duplicate Video-based Bug Reports | | 0 |
| Align and Aggregate: Compositional Reasoning with Video Alignment and Answer Aggregation for Video Question-Answering | | 0 |
| Listen Then See: Video Alignment with Speaker Attention | Code | 0 |
| AniClipart: Clipart Animation with Text-to-Video Priors | | 0 |
| Scaling Up Video Summarization Pretraining with Large Language Models | | 0 |
| The Effects of Short Video-Sharing Services on Video Copy Detection | | 0 |
| FastVideoEdit: Leveraging Consistency Models for Efficient Text-to-Video Editing | | 0 |

No leaderboard results yet.