SOTAVerified

Video Alignment

Papers

Showing 5183 of 83 papers

TitleStatusHype
Normalized Human Pose Features for Human Action Video Alignment0
PIDRo: Parallel Isomeric Attention with Dynamic Routing for Text-Video Retrieval0
Road Detection via On--line Label Transfer0
Scaling Up Video Summarization Pretraining with Large Language Models0
Self-supervised and Weakly Supervised Contrastive Learning for Frame-wise Action Representations0
Semantic GUI Scene Learning and Video Alignment for Detecting Duplicate Video-based Bug Reports0
Shot-by-Shot Movie Version Comparison0
Shuffle and Learn: Unsupervised Learning using Temporal Order Verification0
Smooth-Foley: Creating Continuous Sound for Video-to-Audio Generation Under Semantic Guidance0
Sync from the Sea: Retrieving Alignable Videos from Large-Scale Datasets0
Teaching Machines to Understand Baseball Games: Large-Scale Baseball Video Database for Multiple Video Understanding Tasks0
The Effects of Short Video-Sharing Services on Video Copy Detection0
Towards A Better Metric for Text-to-Video Generation0
ACCURATE METHOD OF TEMPORAL-SHIFT ESTIMATION FOR 3D VIDEO0
Video alignment using unsupervised learning of local and global features0
VideoRepair: Improving Text-to-Video Generation via Misalignment Evaluation and Localized Refinement0
Weakly-supervised Representation Learning for Video Alignment and Analysis0
Aligning Step-by-Step Instructional Diagrams to Video DemonstrationsCode0
Temporal Cycle-Consistency LearningCode0
Neuro-Symbolic Evaluation of Text-to-Video Models using Formal VerificationCode0
View-Invariant, Occlusion-Robust Probabilistic Embedding for Human PoseCode0
Listen Then See: Video Alignment with Speaker AttentionCode0
Dynamic Temporal Alignment of Speech to LipsCode0
Self-Supervised Contrastive Learning for Videos using Differentiable Local AlignmentCode0
View-Invariant Probabilistic Embedding for Human PoseCode0
A Solution to CVPR'2023 AQTC Challenge: Video Alignment for Multi-Step InferenceCode0
Learning from Video and Text via Large-Scale Discriminative ClusteringCode0
Deep Understanding of Sign Language for Sign to Subtitle AlignmentCode0
Adversarial Skill Networks: Unsupervised Robot Skill Learning from VideoCode0
Sound Bridge: Associating Egocentric and Exocentric Videos via Audio CuesCode0
LAMV: Learning to Align and Match Videos With Kernelized Temporal LayersCode0
Benchmarking Multi-dimensional AIGC Video Quality Assessment: A Dataset and Unified ModelCode0
Edit As You Wish: Video Caption Editing with Multi-grained User ControlCode0
Show:102550
← PrevPage 2 of 2Next →

No leaderboard results yet.