SOTAVerified

Video Alignment

Papers

Showing 2650 of 83 papers

TitleStatusHype
Benchmarking Multi-dimensional AIGC Video Quality Assessment: A Dataset and Unified ModelCode0
A Comprehensive Review of Few-shot Action Recognition0
Semantic GUI Scene Learning and Video Alignment for Detecting Duplicate Video-based Bug Reports0
MiraData: A Large-Scale Video Dataset with Long Durations and Structured CaptionsCode4
Align and Aggregate: Compositional Reasoning with Video Alignment and Answer Aggregation for Video Question-Answering0
FoleyCrafter: Bring Silent Videos to Life with Lifelike and Synchronized SoundsCode4
SafeSora: Towards Safety Alignment of Text2Video Generation via a Human Preference DatasetCode1
Listen Then See: Video Alignment with Speaker AttentionCode0
AniClipart: Clipart Animation with Text-to-Video Priors0
Scaling Up Video Summarization Pretraining with Large Language Models0
The Effects of Short Video-Sharing Services on Video Copy Detection0
CoCoCo: Improving Text-Guided Video Inpainting for Better Consistency, Controllability and CompatibilityCode3
Subjective-Aligned Dataset and Metric for Text-to-Video Quality AssessmentCode1
FastVideoEdit: Leveraging Consistency Models for Efficient Text-to-Video Editing0
Towards A Better Metric for Text-to-Video Generation0
AIGCBench: Comprehensive Evaluation of Image-to-Video Content Generated by AICode2
Frequency-aware Event-based Video Deblurring for Real-World Motion Blur0
Learning to Predict Activity Progress by Self-Supervised Video Alignment0
EvalCrafter: Benchmarking and Evaluating Large Video Generation ModelsCode1
STELLA: Continual Audio-Video Pre-training with Spatio-Temporal Localized Alignment0
Show-1: Marrying Pixel and Latent Diffusion Models for Text-to-Video GenerationCode3
Audio-Enhanced Text-to-Video Retrieval using Text-Conditioned Feature Alignment0
A Solution to CVPR'2023 AQTC Challenge: Video Alignment for Multi-Step InferenceCode0
ContentCTR: Frame-level Live Streaming Click-Through Rate Prediction with Multimodal Transformer0
Seeing the Pose in the Pixels: Learning Pose-Aware Representations in Vision TransformersCode1
Show:102550
← PrevPage 2 of 4Next →

No leaderboard results yet.