SOTAVerified

cross-modal alignment

Papers

Showing 5160 of 342 papers

TitleStatusHype
AlignVSR: Audio-Visual Cross-Modal Alignment for Visual Speech RecognitionCode1
LESS: Label-Efficient and Single-Stage Referring 3D SegmentationCode1
Boosting Masked ECG-Text Auto-Encoders as Discriminative LearnersCode1
MaPPER: Multimodal Prior-guided Parameter Efficient Tuning for Referring Expression ComprehensionCode1
Beyond Uncertainty: Evidential Deep Learning for Robust Video Temporal GroundingCode1
A Survey on Facial Expression Recognition of Static and Dynamic EmotionsCode1
Advancing Multi-grained Alignment for Contrastive Language-Audio Pre-trainingCode1
Aligning Sight and Sound: Advanced Sound Source Localization Through Audio-Visual AlignmentCode1
Distractors-Immune Representation Learning with Cross-modal Contrastive Regularization for Change CaptioningCode1
Towards Bridging the Cross-modal Semantic Gap for Multi-modal RecommendationCode1
Show:102550
← PrevPage 6 of 35Next →

No leaderboard results yet.