SOTAVerified

cross-modal alignment

Papers

Showing 251260 of 342 papers

TitleStatusHype
TANGO: Co-Speech Gesture Video Reenactment with Hierarchical Audio Motion Embedding and Diffusion Interpolation0
TAViS: Text-bridged Audio-Visual Segmentation with Foundation Models0
Temporal Order Preserved Optimal Transport-based Cross-modal Knowledge Transfer Learning for ASR0
Tiny-Align: Bridging Automatic Speech Recognition and Large Language Model on the Edge0
TMCIR: Token Merge Benefits Composed Image Retrieval0
TokenFlow: Rethinking Fine-grained Cross-modal Alignment in Vision-Language Retrieval0
TOT: Topology-Aware Optimal Transport For Multimodal Hate Detection0
Towards High-Fidelity Text-Guided 3D Face Generation and Manipulation Using only Images0
Towards LLM-Centric Multimodal Fusion: A Survey on Integration Strategies and Techniques0
Transformer-based Spatial Grounding: A Comprehensive Survey0
Show:102550
← PrevPage 26 of 35Next →

No leaderboard results yet.