| TMCIR: Token Merge Benefits Composed Image Retrieval | Apr 15, 2025 | Contrastive Learningcross-modal alignment | —Unverified | 0 | 0 |
| TokenFlow: Rethinking Fine-grained Cross-modal Alignment in Vision-Language Retrieval | Sep 28, 2022 | cross-modal alignmentRetrieval | —Unverified | 0 | 0 |
| TOT: Topology-Aware Optimal Transport For Multimodal Hate Detection | Feb 27, 2023 | cross-modal alignment | —Unverified | 0 | 0 |
| Towards High-Fidelity Text-Guided 3D Face Generation and Manipulation Using only Images | Aug 31, 2023 | 3D Shape GenerationContrastive Learning | —Unverified | 0 | 0 |
| Towards LLM-Centric Multimodal Fusion: A Survey on Integration Strategies and Techniques | Jun 5, 2025 | cross-modal alignmentLarge Language Model | —Unverified | 0 | 0 |
| Transformer-based Spatial Grounding: A Comprehensive Survey | Jul 17, 2025 | cross-modal alignmentSurvey | —Unverified | 0 | 0 |
| Translation, Scale and Rotation: Cross-Modal Alignment Meets RGB-Infrared Vehicle Detection | Sep 28, 2022 | 2D Object Detectioncross-modal alignment | —Unverified | 0 | 0 |
| TSDASeg: A Two-Stage Model with Direct Alignment for Interactive Point Cloud Segmentation | Jun 26, 2025 | cross-modal alignmentInteractive Segmentation | —Unverified | 0 | 0 |
| TS-HTFA: Advancing Time Series Forecasting via Hierarchical Text-Free Alignment with Large Language Models | Sep 23, 2024 | Contrastive Learningcross-modal alignment | —Unverified | 0 | 0 |
| UniCUE: Unified Recognition and Generation Framework for Chinese Cued Speech Video-to-Speech Generation | Jun 4, 2025 | cross-modal alignmentLipreading | —Unverified | 0 | 0 |