Switch-BERT: Learning to Model Multimodal Interactions by Switching Attention and Input Jun 25, 2023 Diversity Image-text Retrieval
— Unverified 0SyCoCa: Symmetrizing Contrastive Captioners with Attentive Masking for Multimodal Alignment Jan 4, 2024 Image Captioning image-classification
— Unverified 0Synopses of Movie Narratives: a Video-Language Dataset for Story Understanding Jan 16, 2022 Retrieval Text Retrieval
— Unverified 0Synopses of Movie Narratives: a Video-Language Dataset for Story Understanding Mar 11, 2022 Retrieval Text Retrieval
— Unverified 0Tagging before Alignment: Integrating Multi-Modal Tags for Video-Text Retrieval Jan 30, 2023 Language Modeling Language Modelling
— Unverified 0Tailoring Table Retrieval from a Field-aware Hybrid Matching Perspective Mar 4, 2025 Retrieval Sentence
— Unverified 0TEXT2TASTE: A Versatile Egocentric Vision System for Intelligent Reading Assistance Using Large Language Model Apr 14, 2024 Language Modeling Language Modelling
— Unverified 0Text-Adaptive Multiple Visual Prototype Matching for Video-Text Retrieval Sep 27, 2022 Cross-Modal Retrieval Retrieval
— Unverified 0Memory^3: Language Modeling with Explicit Memory Jul 1, 2024 Language Modeling Language Modelling
— Unverified 0Text Relatedness Based on a Word Thesaurus Jan 15, 2014 Clustering Retrieval
— Unverified 0Text Retrieval by Term Co-occurrences in a Query-based Vector Space Dec 1, 2016 Retrieval Sentence
— Unverified 0Text Retrieval for Language Learners: Graded Vocabulary vs. Open Learner Model Sep 1, 2021 Retrieval Text Retrieval
— Unverified 0The effects of having lists of synonyms on the performance of Afaan Oromo Text Retrieval system Mar 4, 2021 Information Retrieval Retrieval
— Unverified 0The style transformer with common knowledge optimization for image-text retrieval Mar 1, 2023 Image-text Retrieval Retrieval
— Unverified 0The Text Classification Pipeline: Starting Shallow going Deeper Dec 30, 2024 Classification Information Retrieval
— Unverified 0The VISIONE Video Search System: Exploiting Off-the-Shelf Text Search Engines for Large-Scale Video Retrieval Aug 6, 2020 Retrieval Text Retrieval
— Unverified 0TokenFlow: Rethinking Fine-grained Cross-modal Alignment in Vision-Language Retrieval Sep 28, 2022 cross-modal alignment Retrieval
— Unverified 0TOME: A Two-stage Approach for Model-based Retrieval May 18, 2023 Natural Questions Retrieval
— Unverified 0Towards a Visual-Language Foundation Model for Computational Pathology Jul 24, 2023 Contrastive Learning image-classification
— Unverified 0Towards Robust Ranker for Text Retrieval Jun 16, 2022 Passage Retrieval Reranking
— Unverified 0Towards Understanding Camera Motions in Any Video Apr 21, 2025 Question Answering Text Retrieval
— Unverified 0Transformation of XML Documents with Prolog Jun 19, 2019 Retrieval Text Retrieval
— Unverified 0Transformer Based Language Models for Similar Text Retrieval and Ranking May 10, 2020 Natural Language Queries Retrieval
— Unverified 0TRAttack”:" Text Rewriting Attack Against Text Retrieval May 1, 2022 Retrieval Text Retrieval
— Unverified 0TREC 2020 Podcasts Track Overview Mar 29, 2021 Information Retrieval Retrieval
— Unverified 0TSVC:Tripartite Learning with Semantic Variation Consistency for Robust Image-Text Retrieval Jan 19, 2025 Cross-Modal Retrieval Image-text Retrieval
— Unverified 0UC2: Universal Cross-lingual Cross-modal Vision-and-Language Pre-training Apr 1, 2021 Image-text matching Image-text Retrieval
— Unverified 0UFO: A UniFied TransfOrmer for Vision-Language Representation Learning Nov 19, 2021 Image Captioning Image-text matching
— Unverified 0Ultra-High Dimensional Sparse Representations with Binarization for Efficient Text Retrieval Apr 15, 2021 Binarization Information Retrieval
— Unverified 0Unambiguous Text Localization and Retrieval for Cluttered Scenes Jul 1, 2017 Retrieval Text Retrieval
— Unverified 0Uncertainty-Aware Alignment Network for Cross-Domain Video-Text Retrieval Sep 21, 2023 Domain Adaptation Retrieval
— Unverified 0Uncertainty-Aware Alignment Network for Cross-Domain Video-Text Retrieval Sep 21, 2023 Domain Adaptation Retrieval
— Unverified 0Dynamic Visual Semantic Sub-Embeddings and Fast Re-Ranking Sep 15, 2023 Image-text matching Re-Ranking
— Unverified 0Uncertainty-aware sign language video retrieval with probability distribution modeling May 30, 2024 Retrieval Sign Language Retrieval
— Unverified 0Understanding and Constructing Latent Modality Structures in Multi-modal Representation Learning Mar 10, 2023 Few-Shot Image Classification image-classification
— Unverified 0Understanding and Predicting Characteristics of Test Collections in Information Retrieval Dec 24, 2020 Information Retrieval Retrieval
— Unverified 0Understanding the Effect of using Semantically Meaningful Tokens for Visual Representation Learning May 26, 2024 Image to text Image-to-Text Retrieval
— Unverified 0Unicoder-VL: A Universal Encoder for Vision and Language by Cross-modal Pre-training Aug 16, 2019 Image-text matching Image-text Retrieval
— Unverified 0Unified Loss of Pair Similarity Optimization for Vision-Language Retrieval Sep 28, 2022 Contrastive Learning Retrieval
— Unverified 0Unified Multimodal Pre-training and Prompt-based Tuning for Vision-Language Understanding and Generation Dec 10, 2021 Image-text matching Image-text Retrieval
— Unverified 0Unifying Latent and Lexicon Representations for Effective Video-Text Retrieval Feb 26, 2024 Retrieval Text Retrieval
— Unverified 0Unifying Multimodal Retrieval via Document Screenshot Embedding Jun 17, 2024 Language Modelling Natural Questions
— Unverified 0Uni-Mlip: Unified Self-supervision for Medical Vision Language Pre-training Nov 20, 2024 Contrastive Learning image-classification
— Unverified 0UNITER: Learning UNiversal Image-TExt Representations Sep 25, 2019 Image-text matching Image-text Retrieval
— Unverified 0UrbanCross: Enhancing Satellite Image-Text Retrieval with Cross-Domain Adaptation Apr 22, 2024 Diversity Domain Adaptation
— Unverified 0V^2Dial: Unification of Video and Visual Dialog via Multimodal Experts Mar 3, 2025 Contrastive Learning Text Retrieval
— Unverified 0V^2Dial: Unification of Video and Visual Dialog via Multimodal Experts Jan 1, 2025 Contrastive Learning Text Retrieval
— Unverified 0Variance-Aware Loss Scheduling for Multimodal Alignment in Low-Data Settings Mar 5, 2025 Contrastive Learning Image-text Retrieval
— Unverified 0Video Editing for Video Retrieval Feb 4, 2024 Retrieval Text Retrieval
— Unverified 0ViLEM: Visual-Language Error Modeling for Image-Text Retrieval Jan 1, 2023 Contrastive Learning Image-text Retrieval
— Unverified 0