ViSeRet: A simple yet effective approach to moment retrieval via fine-grained video segmentation | Oct 11, 2021 | Moment Retrieval, Retrieval | Unverified | 0
VLAB: Enhancing Video Language Pre-training by Feature Adapting and Blending | May 22, 2023 | Question Answering, Retrieval | Unverified | 0
VL-BEiT: Generative Vision-Language Pretraining | Jun 2, 2022 | image-classification, Image Classification | Unverified | 0
VLMAE: Vision-Language Masked Autoencoder | Aug 19, 2022 | Image-text Retrieval, Language Modeling | Unverified | 0
VL-Match: Enhancing Vision-Language Pretraining with Token-Level and Instance-Level Matching | Jan 1, 2023 | Image-text matching, Image-text Retrieval | Unverified | 0
Webly Supervised Joint Embedding for Cross-Modal Image-Text Retrieval | Aug 23, 2018 | Cross-Modal Retrieval, Image-text Retrieval | Unverified | 0
Webly Supervised Joint Embedding for Cross-Modal Image-Text Retrieval | Oct 1, 2018 | Cross-Modal Retrieval, Image-text Retrieval | Unverified | 0
What Makes a Top-Performing Precision Medicine Search Engine? Tracing Main System Features in a Systematic Way | Jun 4, 2020 | Retrieval, SMAC | Unverified | 0
When are Lemons Purple? The Concept Association Bias of Vision-Language Models | Dec 22, 2022 | Attribute, image-classification | Unverified | 0
Word Searching in Scene Image and Video Frame in Multi-Script Scenario using Dynamic Shape Coding | Aug 18, 2017 | Keyword Spotting, Optical Character Recognition (OCR) | Unverified | 0
XGPT: Cross-modal Generative Pre-Training for Image Captioning | Mar 3, 2020 | Data Augmentation, Denoising | Unverified | 0
Zero-Shot Dense Retrieval with Momentum Adversarial Domain Invariant Representation | Sep 29, 2021 | Representation Learning, Retrieval | Unverified | 0
Zero-Shot Dense Retrieval with Momentum Adversarial Domain Invariant Representations | Oct 14, 2021 | Representation Learning, Retrieval | Unverified | 0
Ziya-Visual: Bilingual Large Vision-Language Model via Multi-Task Instruction Tuning | Oct 12, 2023 | Image Captioning, Image-text Retrieval | Unverified | 0
PolySmart @ TRECVid 2024 Medical Video Question Answering | Dec 20, 2024 | Question Answering, Retrieval | Unverified | 0
Probing Inter-modality: Visual Parsing with Self-Attention for Vision-Language Pre-training | Jun 25, 2021 | Image-text Retrieval, Question Answering | Unverified | 0
Progressive Learning for Image Retrieval with Hybrid-Modality Queries | Apr 24, 2022 | Image Retrieval, Image-text Retrieval | Unverified | 0
Progressive Local Alignment for Medical Multimodal Pre-training | Feb 25, 2025 | Contrastive Learning, Image-text Retrieval | Unverified | 0
Prompt-based Learning for Unpaired Image Captioning | May 26, 2022 | Image Captioning, Image-text Retrieval | Unverified | 0
Pushing the Limits of Vision-Language Models in Remote Sensing without Human Annotations | Sep 11, 2024 | Image-text Retrieval, Text Retrieval | Unverified | 0
QBD-RankedDataGen: Generating Custom Ranked Datasets for Improving Query-By-Document Search Using LLM-Reranking with Reduced Human Effort | May 7, 2025 | Information Retrieval, Reranking | Unverified | 0
RE-AdaptIR: Improving Information Retrieval through Reverse Engineered Adaptation | Jun 20, 2024 | Information Retrieval, Retrieval | Unverified | 0
RECLIP: Resource-efficient CLIP by Training with Small Images | Apr 12, 2023 | Contrastive Learning, Image-text Retrieval | Unverified | 0
Refining Knowledge Transfer on Audio-Image Temporal Agreement for Audio-Text Cross Retrieval | Mar 16, 2024 | Image Retrieval, Retrieval | Unverified | 0
Re-Imagen: Retrieval-Augmented Text-to-Image Generator | Sep 29, 2022 | Image Generation, Image-text Retrieval | Unverified | 0
Representation Discrepancy Bridging Method for Remote Sensing Image-Text Retrieval | May 22, 2025 | cross-modal alignment, Image-text Retrieval | Unverified | 0
Retaining Knowledge and Enhancing Long-Text Representations in CLIP through Dual-Teacher Distillation | Jan 1, 2025 | image-classification, Image Classification | Unverified | 0
Rethinking Noisy Video-Text Retrieval via Relation-aware Alignment | Jan 1, 2025 | Relation, Retrieval | Unverified | 0
Partial Scene Text Retrieval | Nov 15, 2024 | Multiple Instance Learning, Retrieval | Code Available | 0
Enhancing Image-Text Matching with Adaptive Feature Aggregation | Jan 18, 2024 | Image-text matching, Image-text Retrieval | Code Available | 0
PaCE: Unified Multi-modal Dialogue Pre-training with Progressive and Compositional Experts | May 24, 2023 | Dialogue State Tracking, Image Retrieval | Code Available | 0
Pre-trained Language Models Can be Fully Zero-Shot Learners | Dec 14, 2022 | Retrieval, text-classification | Code Available | 0
OTE: Exploring Accurate Scene Text Recognition Using One Token | Jan 1, 2024 | Decoder, Scene Text Recognition | Code Available | 0
GABInsight: Exploring Gender-Activity Binding Bias in Vision-Language Models | Jul 30, 2024 | Image to text, Image-to-Text Retrieval | Code Available | 0
ProCIS: A Benchmark for Proactive Retrieval in Conversations | May 10, 2024 | Retrieval, Text Retrieval | Code Available | 0
From Unimodal to Multimodal: Scaling up Projectors to Align Modalities | Sep 28, 2024 | Image-text Retrieval, Semantic Similarity | Code Available | 0
Which Country Is This? Automatic Country Ranking of Street View Photos | Jun 11, 2024 | Retrieval, Text Retrieval | Code Available | 0
Attacking Attention of Foundation Models Disrupts Downstream Tasks | Jun 3, 2025 | Depth Estimation, Image-text Retrieval | Code Available | 0
On Using GUI Interaction Data to Improve Text Retrieval-based Bug Localization | Oct 12, 2023 | Information Retrieval, Retrieval | Code Available | 0
Object-Aware Query Perturbation for Cross-Modal Image-Text Retrieval | Jul 17, 2024 | Image-text Retrieval, Object | Code Available | 0
News Without Borders: Domain Adaptation of Multilingual Sentence Embeddings for Cross-lingual News Recommendation | Jun 18, 2024 | Cross-Lingual Transfer, Domain Adaptation | Code Available | 0
WikiGraphs: A Wikipedia Text - Knowledge Graph Paired Dataset | Jul 20, 2021 | Articles, Conditional Text Generation | Code Available | 0
A Hybrid Retrieval-Generation Neural Conversation Model | Apr 19, 2019 | Diversity, model | Code Available | 0
ATRI: Mitigating Multilingual Audio Text Retrieval Inconsistencies by Reducing Data Distribution Errors | Feb 20, 2025 | AudioCaps, Contrastive Learning | Code Available | 0
Neurocache: Efficient Vector Retrieval for Long-range Language Modeling | Jul 2, 2024 | Few-Shot Learning, Language Modeling | Code Available | 0
Image Chat: Engaging Grounded Conversations | Nov 2, 2018 | Text Retrieval | Code Available | 0
Refining Joint Text and Source Code Embeddings for Retrieval Task with Parameter-Efficient Fine-Tuning | May 7, 2024 | Benchmarking, Contrastive Learning | Code Available | 0
Text Retrieval with Multi-Stage Re-Ranking Models | Nov 14, 2023 | Language Modeling, Language Modelling | Code Available | 0
Negative Sample is Negative in Its Own Way: Tailoring Negative Sentences for Image-Text Retrieval | Nov 5, 2021 | Image-text Retrieval, Retrieval | Code Available | 0
FiCo-ITR: bridging fine-grained and coarse-grained image-text retrieval for comparative performance analysis | Jul 29, 2024 | Image-text Retrieval, Model Selection | Code Available | 0