Improving the Consistency in Cross-Lingual Cross-Modal Retrieval with 1-to-K Contrastive Learning Jun 26, 2024 Contrastive Learning Cross-Modal Retrieval
Code Code Available 05 MHSAN: Multi-Head Self-Attention Network for Visual Semantic Embedding Jan 11, 2020 Image Captioning Image-text Retrieval
Code Code Available 05 Variational Deep Semantic Hashing for Text Documents Aug 11, 2017 Image Retrieval Information Retrieval
Code Code Available 05 Shallow Cross-Encoders for Low-Latency Retrieval Mar 29, 2024 CPU GPU
Code Code Available 05 VL-Taboo: An Analysis of Attribute-based Zero-shot Capabilities of Vision-Language Models Sep 12, 2022 Attribute Image-text Retrieval
Code Code Available 05 Negative Sample is Negative in Its Own Way: Tailoring Negative Sentences for Image-Text Retrieval Nov 5, 2021 Image-text Retrieval Retrieval
Code Code Available 05 Integrating Listwise Ranking into Pairwise-based Image-Text Retrieval May 26, 2023 Image-text Retrieval Retrieval
Code Code Available 05 WikiGraphs: A Wikipedia Text - Knowledge Graph Paired Dataset Jul 20, 2021 Articles Conditional Text Generation
Code Code Available 05 GSSF: Generalized Structural Sparse Function for Deep Cross-modal Metric Learning Oct 20, 2024 Image Retrieval Image-text Retrieval
Code Code Available 05 Single Shot Scene Text Retrieval Aug 27, 2018 Image Retrieval Retrieval
Code Code Available 05 Single-Stream Multi-Level Alignment for Vision-Language Pretraining Mar 27, 2022 Image-text Retrieval Question Answering
Code Code Available 05 NAPReg: Nouns As Proxies Regularization for Semantically Aware Cross-Modal Embeddings Jan 7, 2023 Cross-Modal Retrieval Image-text Retrieval
Code Code Available 05 Socratic Models: Composing Zero-Shot Multimodal Reasoning with Language Apr 1, 2022 Diversity Image Captioning
Code Code Available 05 Sparse, Dense, and Attentional Representations for Text Retrieval May 1, 2020 Open-Domain Question Answering Retrieval
Code Code Available 05 HADA: A Graph-based Amalgamation Framework in Image-text Retrieval Jan 11, 2023 Graph Neural Network Image Retrieval
Code Code Available 05 Intra-Modal Constraint Loss For Image-Text Retrieval Jul 11, 2022 Cross-Modal Retrieval Image-text Retrieval
Code Code Available 05 Invisible Relevance Bias: Text-Image Retrieval Models Prefer AI-Generated Images Nov 23, 2023 Cross-Modal Retrieval Image Retrieval
Code Code Available 05 Video-Text Retrieval by Supervised Sparse Multi-Grained Learning Feb 19, 2023 Representation Learning Retrieval
Code Code Available 05 It Takes Two to Tango: Combining Visual and Textual Information for Detecting Duplicate Video-Based Bug Reports Jan 22, 2021 Optical Character Recognition Optical Character Recognition (OCR)
Code Code Available 05 MeTA: A Unified Toolkit for Text Retrieval and Analysis Aug 1, 2016 Document Classification Information Retrieval
Code Code Available 05 Corpus-Level End-to-End Exploration for Interactive Systems Nov 23, 2019 Reinforcement Learning Reinforcement Learning (RL)
Code Code Available 05 Stop Pre-Training: Adapt Visual-Language Models to Unseen Languages Jun 29, 2023 Image-text Retrieval Machine Translation
Code Code Available 05 Design of the topology for contrastive visual-textual alignment Sep 5, 2022 Contrastive Learning Image-to-Text Retrieval
Code Code Available 05 MODOC: A Modular Interface for Flexible Interlinking of Text Retrieval and Text Generation Functions Aug 26, 2024 Information Retrieval Retrieval
Code Code Available 05 ViQuAE, a Dataset for Knowledge-based Visual Question Answering about Named Entities Nov 16, 2021 Articles Face Recognition
Code Code Available 05 Modelling Stopping Criteria for Search Results using Poisson Processes Sep 13, 2019 Retrieval Text Retrieval
Code Code Available 05 ERU-KG: Efficient Reference-aligned Unsupervised Keyphrase Generation May 30, 2025 Informativeness Keyphrase Generation
Code Code Available 05 Symmetric Multi-Similarity Loss for EPIC-KITCHENS-100 Multi-Instance Retrieval Challenge 2024 Jun 18, 2024 Ensemble Learning Multi-Instance Retrieval
Code Code Available 05 Large Vision-Language Models for Knowledge-Grounded Data Annotation of Memes Jan 23, 2025 Emotion Classification Image Captioning
Code Code Available 05 Aligning Multilingual Word Embeddings for Cross-Modal Retrieval Task Oct 8, 2019 Cross-Modal Retrieval Image to text
Code Code Available 05 A Binary Variational Autoencoder for Hashing Oct 22, 2019 Quantization Retrieval
Code Code Available 05 Webly Supervised Joint Embedding for Cross-Modal Image-Text Retrieval Aug 23, 2018 Cross-Modal Retrieval Image-text Retrieval
— Unverified 00 Webly Supervised Joint Embedding for Cross-Modal lmage-Text Retrieval Oct 1, 2018 Cross-Modal Retrieval Image-text Retrieval
— Unverified 00 What Makes a Top-Performing Precision Medicine Search Engine? Tracing Main System Features in a Systematic Way Jun 4, 2020 Retrieval SMAC
— Unverified 00 When are Lemons Purple? The Concept Association Bias of Vision-Language Models Dec 22, 2022 Attribute image-classification
— Unverified 00 Word Searching in Scene Image and Video Frame in Multi-Script Scenario using Dynamic Shape Coding Aug 18, 2017 Keyword Spotting Optical Character Recognition (OCR)
— Unverified 00 XGPT: Cross-modal Generative Pre-Training for Image Captioning Mar 3, 2020 Data Augmentation Denoising
— Unverified 00 Zero-Shot Dense Retrieval with Momentum Adversarial Domain Invariant Representation Sep 29, 2021 Representation Learning Retrieval
— Unverified 00 Zero-Shot Dense Retrieval with Momentum Adversarial Domain Invariant Representations Oct 14, 2021 Representation Learning Retrieval
— Unverified 00 Ziya-Visual: Bilingual Large Vision-Language Model via Multi-Task Instruction Tuning Oct 12, 2023 Image Captioning Image-text Retrieval
— Unverified 00 Multimodal or Text? Retrieval or BERT? Benchmarking Classifiers for the Shared Task on Hateful Memes Aug 1, 2021 Benchmarking Binary Classification
— Unverified 00 Toward Automatic Relevance Judgment using Vision--Language Models for Image--Text Retrieval Evaluation Aug 2, 2024 Image-text Retrieval Retrieval
— Unverified 00 MedUnifier: Unifying Vision-and-Language Pre-training on Medical Data with Vision Generation Task using Discrete Visual Representations Mar 2, 2025 image-classification Image Classification
— Unverified 00 Towards Cross-modal Retrieval in Chinese Cultural Heritage Documents: Dataset and Solution May 16, 2025 Cross-Modal Retrieval Image to text
— Unverified 00 ABC: Achieving Better Control of Multimodal Embeddings using VLMs Mar 1, 2025 Image to text Image-to-Text Retrieval
— Unverified 00 ABC-SG: A New Artificial Bee Colony Algorithm-Based Distance of Sequential Data Using Sigma Grams Dec 5, 2013 Retrieval Text Retrieval
— Unverified 00 Accept the Modality Gap: An Exploration in the Hyperbolic Space Jan 1, 2024 Image to text Image-to-Text Retrieval
— Unverified 00 ACE: A Generative Cross-Modal Retrieval Framework with Coarse-To-Fine Semantic Modeling Jun 25, 2024 Cross-Modal Retrieval Natural Language Queries
— Unverified 00 A Comparative Study of Text Retrieval Models on DaReCzech Nov 19, 2024 Information Retrieval Machine Translation
— Unverified 00 Active Learning for Finely-Categorized Image-Text Retrieval by Selecting Hard Negative Unpaired Samples May 25, 2024 Active Learning Image-text Retrieval
— Unverified 00