Title | Date | Tasks | Status
Establishing a Foundation for Tetun Ad-Hoc Text Retrieval: Stemming, Indexing, Retrieval, and Ranking | Dec 16, 2024 | Information Retrieval, Retrieval | Unverified
jina-clip-v2: Multilingual Multimodal Embeddings for Text and Images | Dec 11, 2024 | Contrastive Learning, Cross-Modal Information Retrieval | Unverified
Barking Up The Syntactic Tree: Enhancing VLM Training with Syntactic Losses | Dec 11, 2024 | Image-text Retrieval, Question Answering | Unverified
Explaining and Mitigating the Modality Gap in Contrastive Multimodal Learning | Dec 10, 2024 | Contrastive Learning, Image-text Retrieval | Unverified
VladVA: Discriminative Fine-tuning of LVLMs | Dec 5, 2024 | Image-text Retrieval, Representation Learning | Unverified
Linq-Embed-Mistral Technical Report | Dec 4, 2024 | Retrieval, Text Retrieval | Unverified
Adaptive Two-Phase Finetuning LLMs for Japanese Legal Text Retrieval | Dec 3, 2024 | Retrieval, Text Retrieval | Unverified
DIR: Retrieval-Augmented Image Captioning with Comprehensive Understanding | Dec 2, 2024 | Caption Generation, Domain Generalization | Unverified
Approximate Fiber Product: A Preliminary Algebraic-Geometric Perspective on Multimodal Embedding Alignment | Nov 30, 2024 | Image-text Retrieval, Representation Learning | Unverified
CAREL: Instruction-guided reinforcement learning with cross-modal auxiliary objectives | Nov 29, 2024 | reinforcement-learning, Reinforcement Learning | Code Available
Knowledge Transfer Across Modalities with Natural Language Supervision | Nov 23, 2024 | Image-text Retrieval, Novel Concepts | Unverified
Cross-Modal Pre-Aligned Method with Global and Local Information for Remote-Sensing Image and Text Retrieval | Nov 22, 2024 | Image Retrieval, Reranking | Unverified
Uni-Mlip: Unified Self-supervision for Medical Vision Language Pre-training | Nov 20, 2024 | Contrastive Learning, image-classification | Unverified
CodeXEmbed: A Generalist Embedding Model Family for Multiligual and Multi-task Code Retrieval | Nov 19, 2024 | Diversity, Natural Language Queries | Unverified
A Comparative Study of Text Retrieval Models on DaReCzech | Nov 19, 2024 | Information Retrieval, Machine Translation | Unverified
BoolQuestions: Does Dense Retrieval Understand Boolean Logic in Language? | Nov 19, 2024 | Retrieval, Text Retrieval | Unverified
Partial Scene Text Retrieval | Nov 15, 2024 | Multiple Instance Learning, Retrieval | Code Available
MM-Embed: Universal Multimodal Retrieval with Multimodal LLMs | Nov 4, 2024 | Cross-Modal Retrieval, Information Retrieval | Unverified
SPECTRUM: Semantic Processing and Emotion-informed video-Captioning Through Retrieval and Understanding Modalities | Nov 4, 2024 | Attribute, Descriptive | Unverified
Robotic State Recognition with Image-to-Text Retrieval Task of Pre-Trained Vision-Language Model and Black-Box Optimization | Oct 30, 2024 | Image to text, Image-to-Text Retrieval | Unverified
Multilingual Vision-Language Pre-training for the Remote Sensing Domain | Oct 30, 2024 | Cross-Modal Retrieval, image-classification | Code Available
Do Audio-Language Models Understand Linguistic Variations? | Oct 21, 2024 | Contrastive Learning, Natural Language Queries | Unverified
GSSF: Generalized Structural Sparse Function for Deep Cross-modal Metric Learning | Oct 20, 2024 | Image Retrieval, Image-text Retrieval | Code Available
Improving General Text Embedding Model: Tackling Task Conflict and Data Imbalance through Model Merging | Oct 19, 2024 | model, Semantic Textual Similarity | Unverified
Beyond Coarse-Grained Matching in Video-Text Retrieval | Oct 16, 2024 | Retrieval, Text Retrieval | Unverified
CtrlSynth: Controllable Image Text Synthesis for Data-Efficient Multimodal Learning | Oct 15, 2024 | Image-text Retrieval, Text Retrieval | Unverified
LaMP: Language-Motion Pretraining for Motion Generation, Retrieval, and Captioning | Oct 9, 2024 | Large Language Model, Motion Captioning | Unverified
AnyAttack: Towards Large-scale Self-supervised Adversarial Attacks on Vision-language Models | Oct 7, 2024 | Image Captioning, Image-text Retrieval | Unverified
CoLLAP: Contrastive Long-form Language-Audio Pretraining with Musical Temporal Structure Augmentation | Oct 3, 2024 | Contrastive Learning, Form | Unverified
From Unimodal to Multimodal: Scaling up Projectors to Align Modalities | Sep 28, 2024 | Image-text Retrieval, Semantic Similarity | Code Available
Robotic Environmental State Recognition with Pre-Trained Vision-Language Models and Black-Box Optimization | Sep 26, 2024 | Image to text, Image-to-Text Retrieval | Unverified
DiffATR: Diffusion-based Generative Modeling for Audio-Text Retrieval | Sep 16, 2024 | AudioCaps, Retrieval | Unverified
NEVLP: Noise-Robust Framework for Efficient Vision-Language Pre-training | Sep 15, 2024 | Contrastive Learning, cross-modal alignment | Unverified
Enhancing Q&A Text Retrieval with Ranking Models: Benchmarking, fine-tuning and deploying Rerankers for RAG | Sep 12, 2024 | Benchmarking, Question Answering | Unverified
Pushing the Limits of Vision-Language Models in Remote Sensing without Human Annotations | Sep 11, 2024 | Image-text Retrieval, Text Retrieval | Unverified
Benchmarking and Building Zero-Shot Hindi Retrieval Model with Hindi-BEIR and NLLB-E5 | Sep 9, 2024 | Benchmarking, Information Retrieval | Unverified
MODOC: A Modular Interface for Flexible Interlinking of Text Retrieval and Text Generation Functions | Aug 26, 2024 | Information Retrieval, Retrieval | Code Available
Mistral-SPLADE: LLMs for better Learned Sparse Retrieval | Aug 20, 2024 | Decoder, Language Modeling | Code Available
Improving embedding with contrastive fine-tuning on small datasets with expert-augmented scores | Aug 19, 2024 | Retrieval, Semantic Textual Similarity | Unverified
NAVERO: Unlocking Fine-Grained Semantics for Video-Language Compositionality | Aug 18, 2024 | Retrieval, Text Retrieval | Unverified
Mamba Retriever: Utilizing Mamba for Effective and Efficient Dense Retrieval | Aug 15, 2024 | Information Retrieval, Mamba | Unverified
Pairing Clustered Inverted Indexes with kNN Graphs for Fast Approximate Retrieval over Learned Sparse Representations | Aug 8, 2024 | Retrieval, Text Retrieval | Unverified
Toward Automatic Relevance Judgment using Vision--Language Models for Image--Text Retrieval Evaluation | Aug 2, 2024 | Image-text Retrieval, Retrieval | Unverified
GABInsight: Exploring Gender-Activity Binding Bias in Vision-Language Models | Jul 30, 2024 | Image to text, Image-to-Text Retrieval | Code Available
FiCo-ITR: bridging fine-grained and coarse-grained image-text retrieval for comparative performance analysis | Jul 29, 2024 | Image-text Retrieval, Model Selection | Code Available
mGTE: Generalized Long-Context Text Representation and Reranking Models for Multilingual Text Retrieval | Jul 29, 2024 | Contrastive Learning, Reranking | Unverified
Assessing Brittleness of Image-Text Retrieval Benchmarks from Vision-Language Models Perspective | Jul 21, 2024 | Image-text Retrieval, Information Retrieval | Unverified
Multimodal Misinformation Detection using Large Vision-Language Models | Jul 19, 2024 | Fact Checking, Fact Verification | Unverified
Object-Aware Query Perturbation for Cross-Modal Image-Text Retrieval | Jul 17, 2024 | Image-text Retrieval, Object | Code Available
How to Make Cross Encoder a Good Teacher for Efficient Image-Text Retrieval? | Jul 10, 2024 | Contrastive Learning, Image-text Retrieval | Unverified