Beyond Text: Optimizing RAG with Multimodal Inputs for Industrial Applications Oct 29, 2024 Image Retrieval RAG
Code Code Available 25 HM-RAG: Hierarchical Multi-Agent Multimodal Retrieval Augmented Generation Apr 13, 2025 Multimodal Reasoning RAG
Code Code Available 25 Hello Again! LLM-powered Personalized Agent for Long-term Dialogue Jun 9, 2024 Response Generation Retrieval
Code Code Available 25 Blended RAG: Improving RAG (Retriever-Augmented Generation) Accuracy with Semantic Search and Hybrid Query-Based Retrievers Mar 22, 2024 Information Retrieval
Code Code Available 25 Beyond Matryoshka: Revisiting Sparse Coding for Adaptive Representation Mar 3, 2025 Representation Learning Retrieval
Code Code Available 25 Huatuo-26M, a Large-scale Chinese Medical QA Dataset May 2, 2023 Language Modeling Language Modelling
Code Code Available 25 InPars-v2: Large Language Models as Efficient Dataset Generators for Information Retrieval Jan 4, 2023 Information Retrieval Retrieval
Code Code Available 25 BEVPlace: Learning LiDAR-based Place Recognition using Bird's Eye View Images Feb 28, 2023 Retrieval
Code Code Available 25 Dynamic Parametric Retrieval Augmented Generation for Test-time Knowledge Enhancement Mar 31, 2025 Hallucination RAG
Code Code Available 25 GLARE: Low Light Image Enhancement via Generative Latent Feature based Codebook Retrieval Jul 17, 2024 Decoder Image Enhancement
Code Code Available 25 All You Need to Know About Training Image Retrieval Models Mar 17, 2025 All Image Retrieval
Code Code Available 25 GiantMIDI-Piano: A large-scale MIDI dataset for classical piano music Oct 11, 2020 Information Retrieval Music Information Retrieval
Code Code Available 25 Global Features are All You Need for Image Retrieval and Reranking Aug 14, 2023 All Image Retrieval
Code Code Available 25 Generating Images with Multimodal Language Models May 26, 2023 Decoder Image Generation
Code Code Available 25 Generating Benchmarks for Factuality Evaluation of Language Models Jul 13, 2023 Language Modeling Language Modelling
Code Code Available 25 GeneGPT: Augmenting Large Language Models with Domain Tools for Improved Access to Biomedical Information Apr 19, 2023 In-Context Learning Retrieval
Code Code Available 25 ActiveRAG: Autonomously Knowledge Assimilation and Accommodation through Retrieval-Augmented Agents Feb 21, 2024 Active Learning Position
Code Code Available 25 An Autonomous GIS Agent Framework for Geospatial Data Retrieval Jul 13, 2024 Retrieval
Code Code Available 25 Generalized Contrastive Learning for Multi-Modal Retrieval and Ranking Apr 12, 2024 Contrastive Learning Retrieval
Code Code Available 25 GENIUS: A Generative Framework for Universal Multimodal Search Mar 25, 2025 Information Retrieval Quantization
Code Code Available 25 BEIR: A Heterogenous Benchmark for Zero-shot Evaluation of Information Retrieval Models Apr 17, 2021 Argument Retrieval Benchmarking
Code Code Available 25 VeCLIP: Improving CLIP Training via Visual-enriched Captions Oct 11, 2023 Image-text Retrieval Retrieval
Code Code Available 25 FollowIR: Evaluating and Teaching Information Retrieval Models to Follow Instructions Mar 22, 2024 Information Retrieval Retrieval
Code Code Available 25 FreshDiskANN: A Fast and Accurate Graph-Based ANN Index for Streaming Similarity Search May 20, 2021 Information Retrieval Retrieval
Code Code Available 25 BEBLID: Boosted efficient binary local image descriptor Feb 7, 2024 Computational Efficiency Retrieval
Code Code Available 25 Benchmarking Retrieval-Augmented Generation in Multi-Modal Contexts Feb 24, 2025 Benchmarking Fact Verification
Code Code Available 25 Fine-grained Late-interaction Multi-modal Retrieval for Retrieval Augmented Visual Question Answering Sep 29, 2023 Image to text Passage Retrieval
Code Code Available 25 Backtracing: Retrieving the Cause of the Query Mar 6, 2024 Information Retrieval Language Modeling
Code Code Available 25 FLAIR: VLM with Fine-grained Language-informed Image Representations Dec 4, 2024 Language Modeling Language Modelling
Code Code Available 25 Baleen: Robust Multi-Hop Reasoning at Scale via Condensed Retrieval Jan 2, 2021 Claim Verification Question Answering
Code Code Available 25 AiSAQ: All-in-Storage ANNS with Product Quantization for DRAM-free Information Retrieval Apr 9, 2024 All Information Retrieval
Code Code Available 25 Benchmarking Large Language Models in Retrieval-Augmented Generation Sep 4, 2023 Benchmarking counterfactual
Code Code Available 25 Fine-grained Image Captioning with CLIP Reward May 26, 2022 Caption Generation Descriptive
Code Code Available 25 FedRAG: A Framework for Fine-Tuning Retrieval-Augmented Generation Systems Jun 10, 2025 RAG Retrieval
Code Code Available 25 Atlas: Few-shot Learning with Retrieval Augmented Language Models Aug 5, 2022 Fact Checking Few-Shot Learning
Code Code Available 25 Active Retrieval Augmented Generation May 11, 2023 Retrieval Retrieval-augmented Generation
Code Code Available 25 Autoregressive Search Engines: Generating Substrings as Document Identifiers Apr 22, 2022 Information Retrieval Retrieval
Code Code Available 25 Flow-Guided Transformer for Video Inpainting Aug 14, 2022 Retrieval Video Inpainting
Code Code Available 25 Grounding Language Models to Images for Multimodal Inputs and Outputs Jan 31, 2023 Image Retrieval In-Context Learning
Code Code Available 25 INQUIRE: A Natural World Text-to-Image Retrieval Benchmark Nov 4, 2024 Image Retrieval Reranking
Code Code Available 25 All in One: Exploring Unified Video-Language Pre-training Mar 14, 2022 All Language Modelling
Code Code Available 25 All-In-One Metrical And Functional Structure Analysis With Neighborhood Attentions on Demixed Audio Jul 31, 2023 All Downbeat Tracking
Code Code Available 25 Extended Mind Transformers Jun 4, 2024 Common Sense Reasoning counterfactual
Code Code Available 25 AutoML-Agent: A Multi-Agent LLM Framework for Full-Pipeline AutoML Oct 3, 2024 AutoML Code Generation
Code Code Available 25 GLAP: General contrastive audio-text pretraining across domains and languages Jun 12, 2025 AudioCaps Keyword Spotting
Code Code Available 25 Exploring a Fine-Grained Multiscale Method for Cross-Modal Remote Sensing Image Retrieval Apr 21, 2022 Cross-Modal Retrieval Image Retrieval
Code Code Available 25 Autonomous GIS: the next-generation AI-powered GIS May 10, 2023 Code Generation Information Retrieval
Code Code Available 25 Exploring the best way for UAV visual localization under Low-altitude Multi-view Observation Condition: a Benchmark Mar 12, 2025 Image Retrieval Retrieval
Code Code Available 25 Evaluating RAG-Fusion with RAGElo: an Automated Elo-based Framework Jun 20, 2024 Hallucination Question Answering
Code Code Available 25 Automated Evaluation of Retrieval-Augmented Language Models with Task-Specific Exam Generation May 22, 2024 Informativeness Language Modeling
Code Code Available 25