Late Chunking: Contextual Chunk Embeddings Using Long-Context Embedding Models Sep 7, 2024 Chunking Retrieval
Code Code Available 35 M3D: Advancing 3D Medical Image Analysis with Multi-Modal Large Language Models Mar 31, 2024 Image-text Retrieval Language Modeling
Code Code Available 35 InPars-v2: Large Language Models as Efficient Dataset Generators for Information Retrieval Jan 4, 2023 Information Retrieval Retrieval
Code Code Available 25 InPars: Data Augmentation for Information Retrieval using Large Language Models Feb 10, 2022 Data Augmentation Diversity
Code Code Available 25 Improving Retrieval-Augmented Generation through Multi-Agent Reinforcement Learning Jan 25, 2025 Answer Generation Multi-agent Reinforcement Learning
Code Code Available 25 InPars Toolkit: A Unified and Reproducible Synthetic Data Generation Pipeline for Neural Information Retrieval Jul 10, 2023 GPU Information Retrieval
Code Code Available 25 INQUIRE: A Natural World Text-to-Image Retrieval Benchmark Nov 4, 2024 Image Retrieval Reranking
Code Code Available 25 Benchmarking Large Language Models in Retrieval-Augmented Generation Sep 4, 2023 Benchmarking counterfactual
Code Code Available 25 BEBLID: Boosted efficient binary local image descriptor Feb 7, 2024 Computational Efficiency Retrieval
Code Code Available 25 BEIR: A Heterogenous Benchmark for Zero-shot Evaluation of Information Retrieval Models Apr 17, 2021 Argument Retrieval Benchmarking
Code Code Available 25 Huatuo-26M, a Large-scale Chinese Medical QA Dataset May 2, 2023 Language Modeling Language Modelling
Code Code Available 25 Baleen: Robust Multi-Hop Reasoning at Scale via Condensed Retrieval Jan 2, 2021 Claim Verification Question Answering
Code Code Available 25 Improving Diffusion Inverse Problem Solving with Decoupled Noise Annealing Jul 1, 2024 Denoising Image Restoration
Code Code Available 25 InstructRAG: Instructing Retrieval-Augmented Generation via Self-Synthesized Rationales Jun 19, 2024 Denoising In-Context Learning
Code Code Available 25 HM-RAG: Hierarchical Multi-Agent Multimodal Retrieval Augmented Generation Apr 13, 2025 Multimodal Reasoning RAG
Code Code Available 25 Hello Again! LLM-powered Personalized Agent for Long-term Dialogue Jun 9, 2024 Response Generation Retrieval
Code Code Available 25 Hopfield Networks is All You Need Jul 16, 2020 All Drug Design
Code Code Available 25 Backtracing: Retrieving the Cause of the Query Mar 6, 2024 Information Retrieval Language Modeling
Code Code Available 25 HourVideo: 1-Hour Video-Language Understanding Nov 7, 2024 Benchmarking counterfactual
Code Code Available 25 All in One: Exploring Unified Video-Language Pre-training Mar 14, 2022 All Language Modelling
Code Code Available 25 Autoregressive Search Engines: Generating Substrings as Document Identifiers Apr 22, 2022 Information Retrieval Retrieval
Code Code Available 25 AutoML-Agent: A Multi-Agent LLM Framework for Full-Pipeline AutoML Oct 3, 2024 AutoML Code Generation
Code Code Available 25 GLARE: Low Light Image Enhancement via Generative Latent Feature based Codebook Retrieval Jul 17, 2024 Decoder Image Enhancement
Code Code Available 25 All You Need to Know About Training Image Retrieval Models Mar 17, 2025 All Image Retrieval
Code Code Available 25 Autonomous GIS: the next-generation AI-powered GIS May 10, 2023 Code Generation Information Retrieval
Code Code Available 25 Global Features are All You Need for Image Retrieval and Reranking Aug 14, 2023 All Image Retrieval
Code Code Available 25 How do you know that? Teaching Generative Language Models to Reference Answers to Biomedical Questions Jul 6, 2024 Question Answering RAG
Code Code Available 25 Interactive Continual Learning: Fast and Slow Thinking Mar 5, 2024 Continual Learning Outlier Detection
Code Code Available 25 Generating Images with Multimodal Language Models May 26, 2023 Decoder Image Generation
Code Code Available 25 All-In-One Metrical And Functional Structure Analysis With Neighborhood Attentions on Demixed Audio Jul 31, 2023 All Downbeat Tracking
Code Code Available 25 Generating Benchmarks for Factuality Evaluation of Language Models Jul 13, 2023 Language Modeling Language Modelling
Code Code Available 25 Automated Evaluation of Retrieval-Augmented Language Models with Task-Specific Exam Generation May 22, 2024 Informativeness Language Modeling
Code Code Available 25 Generalized Contrastive Learning for Multi-Modal Retrieval and Ranking Apr 12, 2024 Contrastive Learning Retrieval
Code Code Available 25 GENIUS: A Generative Framework for Universal Multimodal Search Mar 25, 2025 Information Retrieval Quantization
Code Code Available 25 Grounding Language Models to Images for Multimodal Inputs and Outputs Jan 31, 2023 Image Retrieval In-Context Learning
Code Code Available 25 AiSAQ: All-in-Storage ANNS with Product Quantization for DRAM-free Information Retrieval Apr 9, 2024 All Information Retrieval
Code Code Available 25 VeCLIP: Improving CLIP Training via Visual-enriched Captions Oct 11, 2023 Image-text Retrieval Retrieval
Code Code Available 25 GeneGPT: Augmenting Large Language Models with Domain Tools for Improved Access to Biomedical Information Apr 19, 2023 In-Context Learning Retrieval
Code Code Available 25 FollowIR: Evaluating and Teaching Information Retrieval Models to Follow Instructions Mar 22, 2024 Information Retrieval Retrieval
Code Code Available 25 Flow-Guided Transformer for Video Inpainting Aug 14, 2022 Retrieval Video Inpainting
Code Code Available 25 FreshDiskANN: A Fast and Accurate Graph-Based ANN Index for Streaming Similarity Search May 20, 2021 Information Retrieval Retrieval
Code Code Available 25 AIR-Bench: Automated Heterogeneous Information Retrieval Benchmark Dec 17, 2024 Information Retrieval Retrieval
Code Code Available 25 Fine-grained Image Captioning with CLIP Reward May 26, 2022 Caption Generation Descriptive
Code Code Available 25 Search and Refine During Think: Autonomous Retrieval-Augmented Reasoning of LLMs May 16, 2025 Retrieval
Code Code Available 25 Fine-grained Late-interaction Multi-modal Retrieval for Retrieval Augmented Visual Question Answering Sep 29, 2023 Image to text Passage Retrieval
Code Code Available 25 Hybrid Transformer with Multi-level Fusion for Multimodal Knowledge Graph Completion May 4, 2022 Information Retrieval Knowledge Graph Completion
Code Code Available 25 Benchmarking Retrieval-Augmented Generation in Multi-Modal Contexts Feb 24, 2025 Benchmarking Fact Verification
Code Code Available 25 Improving Medical Reasoning through Retrieval and Self-Reflection with Retrieval-Augmented Large Language Models Jan 27, 2024 Medical Question Answering Multiple-choice
Code Code Available 25 In-Context Retrieval-Augmented Language Models Jan 31, 2023 Language Modeling Language Modelling
Code Code Available 25 AudioSetCaps: An Enriched Audio-Caption Dataset using Automated Generation Pipeline with Large Audio and Language Models Nov 28, 2024 Audio captioning Audio to Text Retrieval
Code Code Available 25