| A Survey of Large Language Model Empowered Agents for Recommendation and Search: Towards Next-Generation Information Retrieval | Mar 7, 2025 | Information RetrievalLanguage Modeling | CodeCode Available | 2 | 5 |
| GSM-Infinite: How Do Your LLMs Behave over Infinitely Increasing Context Length and Reasoning Complexity? | Feb 7, 2025 | 8kInformation Retrieval | CodeCode Available | 2 | 5 |
| Detect-Order-Construct: A Tree Construction based Approach for Hierarchical Document Structure Analysis | Jan 22, 2024 | Document Layout AnalysisDocument Summarization | CodeCode Available | 2 | 5 |
| GiantMIDI-Piano: A large-scale MIDI dataset for classical piano music | Oct 11, 2020 | Information RetrievalMusic Information Retrieval | CodeCode Available | 2 | 5 |
| The GigaMIDI Dataset with Features for Expressive Music Performance Detection | Feb 24, 2025 | Information RetrievalMusic Information Retrieval | CodeCode Available | 2 | 5 |
| The Power of Noise: Redefining Retrieval for RAG Systems | Jan 26, 2024 | Information RetrievalRAG | CodeCode Available | 2 | 5 |
| Hybrid Transformer with Multi-level Fusion for Multimodal Knowledge Graph Completion | May 4, 2022 | Information RetrievalKnowledge Graph Completion | CodeCode Available | 2 | 5 |
| FreshDiskANN: A Fast and Accurate Graph-Based ANN Index for Streaming Similarity Search | May 20, 2021 | Information RetrievalRetrieval | CodeCode Available | 2 | 5 |
| UDAPDR: Unsupervised Domain Adaptation via LLM Prompting and Distillation of Rerankers | Mar 1, 2023 | Domain AdaptationInformation Retrieval | CodeCode Available | 2 | 5 |
| Bias and Unfairness in Information Retrieval Systems: New Challenges in the LLM Era | Apr 17, 2024 | FairnessInformation Retrieval | CodeCode Available | 2 | 5 |
| AIR-Bench: Automated Heterogeneous Information Retrieval Benchmark | Dec 17, 2024 | Information RetrievalRetrieval | CodeCode Available | 2 | 5 |
| Blended RAG: Improving RAG (Retriever-Augmented Generation) Accuracy with Semantic Search and Hybrid Query-Based Retrievers | Mar 22, 2024 | Information Retrieval | CodeCode Available | 2 | 5 |
| Infinite Recommendation Networks: A Data-Centric Approach | Jun 3, 2022 | Information RetrievalRecommendation Systems | CodeCode Available | 2 | 5 |
| FinBERT-QA: Financial Question Answering with pre-trained BERT Language Models | Apr 24, 2025 | Answer SelectionInformation Retrieval | CodeCode Available | 2 | 5 |
| BIRB: A Generalization Benchmark for Information Retrieval in Bioacoustics | Dec 12, 2023 | Information RetrievalRepresentation Learning | CodeCode Available | 2 | 5 |
| FIRST: Faster Improved Listwise Reranking with Single Token Decoding | Jun 21, 2024 | Information RetrievalLanguage Modeling | CodeCode Available | 2 | 5 |
| Eureka: Evaluating and Understanding Large Foundation Models | Sep 13, 2024 | Information Retrieval | CodeCode Available | 2 | 5 |
| A Foundation Model for Music Informatics | Nov 6, 2023 | Information Retrievalmodel | CodeCode Available | 2 | 5 |
| Evaluation of Retrieval-Augmented Generation: A Survey | May 13, 2024 | Information RetrievalRAG | CodeCode Available | 2 | 5 |
| ColBERT: Efficient and Effective Passage Search via Contextualized Late Interaction over BERT | Apr 27, 2020 | Document RankingInformation Retrieval | CodeCode Available | 2 | 5 |
| FollowIR: Evaluating and Teaching Information Retrieval Models to Follow Instructions | Mar 22, 2024 | Information RetrievalRetrieval | CodeCode Available | 2 | 5 |
| Autoregressive Search Engines: Generating Substrings as Document Identifiers | Apr 22, 2022 | Information RetrievalRetrieval | CodeCode Available | 2 | 5 |
| InPars: Data Augmentation for Information Retrieval using Large Language Models | Feb 10, 2022 | Data AugmentationDiversity | CodeCode Available | 2 | 5 |
| Backtracing: Retrieving the Cause of the Query | Mar 6, 2024 | Information RetrievalLanguage Modeling | CodeCode Available | 2 | 5 |
| Efficient Data-aware Distance Comparison Operations for High-Dimensional Approximate Nearest Neighbor Search | Nov 26, 2024 | Information Retrieval | CodeCode Available | 1 | 5 |
| EEG2Mel: Reconstructing Sound from Brain Responses to Music | Jul 28, 2022 | EEGElectroencephalogram (EEG) | CodeCode Available | 1 | 5 |
| Efficient Deep Learning: A Survey on Making Deep Learning Models Smaller, Faster, and Better | Jun 16, 2021 | Deep LearningInformation Retrieval | CodeCode Available | 1 | 5 |
| A Data-Driven Methodology for Considering Feasibility and Pairwise Likelihood in Deep Learning Based Guitar Tablature Transcription Systems | Apr 17, 2022 | Information RetrievalMusic Information Retrieval | CodeCode Available | 1 | 5 |
| RACE: Retrieval-Augmented Commit Message Generation | Mar 5, 2022 | Information RetrievalRetrieval | CodeCode Available | 1 | 5 |
| Efficient fine-tuning methodology of text embedding models for information retrieval: contrastive learning penalty (clp) | Dec 23, 2024 | Contrastive LearningInformation Retrieval | CodeCode Available | 1 | 5 |
| DTF-AT: Decoupled Time-Frequency Audio Transformer for Event Classification | Mar 24, 2024 | Audio ClassificationInformation Retrieval | CodeCode Available | 1 | 5 |
| Back to the Basics: A Quantitative Analysis of Statistical and Graph-Based Term Weighting Schemes for Keyword Extraction | Apr 16, 2021 | Information RetrievalKeyword Extraction | CodeCode Available | 1 | 5 |
| Automatic Jailbreaking of the Text-to-Image Generative AI Systems | May 26, 2024 | Image GenerationInformation Retrieval | CodeCode Available | 1 | 5 |
| Do Music Generation Models Encode Music Theory? | Oct 1, 2024 | Emotion RecognitionGenre classification | CodeCode Available | 1 | 5 |
| Dynamic Modality Interaction Modeling for Image-Text Retrieval | Jul 11, 2021 | cross-modal alignmentCross-Modal Retrieval | CodeCode Available | 1 | 5 |
| Efficiently predicting high resolution mass spectra with graph neural networks | Jan 26, 2023 | Graph ClassificationInformation Retrieval | CodeCode Available | 1 | 5 |
| Adaptive Machine Translation with Large Language Models | Jan 30, 2023 | DecoderDomain Adaptation | CodeCode Available | 1 | 5 |
| Distilling Knowledge from Reader to Retriever for Question Answering | Dec 8, 2020 | Information RetrievalKnowledge Distillation | CodeCode Available | 1 | 5 |
| Dive into Decision Trees and Forests: A Theoretical Demonstration | Jan 20, 2021 | Information RetrievalRecommendation Systems | CodeCode Available | 1 | 5 |
| Automatic Generation of Topic Labels | May 29, 2020 | DescriptiveInformation Retrieval | CodeCode Available | 1 | 5 |
| Distillation and Refinement of Reasoning in Small Language Models for Document Re-ranking | Apr 4, 2025 | Document RankingInformation Retrieval | CodeCode Available | 1 | 5 |
| Divide and Conquer: Text Semantic Matching with Disentangled Keywords and Intents | Mar 6, 2022 | Community Question AnsweringInformation Retrieval | CodeCode Available | 1 | 5 |
| Augmenting Document Representations for Dense Retrieval with Interpolation and Perturbation | Mar 15, 2022 | Data AugmentationInformation Retrieval | CodeCode Available | 1 | 5 |
| audioLIME: Listenable Explanations Using Source Separation | Aug 2, 2020 | Information RetrievalMusic Information Retrieval | CodeCode Available | 1 | 5 |
| DisastIR: A Comprehensive Information Retrieval Benchmark for Disaster Management | May 20, 2025 | Decision MakingInformation Retrieval | CodeCode Available | 1 | 5 |
| Audio Embeddings as Teachers for Music Classification | Jun 30, 2023 | ClassificationInformation Retrieval | CodeCode Available | 1 | 5 |
| Dimension Reduction for Efficient Dense Retrieval via Conditional Autoencoder | May 6, 2022 | Dimensionality ReductionInformation Retrieval | CodeCode Available | 1 | 5 |
| Discovering Mathematical Objects of Interest -- A Study of Mathematical Notations | Feb 7, 2020 | Information RetrievalMath | CodeCode Available | 1 | 5 |
| Injecting Domain Adaptation with Learning-to-hash for Effective and Efficient Zero-shot Dense Retrieval | May 23, 2022 | Ad-Hoc Information RetrievalCPU | CodeCode Available | 1 | 5 |
| Embed2Detect: Temporally Clustered Embedded Words for Event Detection in Social Media | Jun 10, 2020 | ClusteringEvent Detection | CodeCode Available | 1 | 5 |