| ClimRetrieve: A Benchmarking Dataset for Information Retrieval from Corporate Climate Disclosures | Jun 14, 2024 | Answer GenerationBenchmarking | CodeCode Available | 0 |
| HIRO: Hierarchical Information Retrieval Optimization | Jun 14, 2024 | Information RetrievalRAG | CodeCode Available | 0 |
| Hyperdimensional Quantum Factorization | Jun 13, 2024 | Information RetrievalRetrieval | —Unverified | 0 |
| Enhancing Knowledge Retrieval with In-Context Learning and Semantic Search through Generative AI | Jun 13, 2024 | In-Context LearningInformation Retrieval | —Unverified | 0 |
| Robust Information Retrieval | Jun 13, 2024 | Adversarial RobustnessInformation Retrieval | —Unverified | 0 |
| Prediction of the Realisation of an Information Need: An EEG Study | Jun 12, 2024 | EEGInformation Retrieval | —Unverified | 0 |
| Text Information Retrieval in Tetun: A Preliminary Study | Jun 11, 2024 | Information RetrievalRetrieval | —Unverified | 0 |
| Evaluating the Retrieval Component in LLM-Based Question Answering Systems | Jun 10, 2024 | Information RetrievalQuestion Answering | —Unverified | 0 |
| Synthetic Query Generation using Large Language Models for Virtual Assistants | Jun 10, 2024 | Information Retrievalspeech-recognition | —Unverified | 0 |
| Scaling the Vocabulary of Non-autoregressive Models for Efficient Generative Retrieval | Jun 10, 2024 | Inference OptimizationInformation Retrieval | —Unverified | 0 |
| Transforming Wearable Data into Health Insights using Large Language Model Agents | Jun 10, 2024 | Code GenerationInformation Retrieval | —Unverified | 0 |
| MOSA: Music Motion with Semantic Annotation Dataset for Cross-Modal Music Processing | Jun 10, 2024 | Information RetrievalMusic Information Retrieval | CodeCode Available | 1 |
| MrRank: Improving Question Answering Retrieval System through Multi-Result Ranking Model | Jun 9, 2024 | Information RetrievalLearning-To-Rank | —Unverified | 0 |
| The Factorization Curse: Which Tokens You Predict Underlie the Reversal Curse and More | Jun 7, 2024 | Information RetrievalRetrieval | —Unverified | 0 |
| Corpus Poisoning via Approximate Greedy Gradient Descent | Jun 7, 2024 | Information RetrievalRAG | CodeCode Available | 0 |
| ComplexTempQA: A Large-Scale Dataset for Complex Temporal Question Answering | Jun 7, 2024 | Information RetrievalQuestion Answering | CodeCode Available | 1 |
| PQPP: A Joint Benchmark for Text-to-Image Prompt and Query Performance Prediction | Jun 7, 2024 | Image GenerationImage Retrieval | CodeCode Available | 0 |
| Reducing the climate impact of data portals: a case study | Jun 6, 2024 | Information RetrievalRetrieval | —Unverified | 0 |
| Measuring and Addressing Indexical Bias in Information Retrieval | Jun 6, 2024 | 4kFairness | CodeCode Available | 0 |
| Meta-learning for Positive-unlabeled Classification | Jun 6, 2024 | ClassificationDensity Ratio Estimation | —Unverified | 0 |
| The Challenges of Evaluating LLM Applications: An Analysis of Automated, Human, and LLM-Based Approaches | Jun 5, 2024 | ChatbotInformation Retrieval | —Unverified | 0 |
| MidiCaps: A large-scale MIDI dataset with text captions | Jun 4, 2024 | Information RetrievalMusic Information Retrieval | CodeCode Available | 2 |
| Explainable Deep Learning Analysis for Raga Identification in Indian Art Music | Jun 4, 2024 | Information RetrievalMusic Information Retrieval | CodeCode Available | 0 |
| SoccerRAG: Multimodal Soccer Information Retrieval via Natural Queries | Jun 3, 2024 | Information RetrievalNatural Language Queries | CodeCode Available | 0 |
| A Survey of Generative Information Retrieval | Jun 3, 2024 | Information RetrievalMulti-Task Learning | CodeCode Available | 0 |
| Demo: Soccer Information Retrieval via Natural Queries using SoccerRAG | Jun 3, 2024 | ChatbotInformation Retrieval | CodeCode Available | 0 |
| LongSkywork: A Training Recipe for Efficiently Extending Context Length in Large Language Models | Jun 2, 2024 | Continual PretrainingInformation Retrieval | —Unverified | 0 |
| Developing an efficient corpus using Ensemble Data cleaning approach | Jun 2, 2024 | Information Retrieval | —Unverified | 0 |
| COS-Mix: Cosine Similarity and Distance Fusion for Improved Information Retrieval | Jun 2, 2024 | Information RetrievalRAG | CodeCode Available | 4 |
| Large Language Models for Relevance Judgment in Product Search | Jun 1, 2024 | AttributeInformation Retrieval | —Unverified | 0 |
| LLM-RankFusion: Mitigating Intrinsic Inconsistency in LLM-based Ranking | May 31, 2024 | In-Context LearningInformation Retrieval | CodeCode Available | 0 |
| Jina CLIP: Your CLIP Model Is Also Your Text Retriever | May 30, 2024 | Information RetrievalRetrieval | —Unverified | 0 |
| Toward Conversational Agents with Context and Time Sensitive Long-term Memory | May 29, 2024 | FormInformation Retrieval | CodeCode Available | 1 |
| MUVERA: Multi-Vector Retrieval via Fixed Dimensional Encodings | May 29, 2024 | Information RetrievalRetrieval | CodeCode Available | 1 |
| Conv-CoA: Improving Open-domain Question Answering in Large Language Models via Conversational Chain-of-Action | May 28, 2024 | Conversational Question AnsweringHallucination | —Unverified | 0 |
| ATM: Adversarial Tuning Multi-agent System Makes a Robust Retrieval-Augmented Generator | May 28, 2024 | Information RetrievalLanguage Modelling | CodeCode Available | 0 |
| NV-Embed: Improved Techniques for Training LLMs as Generalist Embedding Models | May 27, 2024 | Information RetrievalLanguage Modelling | —Unverified | 0 |
| KSW: Khmer Stop Word based Dictionary for Keyword Extraction | May 27, 2024 | Information RetrievalKeyword Extraction | CodeCode Available | 1 |
| Cocktail: A Comprehensive Information Retrieval Benchmark with LLM-Generated Documents Integration | May 26, 2024 | Information RetrievalRetrieval | CodeCode Available | 1 |
| Disentangling and Integrating Relational and Sensory Information in Transformer Architectures | May 26, 2024 | Information RetrievalLanguage Modeling | CodeCode Available | 0 |
| Automatic Jailbreaking of the Text-to-Image Generative AI Systems | May 26, 2024 | Image GenerationInformation Retrieval | CodeCode Available | 1 |
| Shopping Queries Image Dataset (SQID): An Image-Enriched ESCI Dataset for Exploring Multimodal Learning in Product Search | May 24, 2024 | Information RetrievalRetrieval | CodeCode Available | 1 |
| ASI++: Towards Distributionally Balanced End-to-End Generative Retrieval | May 23, 2024 | Information RetrievalQuantization | —Unverified | 0 |
| Top-Down Partitioning for Efficient List-Wise Ranking | May 23, 2024 | Information RetrievalRe-Ranking | CodeCode Available | 0 |
| Retrieval-Augmented Mining of Temporal Logic Specifications from Data | May 23, 2024 | Bayesian OptimizationBinary Classification | —Unverified | 0 |
| A Workbench for Autograding Retrieve/Generate Systems | May 21, 2024 | DiversityInformation Retrieval | CodeCode Available | 0 |
| Efficient and Interpretable Information Retrieval for Product Question Answering with Heterogeneous Data | May 21, 2024 | Contrastive LearningInformation Retrieval | CodeCode Available | 0 |
| Dataset and Benchmark for Urdu Natural Scenes Text Detection, Recognition and Visual Question Answering | May 21, 2024 | DiversityInformation Retrieval | CodeCode Available | 0 |
| Thesis: Document Summarization with applications to Keyword extraction and Image Retrieval | May 20, 2024 | ArticlesDocument Summarization | —Unverified | 0 |
| CaseGNN++: Graph Contrastive Learning for Legal Case Retrieval with Graph Augmentation | May 20, 2024 | Contrastive LearningGraph Attention | CodeCode Available | 1 |