| Developing an efficient corpus using Ensemble Data cleaning approach | Jun 2, 2024 | Information Retrieval | —Unverified | 0 |
| LongSkywork: A Training Recipe for Efficiently Extending Context Length in Large Language Models | Jun 2, 2024 | Continual PretrainingInformation Retrieval | —Unverified | 0 |
| Large Language Models for Relevance Judgment in Product Search | Jun 1, 2024 | AttributeInformation Retrieval | —Unverified | 0 |
| LLM-RankFusion: Mitigating Intrinsic Inconsistency in LLM-based Ranking | May 31, 2024 | In-Context LearningInformation Retrieval | CodeCode Available | 0 |
| Jina CLIP: Your CLIP Model Is Also Your Text Retriever | May 30, 2024 | Information RetrievalRetrieval | —Unverified | 0 |
| ATM: Adversarial Tuning Multi-agent System Makes a Robust Retrieval-Augmented Generator | May 28, 2024 | Information RetrievalLanguage Modelling | CodeCode Available | 0 |
| Conv-CoA: Improving Open-domain Question Answering in Large Language Models via Conversational Chain-of-Action | May 28, 2024 | Conversational Question AnsweringHallucination | —Unverified | 0 |
| NV-Embed: Improved Techniques for Training LLMs as Generalist Embedding Models | May 27, 2024 | Information RetrievalLanguage Modelling | —Unverified | 0 |
| Disentangling and Integrating Relational and Sensory Information in Transformer Architectures | May 26, 2024 | Information RetrievalLanguage Modeling | CodeCode Available | 0 |
| Retrieval-Augmented Mining of Temporal Logic Specifications from Data | May 23, 2024 | Bayesian OptimizationBinary Classification | —Unverified | 0 |
| Top-Down Partitioning for Efficient List-Wise Ranking | May 23, 2024 | Information RetrievalRe-Ranking | CodeCode Available | 0 |
| ASI++: Towards Distributionally Balanced End-to-End Generative Retrieval | May 23, 2024 | Information RetrievalQuantization | —Unverified | 0 |
| A Workbench for Autograding Retrieve/Generate Systems | May 21, 2024 | DiversityInformation Retrieval | CodeCode Available | 0 |
| Dataset and Benchmark for Urdu Natural Scenes Text Detection, Recognition and Visual Question Answering | May 21, 2024 | DiversityInformation Retrieval | CodeCode Available | 0 |
| Efficient and Interpretable Information Retrieval for Product Question Answering with Heterogeneous Data | May 21, 2024 | Contrastive LearningInformation Retrieval | CodeCode Available | 0 |
| DLAFormer: An End-to-End Transformer For Document Layout Analysis | May 20, 2024 | Document Layout AnalysisDocument Summarization | —Unverified | 0 |
| KG-RAG: Bridging the Gap Between Knowledge and Creativity | May 20, 2024 | Graph Question AnsweringInformation Retrieval | —Unverified | 0 |
| Effective Clustering on Large Attributed Bipartite Graphs | May 20, 2024 | AttributeClustering | CodeCode Available | 0 |
| Thesis: Document Summarization with applications to Keyword extraction and Image Retrieval | May 20, 2024 | ArticlesDocument Summarization | —Unverified | 0 |
| Unifying 3D Vision-Language Understanding via Promptable Queries | May 19, 2024 | 3D Question Answering (3D-QA)Decoder | —Unverified | 0 |
| Sociotechnical Implications of Generative Artificial Intelligence for Information Access | May 19, 2024 | Information RetrievalRetrieval | —Unverified | 0 |
| The First Swahili Language Scene Text Detection and Recognition Dataset | May 19, 2024 | Information RetrievalScene Text Detection | CodeCode Available | 0 |
| On the Convergence of No-Regret Dynamics in Information Retrieval Games with Proportional Ranking Functions | May 19, 2024 | Information RetrievalRetrieval | —Unverified | 0 |
| EnterpriseEM: Fine-tuned Embeddings for Enterprise Semantic Search | May 18, 2024 | Information RetrievalManagement | —Unverified | 0 |
| INDUS: Effective and Efficient Language Models for Scientific Applications | May 17, 2024 | Contrastive LearningInformation Retrieval | —Unverified | 0 |
| Words Blending Boxes. Obfuscating Queries in Information Retrieval using Differential Privacy | May 15, 2024 | Information RetrievalRetrieval | —Unverified | 0 |
| IM-RAG: Multi-Round Retrieval-Augmented Generation Through Learning Inner Monologues | May 15, 2024 | Information RetrievalQuestion Answering | —Unverified | 0 |
| Modeling User Preferences via Brain-Computer Interfacing | May 15, 2024 | Information Retrieval | —Unverified | 0 |
| Error-margin Analysis for Hidden Neuron Activation Labels | May 14, 2024 | Information RetrievalRetrieval | CodeCode Available | 0 |
| LGDE: Local Graph-based Dictionary Expansion | May 13, 2024 | Community DetectionInformation Retrieval | CodeCode Available | 0 |
| Identifying Key Terms in Prompts for Relevance Evaluation with GPT Models | May 11, 2024 | Information Retrieval | —Unverified | 0 |
| Event GDR: Event-Centric Generative Document Retrieval | May 11, 2024 | Information RetrievalRetrieval | —Unverified | 0 |
| Explaining Text Similarity in Transformer Models | May 10, 2024 | Information RetrievalRetrieval | CodeCode Available | 0 |
| A Survey on RAG Meeting LLMs: Towards Retrieval-Augmented Large Language Models | May 10, 2024 | Information RetrievalRAG | —Unverified | 0 |
| Redefining Information Retrieval of Structured Database via Large Language Models | May 9, 2024 | Information RetrievalQuestion Answering | —Unverified | 0 |
| ChatSOS: Vector Database Augmented Generative Question Answering Assistant in Safety Engineering | May 8, 2024 | Generative Question AnsweringInformation Retrieval | —Unverified | 0 |
| Full Stage Learning to Rank: A Unified Framework for Multi-Stage Systems | May 8, 2024 | Information RetrievalLearning-To-Rank | —Unverified | 0 |
| LLMs Can Patch Up Missing Relevance Judgments in Evaluation | May 8, 2024 | Information RetrievalRetrieval | —Unverified | 0 |
| Fine-tuning Pre-trained Named Entity Recognition Models For Indian Languages | May 8, 2024 | Information RetrievalMachine Translation | —Unverified | 0 |
| CleanGraph: Human-in-the-loop Knowledge Graph Refinement and Completion | May 7, 2024 | Information RetrievalKnowledge Graphs | CodeCode Available | 0 |
| MedPromptExtract (Medical Data Extraction Tool): Anonymization and Hi-fidelity Automated data extraction using NLP and prompt engineering | May 4, 2024 | Information RetrievalLarge Language Model | —Unverified | 0 |
| R4: Reinforced Retriever-Reorder-Responder for Retrieval-Augmented Large Language Models | May 4, 2024 | Graph AttentionHallucination | —Unverified | 0 |
| Beyond Relevance: Evaluate and Improve Retrievers on Perspective Awareness | May 4, 2024 | Information RetrievalRetrieval | CodeCode Available | 0 |
| IQLS: Framework for leveraging Metadata to enable Large Language Model based queries to complex, versatile Data | May 4, 2024 | Information RetrievalLanguage Modeling | —Unverified | 0 |
| Semi-Parametric Retrieval via Binary Bag-of-Tokens Index | May 3, 2024 | Information RetrievalOpen-Domain Question Answering | CodeCode Available | 0 |
| SUKHSANDESH: An Avatar Therapeutic Question Answering Platform for Sexual Education in Rural India | May 3, 2024 | Information RetrievalQuestion Answering | —Unverified | 0 |
| Comparative Analysis of Retrieval Systems in the Real World | May 3, 2024 | Information RetrievalQuestion Answering | —Unverified | 0 |
| Distillation for Multilingual Information Retrieval | May 2, 2024 | Information RetrievalRetrieval | —Unverified | 0 |
| Question Suggestion for Conversational Shopping Assistants Using Product Metadata | May 2, 2024 | FrictionIn-Context Learning | —Unverified | 0 |
| "In-Context Learning" or: How I learned to stop worrying and love "Applied Information Retrieval" | May 2, 2024 | In-Context LearningInformation Retrieval | —Unverified | 0 |