| Bayesian Attention Mechanism: A Probabilistic Framework for Positional Encoding and Context Length Extrapolation | May 28, 2025 | Information RetrievalRetrieval | CodeCode Available | 0 |
| Rethinking Chunk Size For Long-Document Retrieval: A Multi-Dataset Analysis | May 27, 2025 | ChunkingInformation Retrieval | CodeCode Available | 0 |
| Iterative Self-Incentivization Empowers Large Language Models as Agentic Searchers | May 26, 2025 | Information Retrieval | CodeCode Available | 3 |
| REARANK: Reasoning Re-ranking Agent via Reinforcement Learning | May 26, 2025 | Data AugmentationInformation Retrieval | CodeCode Available | 1 |
| It's High Time: A Survey of Temporal Information Retrieval and Question Answering | May 26, 2025 | ArticlesInformation Retrieval | —Unverified | 0 |
| Anveshana: A New Benchmark Dataset for Cross-Lingual Information Retrieval On English Queries and Sanskrit Documents | May 26, 2025 | Cross-Lingual Information RetrievalInformation Retrieval | —Unverified | 0 |
| Modality Curation: Building Universal Embeddings for Advanced Multimodal Information Retrieval | May 26, 2025 | Contrastive Learningcross-modal alignment | CodeCode Available | 1 |
| DocMMIR: A Framework for Document Multi-modal Information Retrieval | May 25, 2025 | ArticlesCross-Modal Retrieval | CodeCode Available | 0 |
| DeepResearchGym: A Free, Transparent, and Reproducible Evaluation Sandbox for Deep Research | May 25, 2025 | BenchmarkingInformation Retrieval | —Unverified | 0 |
| Likert or Not: LLM Absolute Relevance Judgments on Fine-Grained Ordinal Scales | May 25, 2025 | Information Retrieval | —Unverified | 0 |
| POQD: Performance-Oriented Query Decomposer for Multi-vector retrieval | May 25, 2025 | Information RetrievalRAG | CodeCode Available | 1 |
| Reinforcement Speculative Decoding for Fast Ranking | May 23, 2025 | Information RetrievalRecommendation Systems | —Unverified | 0 |
| Intent Classification on Low-Resource Languages with Query Similarity Search | May 23, 2025 | ClassificationInformation Retrieval | —Unverified | 0 |
| InstructPart: Task-Oriented Part Segmentation with Instruction Reasoning | May 23, 2025 | Autonomous DrivingInformation Retrieval | —Unverified | 0 |
| Search Wisely: Mitigating Sub-optimal Agentic Searches By Reducing Uncertainty | May 22, 2025 | Information RetrievalRAG | —Unverified | 0 |
| Learning Normal Patterns in Musical Loops | May 22, 2025 | Anomaly DetectionInformation Retrieval | —Unverified | 0 |
| Tools in the Loop: Quantifying Uncertainty of LLM Question Answering Systems That Use Tools | May 22, 2025 | Information RetrievalQuestion Answering | —Unverified | 0 |
| Don't "Overthink" Passage Reranking: Is Reasoning Truly Necessary? | May 22, 2025 | Information RetrievalPassage Reranking | —Unverified | 0 |
| Align-GRAG: Reasoning-Guided Dual Alignment for Graph Retrieval-Augmented Generation | May 22, 2025 | Common Sense ReasoningInformation Retrieval | —Unverified | 0 |
| Fixing Data That Hurts Performance: Cascading LLMs to Relabel Hard Negatives for Robust Information Retrieval | May 22, 2025 | Information RetrievalRetrieval | —Unverified | 0 |
| MiLQ: Benchmarking IR Models for Bilingual Web Search with Mixed Language Queries | May 22, 2025 | BenchmarkingInformation Retrieval | —Unverified | 0 |
| SimpleDeepSearcher: Deep Information Seeking via Web-Powered Reasoning Trajectory Synthesis | May 22, 2025 | DiversityInformation Retrieval | CodeCode Available | 4 |
| Layer-wise Investigation of Large-Scale Self-Supervised Music Representation Models | May 22, 2025 | Information RetrievalMusic Information Retrieval | —Unverified | 0 |
| Distance Adaptive Beam Search for Provably Accurate Graph-Based Nearest Neighbor Search | May 21, 2025 | Information Retrieval | CodeCode Available | 3 |
| MIRB: Mathematical Information Retrieval Benchmark | May 21, 2025 | Automated Theorem ProvingInformation Retrieval | CodeCode Available | 0 |
| DisastIR: A Comprehensive Information Retrieval Benchmark for Disaster Management | May 20, 2025 | Decision MakingInformation Retrieval | CodeCode Available | 1 |
| Interpretable Traces, Unexpected Outcomes: Investigating the Disconnect in Trace-Based Knowledge Distillation | May 20, 2025 | Information RetrievalKnowledge Distillation | —Unverified | 0 |
| Benchmarking the Myopic Trap: Positional Bias in Information Retrieval | May 20, 2025 | BenchmarkingInformation Retrieval | CodeCode Available | 5 |
| Bridge the Gap between Past and Future: Siamese Model Optimization for Context-Aware Document Ranking | May 20, 2025 | Document RankingInformation Retrieval | —Unverified | 0 |
| NExT-Search: Rebuilding User Feedback Ecosystem for Generative AI Search | May 20, 2025 | Answer GenerationInformation Retrieval | —Unverified | 0 |
| JIR-Arena: The First Benchmark Dataset for Just-in-time Information Recommendation | May 19, 2025 | Information RetrievalRetrieval | CodeCode Available | 0 |
| Unified Cross-modal Translation of Score Images, Symbolic Music, and Performance Audio | May 19, 2025 | Audio GenerationInformation Retrieval | —Unverified | 0 |
| UniMoCo: Unified Modality Completion for Robust Multi-Modal Embeddings | May 17, 2025 | Image to textInformation Retrieval | CodeCode Available | 0 |
| Towards Robust Evaluation of STEM Education: Leveraging MLLMs in Project-Based Learning | May 16, 2025 | HallucinationInformation Retrieval | —Unverified | 0 |
| Comparing Lexical and Semantic Vector Search Methods When Classifying Medical Documents | May 16, 2025 | Information Retrieval | —Unverified | 0 |
| CRISP: Clustering Multi-Vector Representations for Denoising and Pruning | May 16, 2025 | ClusteringDenoising | —Unverified | 0 |
| mmRAG: A Modular Benchmark for Retrieval-Augmented Generation over Text, Tables, and Knowledge Graphs | May 16, 2025 | Information RetrievalKnowledge Graphs | CodeCode Available | 1 |
| On Next-Token Prediction in LLMs: How End Goals Determine the Consistency of Decoding Algorithms | May 16, 2025 | Information RetrievalPrediction | —Unverified | 0 |
| ALOHA: Empowering Multilingual Agent for University Orientation with Hierarchical Retrieval | May 13, 2025 | Information RetrievalRetrieval | —Unverified | 0 |
| Lost in Transliteration: Bridging the Script Gap in Neural IR | May 13, 2025 | Information RetrievalRetrieval | —Unverified | 0 |
| TRAIL: Trace Reasoning and Agentic Issue Localization | May 13, 2025 | Information Retrieval | —Unverified | 0 |
| A Mamba-based Network for Semi-supervised Singing Melody Extraction Using Confidence Binary Regularization | May 13, 2025 | DecoderInformation Retrieval | CodeCode Available | 0 |
| Hakim: Farsi Text Embedding Model | May 13, 2025 | Information RetrievalLanguage Modeling | —Unverified | 0 |
| Evaluating LLM Metrics Through Real-World Capabilities | May 13, 2025 | Code GenerationInformation Retrieval | —Unverified | 0 |
| MedEIR: A Specialized Medical Embedding Model for Enhanced Information Retrieval | May 12, 2025 | Information RetrievalRAG | —Unverified | 0 |
| ReCDAP: Relation-Based Conditional Diffusion with Attention Pooling for Few-Shot Knowledge Graph Completion | May 12, 2025 | Information RetrievalKnowledge Graph Completion | CodeCode Available | 1 |
| QUPID: Quantified Understanding for Enhanced Performance, Insights, and Decisions in Korean Search Engines | May 12, 2025 | Computational EfficiencyDiversity | —Unverified | 0 |
| Learning Music Audio Representations With Limited Data | May 9, 2025 | Information RetrievalMusic Information Retrieval | CodeCode Available | 0 |
| Artifact Sharing for Information Retrieval Research | May 8, 2025 | Information RetrievalRetrieval | —Unverified | 0 |
| QBR: A Question-Bank-Based Approach to Fine-Grained Legal Knowledge Retrieval for the General Public | May 8, 2025 | Information RetrievalRetrieval | —Unverified | 0 |