| Micro-Act: Mitigate Knowledge Conflict in Question Answering via Actionable Self-Reasoning | Jun 5, 2025 | Question Answering, RAG | Code Available | 0 |
| ECoRAG: Evidentiality-guided Compression for Long Context RAG | Jun 5, 2025 | Answer Generation, Open-Domain Question Answering | Code Available | 1 |
| Mathematical Reasoning for Unmanned Aerial Vehicles: A RAG-Based Approach for Complex Arithmetic Reasoning | Jun 5, 2025 | Arithmetic Reasoning, Math | Code Available | 0 |
| From Standalone LLMs to Integrated Intelligence: A Survey of Compound AI Systems | Jun 5, 2025 | Benchmarking, RAG | Unverified | 0 |
| On Automating Security Policies with Contemporary LLMs | Jun 5, 2025 | In-Context Learning, RAG | Unverified | 0 |
| GEM: Empowering LLM for both Embedding Generation and Language Understanding | Jun 4, 2025 | Decoder, Large Language Model | Unverified | 0 |
| Matter-of-Fact: A Benchmark for Verifying the Feasibility of Literature-Supported Claims in Materials Science | Jun 4, 2025 | Articles, Code Generation | Code Available | 0 |
| Graph Counselor: Adaptive Graph Exploration via Multi-Agent Synergy to Enhance LLM Reasoning | Jun 4, 2025 | Retrieval-augmented Generation | Code Available | 1 |
| Through the Stealth Lens: Rethinking Attacks and Defenses in RAG | Jun 4, 2025 | RAG, Retrieval-augmented Generation | Code Available | 0 |
| Magic Mushroom: A Customizable Benchmark for Fine-grained Analysis of Retrieval Noise Erosion in RAG Systems | Jun 4, 2025 | Denoising, Hallucination | Unverified | 0 |