| ChatQA 2: Bridging the Gap to Proprietary LLMs in Long Context and RAG Capabilities | Jul 19, 2024 | 4k, 8k | Unverified | 0 |
| AuditNet: A Conversational AI-based Security Assistant [DEMO] | Jul 19, 2024 | Retrieval, Retrieval-augmented Generation | Unverified | 0 |
| RAG-QA Arena: Evaluating Domain Robustness for Long-form Retrieval Augmented Question Answering | Jul 19, 2024 | Domain Generalization, Form | Code Available | 2 |
| Unipa-GPT: Large Language Models for university-oriented QA in Italian | Jul 19, 2024 | Chatbot, Information Retrieval | Code Available | 0 |
| PRAGyan -- Connecting the Dots in Tweets | Jul 18, 2024 | Decision Making, Knowledge Graphs | Unverified | 0 |
| Retrieve, Summarize, Plan: Advancing Multi-hop Question Answering with an Iterative Approach | Jul 18, 2024 | Multi-hop Question Answering, Question Answering | Unverified | 0 |
| Black-Box Opinion Manipulation Attacks to Retrieval-Augmented Generation of Large Language Models | Jul 18, 2024 | Decision Making, Hallucination | Unverified | 0 |
| Visual Haystacks: A Vision-Centric Needle-In-A-Haystack Benchmark | Jul 18, 2024 | GPU, Image Retrieval | Code Available | 1 |
| Retrieval-Augmented Generation for Natural Language Processing: A Survey | Jul 18, 2024 | Hallucination, RAG | Unverified | 0 |
| Can Open-Source LLMs Compete with Commercial Models? Exploring the Few-Shot Performance of Current GPT Models in Biomedical Tasks | Jul 18, 2024 | In-Context Learning, RAG | Code Available | 0 |