| Lynx: An Open Source Hallucination Evaluation Model | Jul 11, 2024 | HallucinationHallucination Evaluation | —Unverified | 0 |
| Speculative RAG: Enhancing Retrieval Augmented Generation through Drafting | Jul 11, 2024 | ARCRAG | —Unverified | 0 |
| Investigating LLMs as Voting Assistants via Contextual Augmentation: A Case Study on the European Parliament Elections 2024 | Jul 11, 2024 | Natural Language UnderstandingRAG | —Unverified | 0 |
| Beyond Benchmarks: Evaluating Embedding Model Similarity for Retrieval Augmented Generation Systems | Jul 11, 2024 | Information RetrievalModel Selection | CodeCode Available | 0 |
| Examining Long-Context Large Language Models for Environmental Review Document Comprehension | Jul 10, 2024 | Question AnsweringRAG | —Unverified | 0 |
| FACTS About Building Retrieval Augmented Generation-based Chatbots | Jul 10, 2024 | RAGReranking | —Unverified | 0 |
| A Simple Architecture for Enterprise Large Language Model Applications based on Role based security and Clearance Levels using Retrieval-Augmented Generation or Mixture of Experts | Jul 9, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Faux Polyglot: A Study on Information Disparity in Multilingual Large Language Models | Jul 7, 2024 | Answer GenerationInformation Retrieval | —Unverified | 0 |
| RAMO: Retrieval-Augmented Generation for Enhancing MOOCs Recommendations | Jul 6, 2024 | RAGRecommendation Systems | —Unverified | 0 |
| Are LLMs Correctly Integrated into Software Systems? | Jul 6, 2024 | ManagementRAG | —Unverified | 0 |
| GPT vs RETRO: Exploring the Intersection of Retrieval and Parameter-Efficient Fine-Tuning | Jul 5, 2024 | parameter-efficient fine-tuningRAG | —Unverified | 0 |
| EventChat: Implementation and user-centric evaluation of a large language model-driven conversational recommender system for exploring leisure events in an SME context | Jul 5, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Rethinking Visual Prompting for Multimodal Large Language Models with External Knowledge | Jul 5, 2024 | Instance SegmentationOptical Character Recognition (OCR) | —Unverified | 0 |
| Automated C/C++ Program Repair for High-Level Synthesis via Large Language Models | Jul 4, 2024 | C++ codeCode Generation | —Unverified | 0 |
| NutriBench: A Dataset for Evaluating Large Language Models on Nutrition Estimation from Meal Descriptions | Jul 4, 2024 | NutritionRetrieval-augmented Generation | —Unverified | 0 |
| CaseGPT: a case reasoning framework based on language models and retrieval-augmented generation | Jul 4, 2024 | RAGRetrieval | —Unverified | 0 |
| DSLR: Document Refinement with Sentence-Level Re-ranking and Reconstruction to Enhance Retrieval-Augmented Generation | Jul 4, 2024 | RAGRe-Ranking | —Unverified | 0 |
| Meta-prompting Optimized Retrieval-augmented Generation | Jul 4, 2024 | Multi-hop Question AnsweringQuestion Answering | —Unverified | 0 |
| Improving Retrieval-augmented Text-to-SQL with AST-based Ranking and Schema Pruning | Jul 3, 2024 | In-Context LearningRetrieval | —Unverified | 0 |
| A Comparative Study of DSL Code Generation: Fine-Tuning vs. Optimized Retrieval Augmentation | Jul 3, 2024 | Code GenerationHallucination | —Unverified | 0 |
| RankRAG: Unifying Context Ranking with Retrieval-Augmented Generation in LLMs | Jul 2, 2024 | Answer GenerationQuestion Answering | —Unverified | 0 |
| Synthetic Multimodal Question Generation | Jul 2, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Why does in-context learning fail sometimes? Evaluating in-context learning on open and closed questions | Jul 2, 2024 | In-Context LearningRAG | CodeCode Available | 0 |
| Learning to Explore and Select for Coverage-Conditioned Retrieval-Augmented Generation | Jul 1, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Face4RAG: Factual Consistency Evaluation for Retrieval Augmented Generation in Chinese | Jul 1, 2024 | RAGRetrieval | —Unverified | 0 |
| Optimization of Retrieval-Augmented Generation Context with Outlier Detection | Jul 1, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Exploring Advanced Large Language Models with LLMsuite | Jul 1, 2024 | RAGRetrieval | —Unverified | 0 |
| Ground Every Sentence: Improving Retrieval-Augmented LLMs with Interleaved Reference-Claim Generation | Jul 1, 2024 | Fact CheckingLong Form Question Answering | —Unverified | 0 |
| Hybrid RAG-empowered Multi-modal LLM for Secure Data Management in Internet of Medical Things: A Diffusion-based Contract Approach | Jul 1, 2024 | Deep Reinforcement LearningManagement | —Unverified | 0 |
| SecGenAI: Enhancing Security of Cloud-based Generative AI Applications within Australian Critical Technologies of National Interest | Jul 1, 2024 | EthicsRAG | —Unverified | 0 |
| Memory^3: Language Modeling with Explicit Memory | Jul 1, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Large Language Models Struggle in Token-Level Clinical Named Entity Recognition | Jun 30, 2024 | named-entity-recognitionNamed Entity Recognition | CodeCode Available | 0 |
| Answering real-world clinical questions using large language model based systems | Jun 29, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| BioKGBench: A Knowledge Graph Checking Benchmark of AI Agent for Biomedical Science | Jun 29, 2024 | AI AgentClaim Verification | CodeCode Available | 0 |
| SK-VQA: Synthetic Knowledge Generation at Scale for Training Context-Augmented Multimodal LLMs | Jun 28, 2024 | RAGRetrieval-augmented Generation | —Unverified | 0 |
| LLM4DESIGN: An Automated Multi-Modal System for Architectural and Environmental Design | Jun 28, 2024 | RAGRetrieval | —Unverified | 0 |
| RAVEN: Multitask Retrieval Augmented Vision-Language Learning | Jun 27, 2024 | Image CaptioningRAG | —Unverified | 0 |
| Development and Evaluation of a Retrieval-Augmented Generation Tool for Creating SAPPhIRE Models of Artificial Systems | Jun 27, 2024 | RAGRetrieval | —Unverified | 0 |
| Generating Is Believing: Membership Inference Attacks against Retrieval-Augmented Generation | Jun 27, 2024 | RAGRetrieval | —Unverified | 0 |
| Which Neurons Matter in IR? Applying Integrated Gradients-based Methods to Understand Cross-Encoders | Jun 27, 2024 | Information RetrievalRAG | —Unverified | 0 |
| AutoRAG-HP: Automatic Online Hyper-Parameter Tuning for Retrieval-Augmented Generation | Jun 27, 2024 | AutoMLEfficient Exploration | —Unverified | 0 |
| AI-native Memory: A Pathway from LLMs Towards AGI | Jun 26, 2024 | RAGRetrieval-augmented Generation | —Unverified | 0 |
| Multi-step Inference over Unstructured Data | Jun 26, 2024 | Decision MakingRAG | —Unverified | 0 |
| Evaluating Quality of Answers for Retrieval-Augmented Generation: A Strong LLM Is All You Need | Jun 26, 2024 | AllRAG | —Unverified | 0 |
| Poisoned LangChain: Jailbreak LLMs by LangChain | Jun 26, 2024 | RAGRetrieval | —Unverified | 0 |
| Assessing "Implicit" Retrieval Robustness of Large Language Models | Jun 26, 2024 | RetrievalRetrieval-augmented Generation | —Unverified | 0 |
| Software Model Evolution with Large Language Models: Experiments on Simulated, Public, and Industrial Datasets | Jun 25, 2024 | RetrievalRetrieval-augmented Generation | CodeCode Available | 0 |
| RAGBench: Explainable Benchmark for Retrieval-Augmented Generation Systems | Jun 25, 2024 | BenchmarkingRAG | —Unverified | 0 |
| Context-augmented Retrieval: A Novel Framework for Fast Information Retrieval based Response Generation using Large Language Model | Jun 24, 2024 | Answer GenerationInformation Retrieval | —Unverified | 0 |
| On the Role of Long-tail Knowledge in Retrieval Augmented Large Language Models | Jun 24, 2024 | RAGRetrieval | —Unverified | 0 |