Improving Consistency in Large Language Models through Chain of Guidance Feb 21, 2025 Question Answering
Code Code Available 0Argument-Based Comparative Question Answering Evaluation Benchmark Feb 20, 2025 Question Answering
— Unverified 0MedHallu: A Comprehensive Benchmark for Detecting Medical Hallucinations in Large Language Models Feb 20, 2025 Decision Making Hallucination
— Unverified 0Effects of Prompt Length on Domain-specific Tasks for Large Language Models Feb 20, 2025 Machine Translation Prompt Engineering
— Unverified 0Benchmarking Multimodal RAG through a Chart-based Document Question-Answering Generation Framework Feb 20, 2025 Benchmarking Question Answering
Code Code Available 0Is Relevance Propagated from Retriever to Generator in RAG? Feb 20, 2025 Large Language Model Question Answering
— Unverified 0NLP-AKG: Few-Shot Construction of NLP Academic Knowledge Graph Based on LLM Feb 20, 2025 graph construction Question Answering
— Unverified 0How Much Knowledge Can You Pack into a LoRA Adapter without Harming LLM? Feb 20, 2025 Question Answering
Code Code Available 0EpMAN: Episodic Memory AttentioN for Generalizing to Longer Contexts Feb 20, 2025 16k Decoder
— Unverified 0Mitigating Lost-in-Retrieval Problems in Retrieval Augmented Multi-Hop Question Answering Feb 20, 2025 Answer Generation Multi-hop Question Answering
— Unverified 0Exploring Advanced Techniques for Visual Question Answering: A Comprehensive Comparison Feb 20, 2025 Diversity Language Modeling
— Unverified 0On the Influence of Context Size and Model Choice in Retrieval-Augmented Generation Systems Feb 20, 2025 Long Form Question Answering Question Answering
Code Code Available 0Triangulating LLM Progress through Benchmarks, Games, and Cognitive Tests Feb 20, 2025 Logical Reasoning MMLU
— Unverified 0MCTS-KBQA: Monte Carlo Tree Search for Knowledge Base Question Answering Feb 19, 2025 Decision Making Knowledge Base Question Answering
— Unverified 0Navigating Semantic Relations: Challenges for Language Models in Abstract Common-Sense Reasoning Feb 19, 2025 Common Sense Reasoning Mathematical Problem-Solving
— Unverified 0RGAR: Recurrence Generation-augmented Retrieval for Factual-aware Medical Question Answering Feb 19, 2025 Decision Making Language Modeling
— Unverified 0Which of These Best Describes Multiple Choice Evaluation with LLMs? A) Forced B) Flawed C) Fixable D) All of the Above Feb 19, 2025 All Multiple-choice
— Unverified 0Is That Your Final Answer? Test-Time Scaling Improves Selective Question Answering Feb 19, 2025 Question Answering
Code Code Available 0DH-RAG: A Dynamic Historical Context-Powered Retrieval-Augmented Generation Method for Multi-Turn Dialogue Feb 19, 2025 Question Answering RAG
— Unverified 0REFIND: Retrieval-Augmented Factuality Hallucination Detection in Large Language Models Feb 19, 2025 Hallucination Language Modeling
— Unverified 0Sce2DriveX: A Generalized MLLM Framework for Scene-to-Drive Learning Feb 19, 2025 Autonomous Driving Bench2Drive
— Unverified 0PRIV-QA: Privacy-Preserving Question Answering for Cloud Large Language Models Feb 19, 2025 Open-Ended Question Answering Privacy Preserving
Code Code Available 0MuDAF: Long-Context Multi-Document Attention Focusing through Contrastive Learning on Attention Heads Feb 19, 2025 Contrastive Learning Question Answering
Code Code Available 0Quantifying Memorization and Retriever Performance in Retrieval-Augmented Vision-Language Models Feb 19, 2025 Memorization Question Answering
— Unverified 0PitVQA++: Vector Matrix-Low-Rank Adaptation for Open-Ended Visual Question Answering in Pituitary Surgery Feb 19, 2025 Question Answering Visual Question Answering
Code Code Available 0TabSD: Large Free-Form Table Question Answering with SQL-Based Table Decomposition Feb 19, 2025 Answer Generation Form
— Unverified 0Towards Adaptive Memory-Based Optimization for Enhanced Retrieval-Augmented Generation Feb 19, 2025 Question Answering RAG
— Unverified 0Agentic Medical Knowledge Graphs Enhance Medical Question Answering: Bridging the Gap Between LLMs and Evolving Medical Knowledge Feb 18, 2025 Graph Generation Knowledge Graphs
— Unverified 0Towards an automated workflow in materials science for combining multi-modal simulative and experimental information using data mining and large language models Feb 18, 2025 Information Retrieval Large Language Model
— Unverified 0SimpleVQA: Multimodal Factuality Evaluation for Multimodal Large Language Models Feb 18, 2025 Image Comprehension Question Answering
— Unverified 0Grounding LLM Reasoning with Knowledge Graphs Feb 18, 2025 Knowledge Graphs Question Answering
— Unverified 0Beyond Seen Data: Improving KBQA Generalization Through Schema-Guided Logical Form Generation Feb 18, 2025 Entity Retrieval Form
Code Code Available 0Beyond Profile: From Surface-Level Facts to Deep Persona Simulation in LLMs Feb 18, 2025 Generative Question Answering Multiple-choice
— Unverified 0SearchRAG: Can Search Engines Be Helpful for LLM-based Medical Question Answering? Feb 18, 2025 Medical Question Answering Question Answering
— Unverified 0Multilingual European Language Models: Benchmarking Approaches and Challenges Feb 18, 2025 Benchmarking Question Answering
— Unverified 0Improving Clinical Question Answering with Multi-Task Learning: A Joint Approach for Answer Extraction and Medical Categorization Feb 18, 2025 Information Retrieval Medical Question Answering
— Unverified 0LongFaith: Enhancing Long-Context Reasoning in LLMs with Faithful Synthetic Data Feb 18, 2025 Misinformation Question Answering
Code Code Available 0How Much Do LLMs Hallucinate across Languages? On Multilingual Estimation of LLM Hallucination in the Wild Feb 18, 2025 Articles Hallucination
Code Code Available 0Savaal: Scalable Concept-Driven Question Generation to Enhance Human Learning Feb 18, 2025 Question Answering Question Generation
— Unverified 0The geometry of BERT Feb 17, 2025 Question Answering Text Summarization
— Unverified 0Ontology-Guided Reverse Thinking Makes Large Language Models Stronger on Knowledge Graph Question Answering Feb 17, 2025 Graph Question Answering Question Answering
— Unverified 0RAG vs. GraphRAG: A Systematic Evaluation and Key Insights Feb 17, 2025 Knowledge Graphs Question Answering
— Unverified 0RA-MTR: A Retrieval Augmented Multi-Task Reader based Approach for Inspirational Quote Extraction from Long Documents Feb 17, 2025 Articles Open-Domain Question Answering
Code Code Available 0Multi-Modal Retrieval Augmentation for Open-Ended and Knowledge-Intensive Video Question Answering Feb 17, 2025 Multiple-choice Question Answering
— Unverified 0Language Models Can See Better: Visual Contrastive Decoding For LLM Multimodal Reasoning Feb 17, 2025 In-Context Learning Multimodal Reasoning
Code Code Available 0"See the World, Discover Knowledge": A Chinese Factuality Evaluation for Large Vision Language Models Feb 17, 2025 Object Recognition Question Answering
— Unverified 0LM Agents for Coordinating Multi-User Information Gathering Feb 17, 2025 Document Summarization Multi-Document Summarization
— Unverified 0The Rotary Position Embedding May Cause Dimension Inefficiency in Attention Heads for Long-Distance Retrieval Feb 16, 2025 Position Question Answering
— Unverified 0Vendi-RAG: Adaptively Trading-Off Diversity And Quality Significantly Improves Retrieval Augmented Generation With LLMs Feb 16, 2025 Diversity Question Answering
— Unverified 0QuOTE: Question-Oriented Text Embeddings Feb 16, 2025 Multi-hop Question Answering Question Answering
— Unverified 0