How Much Do LLMs Hallucinate across Languages? On Multilingual Estimation of LLM Hallucination in the Wild Feb 18, 2025 Articles Hallucination
Code Code Available 0CityEQA: A Hierarchical LLM Agent on Embodied Question Answering Benchmark in City Space Feb 18, 2025 Embodied Question Answering Question Answering
Code Code Available 1Towards an automated workflow in materials science for combining multi-modal simulative and experimental information using data mining and large language models Feb 18, 2025 Information Retrieval Large Language Model
— Unverified 0Improving Clinical Question Answering with Multi-Task Learning: A Joint Approach for Answer Extraction and Medical Categorization Feb 18, 2025 Information Retrieval Medical Question Answering
— Unverified 0Re-Align: Aligning Vision Language Models via Retrieval-Augmented Direct Preference Optimization Feb 18, 2025 Image Retrieval Question Answering
Code Code Available 2Grounding LLM Reasoning with Knowledge Graphs Feb 18, 2025 Knowledge Graphs Question Answering
— Unverified 0SimpleVQA: Multimodal Factuality Evaluation for Multimodal Large Language Models Feb 18, 2025 Image Comprehension Question Answering
— Unverified 0LongFaith: Enhancing Long-Context Reasoning in LLMs with Faithful Synthetic Data Feb 18, 2025 Misinformation Question Answering
Code Code Available 0Beyond Profile: From Surface-Level Facts to Deep Persona Simulation in LLMs Feb 18, 2025 Generative Question Answering Multiple-choice
— Unverified 0Agentic Medical Knowledge Graphs Enhance Medical Question Answering: Bridging the Gap Between LLMs and Evolving Medical Knowledge Feb 18, 2025 Graph Generation Knowledge Graphs
— Unverified 0SearchRAG: Can Search Engines Be Helpful for LLM-based Medical Question Answering? Feb 18, 2025 Medical Question Answering Question Answering
— Unverified 0Beyond Seen Data: Improving KBQA Generalization Through Schema-Guided Logical Form Generation Feb 18, 2025 Entity Retrieval Form
Code Code Available 0LM Agents for Coordinating Multi-User Information Gathering Feb 17, 2025 Document Summarization Multi-Document Summarization
— Unverified 0Ontology-Guided Reverse Thinking Makes Large Language Models Stronger on Knowledge Graph Question Answering Feb 17, 2025 Graph Question Answering Question Answering
— Unverified 0Language Models Can See Better: Visual Contrastive Decoding For LLM Multimodal Reasoning Feb 17, 2025 In-Context Learning Multimodal Reasoning
Code Code Available 0RA-MTR: A Retrieval Augmented Multi-Task Reader based Approach for Inspirational Quote Extraction from Long Documents Feb 17, 2025 Articles Open-Domain Question Answering
Code Code Available 0MMXU: A Multi-Modal and Multi-X-ray Understanding Dataset for Disease Progression Feb 17, 2025 Diagnostic Question Answering
Code Code Available 1The geometry of BERT Feb 17, 2025 Question Answering Text Summarization
— Unverified 0Multi-Modal Retrieval Augmentation for Open-Ended and Knowledge-Intensive Video Question Answering Feb 17, 2025 Multiple-choice Question Answering
— Unverified 0RAG vs. GraphRAG: A Systematic Evaluation and Key Insights Feb 17, 2025 Knowledge Graphs Question Answering
— Unverified 0"See the World, Discover Knowledge": A Chinese Factuality Evaluation for Large Vision Language Models Feb 17, 2025 Object Recognition Question Answering
— Unverified 0RoseRAG: Robust Retrieval-augmented Generation with Small-scale LLMs via Margin-aware Preference Optimization Feb 16, 2025 Open-Domain Question Answering Question Answering
— Unverified 0Vendi-RAG: Adaptively Trading-Off Diversity And Quality Significantly Improves Retrieval Augmented Generation With LLMs Feb 16, 2025 Diversity Question Answering
— Unverified 0The Rotary Position Embedding May Cause Dimension Inefficiency in Attention Heads for Long-Distance Retrieval Feb 16, 2025 Position Question Answering
— Unverified 0The Mirage of Model Editing: Revisiting Evaluation in the Wild Feb 16, 2025 Model Editing Question Answering
Code Code Available 1QuOTE: Question-Oriented Text Embeddings Feb 16, 2025 Multi-hop Question Answering Question Answering
— Unverified 0Talk Structurally, Act Hierarchically: A Collaborative Framework for LLM Multi-Agent Systems Feb 16, 2025 Open-Domain Question Answering Question Answering
Code Code Available 2K-Edit: Language Model Editing with Contextual Knowledge Awareness Feb 15, 2025 Knowledge Graphs Language Modeling
— Unverified 0NitiBench: A Comprehensive Studies of LLM Frameworks Capabilities for Thai Legal Question Answering Feb 15, 2025 Chunking Information Retrieval
Code Code Available 0SVBench: A Benchmark with Temporal Multi-Turn Dialogues for Streaming Video Understanding Feb 15, 2025 Question Answering Streaming video understanding
Code Code Available 2Insect-Foundation: A Foundation Model and Large Multimodal Dataset for Vision-Language Insect Understanding Feb 14, 2025 General Knowledge Question Answering
— Unverified 0Evaluating the Meta- and Object-Level Reasoning of Large Language Models for Question Answering Feb 14, 2025 Mathematical Reasoning Object
— Unverified 0V2V-LLM: Vehicle-to-Vehicle Cooperative Autonomous Driving with Multi-Modal Large Language Models Feb 14, 2025 Autonomous Driving Autonomous Vehicles
— Unverified 0Post-training an LLM for RAG? Train on Self-Generated Demonstrations Feb 14, 2025 Attribute Question Answering
— Unverified 0Diversity Enhances an LLM's Performance in RAG and Long-context Task Feb 13, 2025 Diversity Question Answering
— Unverified 0KET-RAG: A Cost-Efficient Multi-Granular Indexing Framework for Graph-RAG Feb 13, 2025 Knowledge Graphs Large Language Model
Code Code Available 2SQuARE: Sequential Question Answering Reasoning Engine for Enhanced Chain-of-Thought in Large Language Models Feb 13, 2025 Question Answering RAG
Code Code Available 4LP-LM: No Hallucinations in Question Answering with Logic Programming Feb 13, 2025 Question Answering Semantic Parsing
Code Code Available 0Beyond English: The Impact of Prompt Translation Strategies across Languages and Tasks in Multilingual LLMs Feb 13, 2025 Abstractive Text Summarization named-entity-recognition
— Unverified 0SelfCite: Self-Supervised Alignment for Context Attribution in Large Language Models Feb 13, 2025 Long Form Question Answering Question Answering
Code Code Available 0Improving TCM Question Answering through Tree-Organized Self-Reflective Retrieval with LLMs Feb 13, 2025 Question Answering RAG
— Unverified 0Abduction of Domain Relationships from Data for VQA Feb 13, 2025 Question Answering Visual Question Answering
— Unverified 0EmoAssist: Emotional Assistant for Visual Impairment Community Feb 13, 2025 Emotional Intelligence Question Answering
— Unverified 0Visual Graph Question Answering with ASP and LLMs for Language Parsing Feb 13, 2025 Graph Question Answering Optical Character Recognition
— Unverified 0On Mechanistic Circuits for Extractive Question-Answering Feb 12, 2025 Extractive Question-Answering Language Modeling
— Unverified 0Vision-Language Models for Edge Networks: A Comprehensive Survey Feb 11, 2025 Autonomous Vehicles Image Captioning
— Unverified 0Elevating Legal LLM Responses: Harnessing Trainable Logical Structures and Semantic Knowledge with Legal Reasoning Feb 11, 2025 Hallucination In-Context Learning
Code Code Available 0ReTreever: Tree-based Coarse-to-Fine Representations for Retrieval Feb 11, 2025 Answer Generation Question Answering
— Unverified 0Making Language Models Robust Against Negation Feb 11, 2025 Natural Language Understanding Negation
— Unverified 0EgoTextVQA: Towards Egocentric Scene-Text Aware Video Question Answering Feb 11, 2025 Question Answering Video Question Answering
Code Code Available 1