Uncovering Bias in Large Vision-Language Models at Scale with Counterfactuals May 30, 2024 counterfactual Question Answering
— Unverified 0ANAH: Analytical Annotation of Hallucinations in Large Language Models May 30, 2024 Generative Question Answering Hallucination
Code Code Available 2MathChat: Benchmarking Mathematical Reasoning and Instruction Following in Multi-Turn Interactions May 29, 2024 Benchmarking Dialogue Understanding
Code Code Available 1Reverse Image Retrieval Cues Parametric Memory in Multimodal LLMs May 29, 2024 Image Retrieval Question Answering
Code Code Available 1MASSIVE Multilingual Abstract Meaning Representation: A Dataset and Baselines for Hallucination Detection May 29, 2024 Abstract Meaning Representation Hallucination
— Unverified 0PathReasoner: Modeling Reasoning Path with Equivalent Extension for Logical Question Answering May 29, 2024 Diversity Logical Reasoning
— Unverified 0MetaToken: Detecting Hallucination in Image Descriptions by Meta Classification May 29, 2024 Hallucination Image Captioning
— Unverified 0Evaluating Zero-Shot GPT-4V Performance on 3D Visual Question Answering Benchmarks May 29, 2024 Question Answering Visual Question Answering
— Unverified 0Two-Layer Retrieval-Augmented Generation Framework for Low-Resource Medical Question Answering Using Reddit Data: Proof-of-Concept Study May 29, 2024 Answer Generation Hallucination
— Unverified 0A Multi-Source Retrieval Question Answering Framework Based on RAG May 29, 2024 Question Answering RAG
— Unverified 0Data-augmented phrase-level alignment for mitigating object hallucination May 28, 2024 Data Augmentation Hallucination
— Unverified 0RealitySummary: Exploring On-Demand Mixed Reality Text Summarization and Question Answering using Large Language Models May 28, 2024 Document Enhancement Mixed Reality
— Unverified 0Peering into the Mind of Language Models: An Approach for Attribution in Contextual Question Answering May 28, 2024 Question Answering
Code Code Available 0Bridging the Gap: Dynamic Learning Strategies for Improving Multilingual Performance in LLMs May 28, 2024 Question Answering RAG
— Unverified 0ATM: Adversarial Tuning Multi-agent System Makes a Robust Retrieval-Augmented Generator May 28, 2024 Information Retrieval Language Modelling
Code Code Available 0Conv-CoA: Improving Open-domain Question Answering in Large Language Models via Conversational Chain-of-Action May 28, 2024 Conversational Question Answering Hallucination
— Unverified 0Aligning LLMs through Multi-perspective User Preference Ranking-based Feedback for Programming Question Answering May 27, 2024 Community Question Answering In-Context Learning
— Unverified 0Empowering Large Language Models to Set up a Knowledge Retrieval Indexer via Self-Learning May 27, 2024 Question Answering RAG
Code Code Available 2Can Large Language Models Faithfully Express Their Intrinsic Uncertainty in Words? May 27, 2024 Question Answering
— Unverified 0Hawk: Learning to Understand Open-World Video Anomalies May 27, 2024 Anomaly Detection Question Answering
Code Code Available 3Reason3D: Searching and Reasoning 3D Segmentation via Large Language Model May 27, 2024 Decoder Language Modeling
Code Code Available 2Do Vision-Language Transformers Exhibit Visual Commonsense? An Empirical Study of VCR May 27, 2024 Question Answering TAG
— Unverified 0THREAD: Thinking Deeper with Recursive Spawning May 27, 2024 Few-Shot Learning Question Answering
Code Code Available 1Cost-efficient Knowledge-based Question Answering with Large Language Models May 27, 2024 Knowledge Graphs Model Selection
— Unverified 0On Bits and Bandits: Quantifying the Regret-Information Trade-off May 26, 2024 Decision Making Question Answering
Code Code Available 0Accurate and Nuanced Open-QA Evaluation Through Textual Entailment May 26, 2024 Natural Language Inference Open-Domain Question Answering
Code Code Available 0Map-based Modular Approach for Zero-shot Embodied Question Answering May 26, 2024 Embodied Question Answering Navigate
Code Code Available 1Crafting Interpretable Embeddings by Asking LLMs Questions May 26, 2024 Question Answering
Code Code Available 2iREL at SemEval-2024 Task 9: Improving Conventional Prompting Methods for Brain Teasers May 25, 2024 Common Sense Reasoning Multiple-choice
Code Code Available 0Generating clickbait spoilers with an ensemble of large language models May 25, 2024 Passage Retrieval Question Answering
— Unverified 0Streaming Long Video Understanding with Large Language Models May 25, 2024 Question Answering Video Understanding
— Unverified 0Incremental Comprehension of Garden-Path Sentences by Large Language Models: Semantic Interpretation, Syntactic Re-Analysis, and Attention May 25, 2024 Question Answering Sentence
— Unverified 0Comparative Analysis of Open-Source Language Models in Summarizing Medical Text Data May 25, 2024 Question Answering
— Unverified 0Enhancing Visual-Language Modality Alignment in Large Vision Language Models via Self-Improvement May 24, 2024 Hallucination Image Comprehension
Code Code Available 2Text Generation: A Systematic Literature Review of Tasks, Evaluation, and Challenges May 24, 2024 Document Summarization Multi-Document Summarization
Code Code Available 0OptLLM: Optimal Assignment of Queries to Large Language Models May 24, 2024 Log Parsing Multi-Label Classification
Code Code Available 0Leveraging Logical Rules in Knowledge Editing: A Cherry on the Top May 24, 2024 knowledge editing Multi-hop Question Answering
— Unverified 0Prompt-Aware Adapter: Towards Learning Adaptive Visual Tokens for Multimodal Large Language Models May 24, 2024 Question Answering Visual Question Answering
— Unverified 0AGRaME: Any-Granularity Ranking with Multi-Vector Embeddings May 23, 2024 Open-Domain Question Answering Question Answering
— Unverified 0LOVA3: Learning to Visual Question Answering, Asking and Assessment May 23, 2024 Question Answering Visual Question Answering
Code Code Available 2Large Language Models Can Self-Correct with Key Condition Verification May 23, 2024 Arithmetic Reasoning Math
— Unverified 0HippoRAG: Neurobiologically Inspired Long-Term Memory for Large Language Models May 23, 2024 Hippocampus Knowledge Graphs
Code Code Available 7AGILE: A Novel Reinforcement Learning Framework of LLM Agents May 23, 2024 Question Answering reinforcement-learning
Code Code Available 2WISE: Rethinking the Knowledge Memory for Lifelong Model Editing of Large Language Models May 23, 2024 Hallucination Model Editing
— Unverified 0SearchLVLMs: A Plug-and-Play Framework for Augmenting Large Vision-Language Models by Searching Up-to-Date Internet Knowledge May 23, 2024 Question Answering RAG
— Unverified 0Efficient Medical Question Answering with Knowledge-Augmented Question Generation May 23, 2024 Language Modeling Language Modelling
Code Code Available 0Perception of Knowledge Boundary for Large Language Models through Semi-open-ended Question Answering May 23, 2024 Open-Ended Question Answering Question Answering
— Unverified 0A Survey on Vision-Language-Action Models for Embodied AI May 23, 2024 Image Captioning Instruction Following
Code Code Available 4PitVQA: Image-grounded Text Embedding LLM for Visual Question Answering in Pituitary Surgery May 22, 2024 Question Answering Visual Question Answering
Code Code Available 1Automated Evaluation of Retrieval-Augmented Language Models with Task-Specific Exam Generation May 22, 2024 Informativeness Language Modeling
Code Code Available 2