Unleashing the Potentials of Likelihood Composition for Multi-modal Language Models | Oct 1, 2024 | Question Answering, Visual Question Answering | Code Available
Benchmarking Large Language Models for Conversational Question Answering in Multi-instructional Documents | Oct 1, 2024 | Benchmarking, Conversational Question Answering | Unverified
Quantifying reliance on external information over parametric knowledge during Retrieval Augmented Generation (RAG) using mechanistic analysis | Oct 1, 2024 | Information Retrieval, Language Modeling | Unverified
BabelBench: An Omni Benchmark for Code-Driven Analysis of Multimodal and Multistructured Data | Oct 1, 2024 | Code Generation, Logical Reasoning | Code Available
Semantic Parsing with Candidate Expressions for Knowledge Base Question Answering | Oct 1, 2024 | Knowledge Base Question Answering, Question Answering | Code Available
Addition is All You Need for Energy-efficient Language Models | Oct 1, 2024 | All, Natural Language Understanding | Unverified
FMBench: Benchmarking Fairness in Multimodal Large Language Models on Medical Tasks | Oct 1, 2024 | Benchmarking, Fairness | Unverified
World to Code: Multi-modal Data Generation via Self-Instructed Compositional Captioning and Filtering | Sep 30, 2024 | Optical Character Recognition (OCR), Question Answering | Code Available
Reference Trustable Decoding: A Training-Free Augmentation Paradigm for Large Language Models | Sep 30, 2024 | Few-Shot Learning, In-Context Learning | Code Available
See then Tell: Enhancing Key Information Extraction with Vision Grounding | Sep 29, 2024 | Image to text, Key Information Extraction | Unverified
MedViLaM: A multimodal large language model with advanced generalizability and explainability for medical data understanding and generation | Sep 29, 2024 | Language Modeling, Language Modelling | Code Available
Towards Robust Extractive Question Answering Models: Rethinking the Training Methodology | Sep 29, 2024 | Extractive Question-Answering, Question Answering | Code Available
Does RAG Introduce Unfairness in LLMs? Evaluating Fairness in Retrieval-Augmented Generation Systems | Sep 29, 2024 | Fairness, Open-Domain Question Answering | Code Available
Video DataFlywheel: Resolving the Impossible Data Trinity in Video-Language Understanding | Sep 29, 2024 | Diversity, Question Answering | Unverified
Zero-Shot Multi-Hop Question Answering via Monte-Carlo Tree Search with Large Language Models | Sep 28, 2024 | Multi-hop Question Answering, Question Answering | Unverified
3D-CT-GPT: Generating 3D Radiology Reports through Integration of Large Vision-Language Models | Sep 28, 2024 | Diagnostic, Language Modeling | Unverified
HealthQ: Unveiling Questioning Capabilities of LLM Chains in Healthcare Conversations | Sep 28, 2024 | Dataset Generation, Informativeness | Unverified
TrojVLM: Backdoor Attack Against Vision Language Models | Sep 28, 2024 | Backdoor Attack, Image Captioning | Unverified
Charting the Future: Using Chart Question-Answering for Scalable Evaluation of LLM-Driven Data Visualizations | Sep 27, 2024 | Chart Question Answering, Question Answering | Unverified
Exploring Language Model Generalization in Low-Resource Extractive QA | Sep 27, 2024 | Domain Generalization, Extractive Question-Answering | Code Available
AIPatient: Simulating Patients with EHRs and LLM Powered Agentic Workflow | Sep 27, 2024 | Medical Question Answering, Question Answering | Unverified
Enhancing Explainability in Multimodal Large Language Models Using Ontological Context | Sep 27, 2024 | Image Captioning, Question Answering | Unverified
Rehearsing Answers to Probable Questions with Perspective-Taking | Sep 27, 2024 | Common Sense Reasoning, Knowledge Graphs | Unverified
Revisiting the Superficial Alignment Hypothesis | Sep 27, 2024 | Instruction Following, Math | Unverified
T3: A Novel Zero-shot Transfer Learning Framework Iteratively Training on an Assistant Task for a Target Task | Sep 26, 2024 | Question Answering, Semantic Similarity | Unverified
ZALM3: Zero-Shot Enhancement of Vision-Language Alignment via In-Context Information in Multi-Turn Multimodal Medical Dialogue | Sep 26, 2024 | Medical Visual Question Answering, Question Answering | Unverified
Episodic Memory Verbalization using Hierarchical Representations of Life-Long Robot Experience | Sep 26, 2024 | Language Modeling, Language Modelling | Unverified
Efficient In-Domain Question Answering for Resource-Constrained Environments | Sep 26, 2024 | parameter-efficient fine-tuning, Prompt Engineering | Unverified
DisGeM: Distractor Generation for Multiple Choice Questions with Span Masking | Sep 26, 2024 | Distractor Generation, Multiple-choice | Code Available
Robotic Environmental State Recognition with Pre-Trained Vision-Language Models and Black-Box Optimization | Sep 26, 2024 | Image to text, Image-to-Text Retrieval | Unverified
DARE: Diverse Visual Question Answering with Robustness Evaluation | Sep 26, 2024 | image-classification, Image Classification | Unverified
Integrating Hierarchical Semantic into Iterative Generation Model for Entailment Tree Explanation | Sep 26, 2024 | Question Answering | Unverified
Enhancing Temporal Sensitivity and Reasoning for Time-Sensitive Question Answering | Sep 25, 2024 | Question Answering, Sensitivity | Unverified
Enhancing Post-Hoc Attributions in Long Document Comprehension via Coarse Grained Answer Decomposition | Sep 25, 2024 | In-Context Learning, Question Answering | Unverified
Detecting Temporal Ambiguity in Questions | Sep 25, 2024 | Open-Domain Question Answering, Question Answering | Code Available
SynTQA: Synergistic Table-based Question Answering via Mixture of Text-to-SQL and E2E TQA | Sep 25, 2024 | Answer Selection, Question Answering | Code Available
Konstruktor: A Strong Baseline for Simple Knowledge Graph Question Answering | Sep 24, 2024 | Entity Linking, Graph Question Answering | Code Available
Unlocking Markets: A Multilingual Benchmark to Cross-Market Question Answering | Sep 24, 2024 | Answer Generation, Question Answering | Code Available
From Pixels to Words: Leveraging Explainability in Face Recognition through Interactive Natural Language Processing | Sep 24, 2024 | Chatbot, Explainable artificial intelligence | Unverified
Expert-level vision-language foundation model for real-world radiology and comprehensive evaluation | Sep 24, 2024 | Question Answering, Text Generation | Unverified
60 Data Points are Sufficient to Fine-Tune LLMs for Question-Answering | Sep 24, 2024 | Question Answering, World Knowledge | Unverified
A Zero-Shot Open-Vocabulary Pipeline for Dialogue Understanding | Sep 24, 2024 | Dialogue State Tracking, Dialogue Understanding | Code Available
A Unified Hallucination Mitigation Framework for Large Vision-Language Models | Sep 24, 2024 | Hallucination, Question Answering | Code Available
Lighter And Better: Towards Flexible Context Adaptation For Retrieval Augmented Generation | Sep 24, 2024 | Question Answering, RAG | Unverified
AsthmaBot: Multi-modal, Multi-Lingual Retrieval Augmented Generation For Asthma Patient Support | Sep 24, 2024 | Hallucination, Question Answering | Unverified
Towards Efficient and Robust VQA-NLE Data Generation with Large Vision-Language Models | Sep 23, 2024 | Decision Making, Question Answering | Code Available
LINKAGE: Listwise Ranking among Varied-Quality References for Non-Factoid QA Evaluation via LLMs | Sep 23, 2024 | Learning-To-Rank, Question Answering | Unverified
A Preliminary Study of o1 in Medicine: Are We Closer to an AI Doctor? | Sep 23, 2024 | Hallucination, MedQA | Unverified
Can CLIP Count Stars? An Empirical Study on Quantity Bias in CLIP | Sep 23, 2024 | Image Generation, Question Answering | Unverified
GEM-RAG: Graphical Eigen Memories For Retrieval Augmented Generation | Sep 23, 2024 | Question Answering, RAG | Unverified