MedViLaM: A multimodal large language model with advanced generalizability and explainability for medical data understanding and generation Sep 29, 2024 Language Modeling Language Modelling
Code Code Available 0See then Tell: Enhancing Key Information Extraction with Vision Grounding Sep 29, 2024 Image to text Key Information Extraction
— Unverified 0CoTKR: Chain-of-Thought Enhanced Knowledge Rewriting for Complex Knowledge Graph Question Answering Sep 29, 2024 Graph Question Answering Question Answering
Code Code Available 1T2Vs Meet VLMs: A Scalable Multimodal Dataset for Visual Harmfulness Recognition Sep 29, 2024 In-Context Learning Question Answering
Code Code Available 1HealthQ: Unveiling Questioning Capabilities of LLM Chains in Healthcare Conversations Sep 28, 2024 Dataset Generation Informativeness
— Unverified 0Zero-Shot Multi-Hop Question Answering via Monte-Carlo Tree Search with Large Language Models Sep 28, 2024 Multi-hop Question Answering Question Answering
— Unverified 0TrojVLM: Backdoor Attack Against Vision Language Models Sep 28, 2024 Backdoor Attack Image Captioning
— Unverified 03D-CT-GPT: Generating 3D Radiology Reports through Integration of Large Vision-Language Models Sep 28, 2024 Diagnostic Language Modeling
— Unverified 0Revisiting the Superficial Alignment Hypothesis Sep 27, 2024 Instruction Following Math
— Unverified 0Rehearsing Answers to Probable Questions with Perspective-Taking Sep 27, 2024 Common Sense Reasoning Knowledge Graphs
— Unverified 0AIPatient: Simulating Patients with EHRs and LLM Powered Agentic Workflow Sep 27, 2024 Medical Question Answering Question Answering
— Unverified 0Exploring Language Model Generalization in Low-Resource Extractive QA Sep 27, 2024 Domain Generalization Extractive Question-Answering
Code Code Available 0Charting the Future: Using Chart Question-Answering for Scalable Evaluation of LLM-Driven Data Visualizations Sep 27, 2024 Chart Question Answering Question Answering
— Unverified 0Enhancing Explainability in Multimodal Large Language Models Using Ontological Context Sep 27, 2024 Image Captioning Question Answering
— Unverified 0DisGeM: Distractor Generation for Multiple Choice Questions with Span Masking Sep 26, 2024 Distractor Generation Multiple-choice
Code Code Available 0Efficient In-Domain Question Answering for Resource-Constrained Environments Sep 26, 2024 parameter-efficient fine-tuning Prompt Engineering
— Unverified 0Integrating Hierarchical Semantic into Iterative Generation Model for Entailment Tree Explanation Sep 26, 2024 Question Answering
— Unverified 0Robotic Environmental State Recognition with Pre-Trained Vision-Language Models and Black-Box Optimization Sep 26, 2024 Image to text Image-to-Text Retrieval
— Unverified 0Episodic Memory Verbalization using Hierarchical Representations of Life-Long Robot Experience Sep 26, 2024 Language Modeling Language Modelling
— Unverified 0ZALM3: Zero-Shot Enhancement of Vision-Language Alignment via In-Context Information in Multi-Turn Multimodal Medical Dialogue Sep 26, 2024 Medical Visual Question Answering Question Answering
— Unverified 0T3: A Novel Zero-shot Transfer Learning Framework Iteratively Training on an Assistant Task for a Target Task Sep 26, 2024 Question Answering Semantic Similarity
— Unverified 0Uni-Med: A Unified Medical Generalist Foundation Model For Multi-Task Learning Via Connector-MoE Sep 26, 2024 image-classification Image Classification
Code Code Available 1E.T. Bench: Towards Open-Ended Event-Level Video-Language Understanding Sep 26, 2024 Question Answering Video Understanding
Code Code Available 2DARE: Diverse Visual Question Answering with Robustness Evaluation Sep 26, 2024 image-classification Image Classification
— Unverified 0Enhancing Post-Hoc Attributions in Long Document Comprehension via Coarse Grained Answer Decomposition Sep 25, 2024 In-Context Learning Question Answering
— Unverified 0SynTQA: Synergistic Table-based Question Answering via Mixture of Text-to-SQL and E2E TQA Sep 25, 2024 Answer Selection Question Answering
Code Code Available 0Detecting Temporal Ambiguity in Questions Sep 25, 2024 Open-Domain Question Answering Question Answering
Code Code Available 0Enhancing Temporal Sensitivity and Reasoning for Time-Sensitive Question Answering Sep 25, 2024 Question Answering Sensitivity
— Unverified 0Unlocking Markets: A Multilingual Benchmark to Cross-Market Question Answering Sep 24, 2024 Answer Generation Question Answering
Code Code Available 0Konstruktor: A Strong Baseline for Simple Knowledge Graph Question Answering Sep 24, 2024 Entity Linking Graph Question Answering
Code Code Available 0Lighter And Better: Towards Flexible Context Adaptation For Retrieval Augmented Generation Sep 24, 2024 Question Answering RAG
— Unverified 0Expert-level vision-language foundation model for real-world radiology and comprehensive evaluation Sep 24, 2024 Question Answering Text Generation
— Unverified 0A Unified Hallucination Mitigation Framework for Large Vision-Language Models Sep 24, 2024 Hallucination Question Answering
Code Code Available 0Exploring Hint Generation Approaches in Open-Domain Question Answering Sep 24, 2024 Hint Generation Open-Domain Question Answering
Code Code Available 1From Pixels to Words: Leveraging Explainability in Face Recognition through Interactive Natural Language Processing Sep 24, 2024 Chatbot Explainable artificial intelligence
— Unverified 0A Zero-Shot Open-Vocabulary Pipeline for Dialogue Understanding Sep 24, 2024 Dialogue State Tracking Dialogue Understanding
Code Code Available 060 Data Points are Sufficient to Fine-Tune LLMs for Question-Answering Sep 24, 2024 Question Answering World Knowledge
— Unverified 0AsthmaBot: Multi-modal, Multi-Lingual Retrieval Augmented Generation For Asthma Patient Support Sep 24, 2024 Hallucination Question Answering
— Unverified 0MediConfusion: Can you trust your AI radiologist? Probing the reliability of multimodal medical foundation models Sep 23, 2024 Medical Visual Question Answering Question Answering
Code Code Available 1Learning When to Retrieve, What to Rewrite, and How to Respond in Conversational QA Sep 23, 2024 Conversational Question Answering Information Retrieval
— Unverified 0Detect, Describe, Discriminate: Moving Beyond VQA for MLLM Evaluation Sep 23, 2024 Multiple-choice Question Answering
— Unverified 0GEM-RAG: Graphical Eigen Memories For Retrieval Augmented Generation Sep 23, 2024 Question Answering RAG
— Unverified 0Boosting Healthcare LLMs Through Retrieved Context Sep 23, 2024 Benchmarking Multiple-choice
Code Code Available 1Using Similarity to Evaluate Factual Consistency in Summaries Sep 23, 2024 Natural Language Inference Question Answering
— Unverified 0Towards Efficient and Robust VQA-NLE Data Generation with Large Vision-Language Models Sep 23, 2024 Decision Making Question Answering
Code Code Available 0A Preliminary Study of o1 in Medicine: Are We Closer to an AI Doctor? Sep 23, 2024 Hallucination MedQA
— Unverified 0Can CLIP Count Stars? An Empirical Study on Quantity Bias in CLIP Sep 23, 2024 Image Generation Question Answering
— Unverified 0LINKAGE: Listwise Ranking among Varied-Quality References for Non-Factoid QA Evaluation via LLMs Sep 23, 2024 Learning-To-Rank Question Answering
— Unverified 0Scene-Text Grounding for Text-Based Video Question Answering Sep 22, 2024 2k Contrastive Learning
Code Code Available 1Evaluating the Performance and Robustness of LLMs in Materials Science Q&A and Property Predictions Sep 22, 2024 Band Gap In-Context Learning
— Unverified 0