Question: How do Large Language Models perform on the Question Answering tasks? Answer: Dec 17, 2024 Articles Instruction Following
— Unverified 0When to Speak, When to Abstain: Contrastive Decoding with Abstention Dec 17, 2024 Hallucination Question Answering
— Unverified 0EXIT: Context-Aware Extractive Compression for Enhancing Retrieval-Augmented Generation Dec 17, 2024 Question Answering RAG
Code Code Available 1Modality-Inconsistent Continual Learning of Multimodal Large Language Models Dec 17, 2024 Continual Learning Knowledge Distillation
— Unverified 0LLaVA Steering: Visual Instruction Tuning with 500x Fewer Parameters through Modality Linear Representation-Steering Dec 16, 2024 In-Context Learning Instruction Following
Code Code Available 0BioRAGent: A Retrieval-Augmented Generation System for Showcasing Generative Query Expansion and Domain-Specific Search for Scientific Q&A Dec 16, 2024 Answer Generation Few-Shot Learning
Code Code Available 0Interpretable LLM-based Table Question Answering Dec 16, 2024 POS Question Answering
— Unverified 0Context Filtering with Reward Modeling in Question Answering Dec 16, 2024 Question Answering
— Unverified 0UAlign: Leveraging Uncertainty Estimations for Factuality Alignment on Large Language Models Dec 16, 2024 Question Answering
Code Code Available 1ConceptEdit: Conceptualization-Augmented Knowledge Editing in Large Language Models for Commonsense Reasoning Dec 16, 2024 knowledge editing Question Answering
— Unverified 0DARWIN 1.5: Large Language Models as Materials Science Adapted Learners Dec 16, 2024 Large Language Model Multi-Task Learning
Code Code Available 3Advancements and Challenges in Bangla Question Answering Models: A Comprehensive Review Dec 16, 2024 Articles Question Answering
— Unverified 0SCITAT: A Question Answering Benchmark for Scientific Tables and Text Covering Diverse Reasoning Types Dec 16, 2024 Question Answering
Code Code Available 1Precise Length Control in Large Language Models Dec 16, 2024 Decoder Document Summarization
— Unverified 0ACE-M^3: Automatic Capability Evaluator for Multimodal Medical Models Dec 16, 2024 Question Answering
— Unverified 0CG-Bench: Clue-grounded Question Answering Benchmark for Long Video Understanding Dec 16, 2024 Hallucination Multiple-choice
— Unverified 0CPath-Omni: A Unified Multimodal Foundation Model for Patch and Whole Slide Image Analysis in Computational Pathology Dec 16, 2024 Language Modeling Language Modelling
— Unverified 0AgentPS: Agentic Process Supervision for Multi-modal Content Quality Assurance through Multi-round QA Dec 15, 2024 Question Answering
— Unverified 0Cultural Palette: Pluralising Culture Alignment via Multi-agent Palette Dec 15, 2024 Large Language Model Question Answering
— Unverified 0Overview of TREC 2024 Medical Video Question Answering (MedVidQA) Track Dec 15, 2024 Image Captioning Medical Question Answering
— Unverified 0MedG-KRP: Medical Graph Knowledge Representation Probing Dec 14, 2024 Multiple-choice Multiple Choice Question Answering (MCQA)
Code Code Available 0NoisyEQA: Benchmarking Embodied Question Answering Against Noisy Queries Dec 14, 2024 Benchmarking Embodied Question Answering
— Unverified 0Damage Assessment after Natural Disasters with UAVs: Semantic Feature Extraction using Deep Learning Dec 14, 2024 Decision Making Question Answering
— Unverified 0Patch-level Sounding Object Tracking for Audio-Visual Question Answering Dec 14, 2024 Audio-visual Question Answering Object Tracking
— Unverified 0VisDoM: Multi-Document QA with Visually Rich Elements Using Multimodal Retrieval-Augmented Generation Dec 14, 2024 Question Answering RAG
— Unverified 0Evidence Contextualization and Counterfactual Attribution for Conversational QA over Heterogeneous Data with RAG Systems Dec 13, 2024 Answer Generation Conversational Question Answering
— Unverified 0Benchmarking Table Comprehension In The Wild Dec 13, 2024 Benchmarking Question Answering
— Unverified 0LLM Distillation for Efficient Few-Shot Multiple Choice Question Answering Dec 13, 2024 Few-Shot Learning Knowledge Distillation
— Unverified 0Lost in the Middle, and In-Between: Enhancing Language Models' Ability to Reason Over Long Contexts in Multi-Hop QA Dec 13, 2024 Multi-hop Question Answering Question Answering
Code Code Available 0VLR-Bench: Multilingual Benchmark Dataset for Vision-Language Retrieval Augmented Generation Dec 13, 2024 Instruction Following Question Answering
— Unverified 0IQViC: In-context, Question Adaptive Vision Compressor for Long-term Video Understanding LMMs Dec 13, 2024 Question Answering Video Question Answering
— Unverified 0DeepSeek-VL2: Mixture-of-Experts Vision-Language Models for Advanced Multimodal Understanding Dec 13, 2024 Chart Understanding Mixture-of-Experts
Code Code Available 9RETQA: A Large-Scale Open-Domain Tabular Question Answering Dataset for Real Estate Sector Dec 13, 2024 In-Context Learning Question Answering
Code Code Available 1OG-RAG: Ontology-Grounded Retrieval-Augmented Generation For Large Language Models Dec 12, 2024 Question Answering RAG
— Unverified 0ViCaS: A Dataset for Combining Holistic and Pixel-level Video Understanding using Captions with Grounded Segmentation Dec 12, 2024 Phrase Grounding Question Answering
— Unverified 0ViUniT: Visual Unit Tests for More Robust Visual Programming Dec 12, 2024 Image Generation Image-text matching
— Unverified 0Unifying AI Tutor Evaluation: An Evaluation Taxonomy for Pedagogical Ability Assessment of LLM-Powered AI Tutors Dec 12, 2024 Question Answering
Code Code Available 1Multi-Scale Heterogeneous Text-Attributed Graph Datasets From Diverse Domains Dec 12, 2024 Community Question Answering Graph Learning
Code Code Available 0Assessing the Robustness of Retrieval-Augmented Generation Systems in K-12 Educational Question Answering with Knowledge Discrepancies Dec 12, 2024 Question Answering RAG
— Unverified 0Neptune: The Long Orbit to Benchmarking Long Video Understanding Dec 12, 2024 Benchmarking Multimodal Reasoning
Code Code Available 2Foundation Models and Adaptive Feature Selection: A Synergistic Approach to Video Question Answering Dec 12, 2024 feature selection Language Modeling
— Unverified 0Towards a Multimodal Large Language Model with Pixel-Level Insight for Biomedicine Dec 12, 2024 Language Modeling Language Modelling
Code Code Available 2Doe-1: Closed-Loop Autonomous Driving with Large World Model Dec 12, 2024 Autonomous Driving Decision Making
Code Code Available 2A Multimodal Social Agent Dec 11, 2024 Common Sense Reasoning Decision Making
— Unverified 0DialogAgent: An Auto-engagement Agent for Code Question Answering Data Production Dec 11, 2024 Code Generation Question Answering
— Unverified 0Can We Generate Visual Programs Without Prompting LLMs? Dec 11, 2024 Data Augmentation Question Answering
— Unverified 0Progressive Multi-granular Alignments for Grounded Reasoning in Large Vision-Language Models Dec 11, 2024 Question Answering Visual Grounding
Code Code Available 0In-Context Learning with Topological Information for Knowledge Graph Completion Dec 11, 2024 In-Context Learning Information Retrieval
— Unverified 0Illusory VQA: Benchmarking and Enhancing Multimodal Models on Visual Illusions Dec 11, 2024 Benchmarking Question Answering
Code Code Available 0Discrete Subgraph Sampling for Interpretable Graph based Visual Question Answering Dec 11, 2024 Explainable artificial intelligence Explainable Artificial Intelligence (XAI)
Code Code Available 0