LongVLM: Efficient Long Video Understanding via Large Language Models Apr 4, 2024 Question Answering Video Question Answering
Code Code Available 2Can Small Language Models Help Large Language Models Reason Better?: LM-Guided Chain-of-Thought Apr 4, 2024 Extractive Question-Answering Knowledge Distillation
— Unverified 0TinyVQA: Compact Multimodal Deep Neural Network for Visual Question Answering on Resource-Constrained Devices Apr 4, 2024 Quantization Question Answering
— Unverified 0Untangle the KNOT: Interweaving Conflicting Knowledge and Reasoning Skills in Large Language Models Apr 4, 2024 Question Answering
Code Code Available 0Sailor: Open Language Models for South-East Asia Apr 4, 2024 Language Modeling Language Modelling
Code Code Available 4The Death of Feature Engineering? BERT with Linguistic Features on SQuAD 2.0 Apr 4, 2024 Feature Engineering Machine Reading Comprehension
— Unverified 0Learning to Plan and Generate Text with Citations Apr 4, 2024 Long Form Question Answering Question Answering
— Unverified 0Automatic Prompt Selection for Large Language Models Apr 3, 2024 GSM8K Question Answering
— Unverified 0Multi-Granularity Guided Fusion-in-Decoder Apr 3, 2024 Decoder Multi-Task Learning
Code Code Available 1Enhancing Human-Computer Interaction in Chest X-ray Analysis using Vision and Language Model with Eye Gaze Patterns Apr 3, 2024 Language Modeling Language Modelling
— Unverified 0Using Large Language Models to Understand Telecom Standards Apr 2, 2024 Question Answering
— Unverified 0CLAPNQ: Cohesive Long-form Answers from Passages in Natural Questions for RAG systems Apr 2, 2024 Form Long Form Question Answering
Code Code Available 1Self-Improvement Programming for Temporal Knowledge Graph Question Answering Apr 2, 2024 Graph Question Answering In-Context Learning
— Unverified 0Improving Retrieval Augmented Open-Domain Question-Answering with Vectorized Contexts Apr 2, 2024 In-Context Learning Language Modeling
Code Code Available 0Rematch: Robust and Efficient Matching of Local Knowledge Graphs to Improve Structural and Semantic Similarity Apr 2, 2024 Abstract Meaning Representation Fact Checking
Code Code Available 0Helmsman of the Masses? Evaluate the Opinion Leadership of Large Language Models in the Werewolf Game Apr 2, 2024 Question Answering
Code Code Available 0Towards Better Generalization in Open-Domain Question Answering by Mitigating Context Memorization Apr 2, 2024 Memorization Open-Domain Question Answering
— Unverified 0mChartQA: A universal benchmark for multimodal Chart Question Answer based on Vision-Language Alignment and Reasoning Apr 2, 2024 Chart Question Answering Language Modeling
— Unverified 0Stable Code Technical Report Apr 1, 2024 Code Completion Language Modelling
— Unverified 0TraveLER: A Modular Multi-LMM Agent Framework for Video Question-Answering Apr 1, 2024 Question Answering Video Question Answering
Code Code Available 1VideoDistill: Language-aware Vision Distillation for Video Question Answering Apr 1, 2024 Answer Generation Question Answering
— Unverified 0CausalChaos! Dataset for Comprehensive Causal Action Question Answering Over Longer Causal Chains Grounded in Dynamic Visual Scenes Apr 1, 2024 Causal Discovery Causal Discovery in Video Reasoning
Code Code Available 1Unveiling Divergent Inductive Biases of LLMs on Temporal Data Apr 1, 2024 Inductive Bias Natural Language Inference
Code Code Available 0Evaluating Text-to-Visual Generation with Image-to-Text Generation Apr 1, 2024 Image to text Question Answering
Code Code Available 3Direct Preference Optimization of Video Large Multimodal Models from Language Model Reward Apr 1, 2024 Instruction Following Language Modeling
Code Code Available 2Detect2Interact: Localizing Object Key Field in Visual Question Answering (VQA) with LLMs Apr 1, 2024 Common Sense Reasoning Object
— Unverified 0Learning by Correction: Efficient Tuning Task for Zero-Shot Generative Vision-Language Reasoning Apr 1, 2024 Image Captioning Instruction Following
Code Code Available 0How Much are Large Language Models Contaminated? A Comprehensive Survey and the LLMSanitize Library Mar 31, 2024 Question Answering
Code Code Available 2Explainable Multi-hop Question Generation: An End-to-End Approach without Intermediate Question Labeling Mar 31, 2024 Question Answering Question Generation
Code Code Available 0M3D: Advancing 3D Medical Image Analysis with Multi-Modal Large Language Models Mar 31, 2024 Image-text Retrieval Language Modeling
Code Code Available 3How Robust are the Tabular QA Models for Scientific Tables? A Study using Customized Dataset Mar 30, 2024 Question Answering
Code Code Available 0Small Language Models Learn Enhanced Reasoning Skills from Medical Textbooks Mar 30, 2024 Few-Shot Learning Instruction Following
— Unverified 0DOCMASTER: A Unified Platform for Annotation, Training, & Inference in Document Question-Answering Mar 30, 2024 Privacy Preserving Question Answering
— Unverified 0Design as Desired: Utilizing Visual Question Answering for Multimodal Pre-training Mar 30, 2024 Contrastive Learning Question Answering
Code Code Available 0Linguistic Calibration of Long-Form Generations Mar 30, 2024 Decision Making Form
Code Code Available 1Multi-hop Question Answering under Temporal Knowledge Editing Mar 30, 2024 knowledge editing Multi-hop Question Answering
— Unverified 0Uncovering Bias in Large Vision-Language Models with Counterfactuals Mar 29, 2024 counterfactual Question Answering
— Unverified 0Unsolvable Problem Detection: Evaluating Trustworthiness of Vision Language Models Mar 29, 2024 Question Answering Visual Question Answering
Code Code Available 2VHM: Versatile and Honest Vision Language Model for Remote Sensing Image Analysis Mar 29, 2024 Hallucination Image Captioning
Code Code Available 2Draw-and-Understand: Leveraging Visual Prompts to Enable MLLMs to Comprehend What You Want Mar 29, 2024 Instruction Following Language Modelling
Code Code Available 2MANGO: A Benchmark for Evaluating Mapping and Navigation Abilities of Large Language Models Mar 29, 2024 Language Modeling Language Modelling
Code Code Available 0Leveraging Expert Input for Robust and Explainable AI-Assisted Lung Cancer Detection in Chest X-rays Mar 28, 2024 Binary Classification Decision Making
— Unverified 0Are Large Language Models Good at Utility Judgments? Mar 28, 2024 Answer Generation Benchmarking
Code Code Available 0Multi-Frame, Lightweight & Efficient Vision-Language Models for Question Answering in Autonomous Driving Mar 28, 2024 Autonomous Driving Language Modeling
Code Code Available 2EthioMT: Parallel Corpus for Low-resource Ethiopian Languages Mar 28, 2024 Machine Translation News Classification
— Unverified 0JDocQA: Japanese Document Question Answering Dataset for Generative Language Models Mar 28, 2024 Hallucination Question Answering
Code Code Available 1Retrieval-enhanced Knowledge Editing in Language Models for Multi-Hop Question Answering Mar 28, 2024 Hallucination In-Context Learning
Code Code Available 1MFORT-QA: Multi-hop Few-shot Open Rich Table Question Answering Mar 28, 2024 Few-Shot Learning Question Answering
— Unverified 0Reshaping Free-Text Radiology Notes Into Structured Reports With Generative Transformers Mar 27, 2024 Generative Question Answering Information Retrieval
Code Code Available 0Quantifying and Mitigating Unimodal Biases in Multimodal Large Language Models: A Causal Perspective Mar 27, 2024 Question Answering Visual Question Answering
Code Code Available 1