Explore the Limits of Omni-modal Pretraining at Scale Jun 13, 2024 Language Modeling Language Modelling
Code Code Available 2VHM: Versatile and Honest Vision Language Model for Remote Sensing Image Analysis Mar 29, 2024 Hallucination Image Captioning
Code Code Available 2FanOutQA: A Multi-Hop, Multi-Document Question Answering Benchmark for Large Language Models Feb 21, 2024 Question Answering
Code Code Available 2E.T. Bench: Towards Open-Ended Event-Level Video-Language Understanding Sep 26, 2024 Question Answering Video Understanding
Code Code Available 2ERA-CoT: Improving Chain-of-Thought through Entity Relationship Analysis Mar 11, 2024 Question Answering
Code Code Available 2Evaluating LLM Reasoning in the Operations Research Domain with ORQA Dec 22, 2024 Question Answering
Code Code Available 2A Multimodal Benchmark Dataset and Model for Crop Disease Diagnosis Mar 10, 2025 Question Answering
Code Code Available 2End-To-End Memory Networks Mar 31, 2015 Language Modeling Language Modelling
Code Code Available 2End-to-End Navigation with Vision Language Models: Transforming Spatial Reasoning into Question-Answering Nov 8, 2024 Language Modeling Language Modelling
Code Code Available 2Hybrid Transformer with Multi-level Fusion for Multimodal Knowledge Graph Completion May 4, 2022 Information Retrieval Knowledge Graph Completion
Code Code Available 2EmbodiedEval: Evaluate Multimodal LLMs as Embodied Agents Jan 21, 2025 Attribute Question Answering
Code Code Available 2Empowering Large Language Models to Set up a Knowledge Retrieval Indexer via Self-Learning May 27, 2024 Question Answering RAG
Code Code Available 2Enhancing Visual-Language Modality Alignment in Large Vision Language Models via Self-Improvement May 24, 2024 Hallucination Image Comprehension
Code Code Available 2AutoTrust: Benchmarking Trustworthiness in Large Vision Language Models for Autonomous Driving Dec 19, 2024 Autonomous Driving Benchmarking
Code Code Available 2Evaluating RAG-Fusion with RAGElo: an Automated Elo-based Framework Jun 20, 2024 Hallucination Question Answering
Code Code Available 2FinMem: A Performance-Enhanced LLM Trading Agent with Layered Memory and Character Design Nov 23, 2023 Decision Making Language Modelling
Code Code Available 2GMAI-VL & GMAI-VL-5.5M: A Large Vision-Language Model and A Comprehensive Multimodal Dataset Towards General Medical AI Nov 21, 2024 Decision Making Language Modeling
Code Code Available 2Med3DVLM: An Efficient Vision-Language Model for 3D Medical Image Analysis Mar 25, 2025 Contrastive Learning Image-text Retrieval
Code Code Available 2ISR-DPO: Aligning Large Multimodal Models for Videos by Iterative Self-Retrospective DPO Jun 17, 2024 Language Modelling Question Answering
Code Code Available 2EgoSchema: A Diagnostic Benchmark for Very Long-form Video Language Understanding Aug 17, 2023 Diagnostic EgoSchema
Code Code Available 1EgoTaskQA: Understanding Human Tasks in Egocentric Videos Oct 8, 2022 Action Localization counterfactual
Code Code Available 1AllenAct: A Framework for Embodied AI Research Aug 28, 2020 Deep Reinforcement Learning Embodied Question Answering
Code Code Available 1MatTools: Benchmarking Large Language Models for Materials Science Tools May 16, 2025 Benchmarking Question Answering
Code Code Available 1Efficient Passage Retrieval with Hashing for Open-domain Question Answering Jun 2, 2021 Natural Questions Open-Domain Question Answering
Code Code Available 1EgoTextVQA: Towards Egocentric Scene-Text Aware Video Question Answering Feb 11, 2025 Question Answering Video Question Answering
Code Code Available 1Effective Human-AI Teams via Learned Natural Language Rules and Onboarding Nov 2, 2023 Language Modeling Language Modelling
Code Code Available 1Efficient and Reproducible Biomedical Question Answering using Retrieval Augmented Generation May 12, 2025 Question Answering RAG
Code Code Available 1Structure-aware Domain Knowledge Injection for Large Language Models Jul 23, 2024 Question Answering
Code Code Available 1Adaptive Information Seeking for Open-Domain Question Answering Sep 14, 2021 Open-Domain Question Answering Question Answering
Code Code Available 1Educational Question Generation of Children Storybooks via Question Type Distribution Learning and Event-Centric Summarization Mar 27, 2022 Question Answering Question Generation
Code Code Available 1Beyond NED: Fast and Effective Search Space Reduction for Complex Question Answering over Knowledge Bases Aug 19, 2021 Entity Disambiguation Knowledge Graphs
Code Code Available 1ECoRAG: Evidentiality-guided Compression for Long Context RAG Jun 5, 2025 Answer Generation Open-Domain Question Answering
Code Code Available 1An Information Bottleneck Perspective for Effective Noise Filtering on Retrieval-Augmented Generation Jun 3, 2024 Answer Generation Question Answering
Code Code Available 1Ranked Voting based Self-Consistency of Large Language Models May 16, 2025 Multiple-choice Open-Ended Question Answering
Code Code Available 1ECG-QA: A Comprehensive Question Answering Dataset Combined With Electrocardiogram Jun 21, 2023 Question Answering
Code Code Available 1Editing Factual Knowledge in Language Models Apr 16, 2021 Fact Checking Meta-Learning
Code Code Available 1Efficiently Tuned Parameters are Task Embeddings Oct 21, 2022 Question Answering Text Classification
Code Code Available 1EgoToM: Benchmarking Theory of Mind Reasoning from Egocentric Videos Mar 28, 2025 Benchmarking Question Answering
Code Code Available 1EAGER: Asking and Answering Questions for Automatic Reward Shaping in Language-guided RL Jun 20, 2022 Question Answering Question Generation
Code Code Available 1AdaLoRA: Adaptive Budget Allocation for Parameter-Efficient Fine-Tuning Mar 18, 2023 parameter-efficient fine-tuning Question Answering
Code Code Available 1EarthVQA: Towards Queryable Earth via Relational Reasoning-Based Remote Sensing Visual Question Answering Dec 19, 2023 Object Object Counting
Code Code Available 1Adapting Pretrained Text-to-Text Models for Long Text Sequences Sep 21, 2022 Long-range modeling Question Answering
Code Code Available 1EA^2E: Improving Consistency with Event Awareness for Document-Level Argument Extraction Jul 1, 2022 Event Argument Extraction Knowledge Base Population
Code Code Available 1Dynamic Semantic Graph Construction and Reasoning for Explainable Multi-hop Science Question Answering May 25, 2021 Abstract Meaning Representation ARC
Code Code Available 1Dynamic Relevance Graph Network for Knowledge-Aware Question Answering Sep 20, 2022 Graph Neural Network Question Answering
Code Code Available 1DynaPipe: Optimizing Multi-task Training through Dynamic Pipelines Nov 17, 2023 Language Modelling Large Language Model
Code Code Available 1Dynamic Language Binding in Relational Visual Reasoning Apr 30, 2020 Object Question Answering
Code Code Available 1Benchmarking Multimodal Mathematical Reasoning with Explicit Visual Dependency Apr 24, 2025 Benchmarking Math
Code Code Available 1Dynamic Multimodal Evaluation with Flexible Complexity by Vision-Language Bootstrapping Oct 11, 2024 MME Question Answering
Code Code Available 1EA^2E: Improving Consistency with Event Awareness for Document-Level Argument Extraction May 30, 2022 Event Argument Extraction Knowledge Base Population
Code Code Available 1