E.T. Bench: Towards Open-Ended Event-Level Video-Language Understanding Sep 26, 2024 Question Answering Video Understanding
Code Code Available 25 ERA-CoT: Improving Chain-of-Thought through Entity Relationship Analysis Mar 11, 2024 Question Answering
Code Code Available 25 GreaseLM: Graph REASoning Enhanced Language Models for Question Answering Jan 21, 2022 Knowledge Graphs Medical Question Answering
Code Code Available 25 Enhancing Visual-Language Modality Alignment in Large Vision Language Models via Self-Improvement May 24, 2024 Hallucination Image Comprehension
Code Code Available 25 An Embodied Generalist Agent in 3D World Nov 18, 2023 3D dense captioning 3D Question Answering (3D-QA)
Code Code Available 25 VHM: Versatile and Honest Vision Language Model for Remote Sensing Image Analysis Mar 29, 2024 Hallucination Image Captioning
Code Code Available 25 End-To-End Memory Networks Mar 31, 2015 Language Modeling Language Modelling
Code Code Available 25 EmbodiedEval: Evaluate Multimodal LLMs as Embodied Agents Jan 21, 2025 Attribute Question Answering
Code Code Available 25 Empowering Large Language Models to Set up a Knowledge Retrieval Indexer via Self-Learning May 27, 2024 Question Answering RAG
Code Code Available 25 How Much are Large Language Models Contaminated? A Comprehensive Survey and the LLMSanitize Library Mar 31, 2024 Question Answering
Code Code Available 25 End-to-End Navigation with Vision Language Models: Transforming Spatial Reasoning into Question-Answering Nov 8, 2024 Language Modeling Language Modelling
Code Code Available 25 Hungry Hungry Hippos: Towards Language Modeling with State Space Models Dec 28, 2022 8k Coreference Resolution
Code Code Available 25 FinBERT-QA: Financial Question Answering with pre-trained BERT Language Models Apr 24, 2025 Answer Selection Information Retrieval
Code Code Available 25 Language Agent Tree Search Unifies Reasoning Acting and Planning in Language Models Oct 6, 2023 Code Generation Decision Making
Code Code Available 25 Improving Medical Reasoning through Retrieval and Self-Reflection with Retrieval-Augmented Large Language Models Jan 27, 2024 Medical Question Answering Multiple-choice
Code Code Available 25 Improving Retrieval-Augmented Generation through Multi-Agent Reinforcement Learning Jan 25, 2025 Answer Generation Multi-agent Reinforcement Learning
Code Code Available 25 MiniGPT-Med: Large Language Model as a General Interface for Radiology Diagnosis Jul 4, 2024 Diagnostic Language Modeling
Code Code Available 25 ISR-DPO: Aligning Large Multimodal Models for Videos by Iterative Self-Retrospective DPO Jun 17, 2024 Language Modelling Question Answering
Code Code Available 25 JourneyDB: A Benchmark for Generative Image Understanding Jul 3, 2023 Image Captioning Image Comprehension
Code Code Available 25 CausalChaos! Dataset for Comprehensive Causal Action Question Answering Over Longer Causal Chains Grounded in Dynamic Visual Scenes Apr 1, 2024 Causal Discovery Causal Discovery in Video Reasoning
Code Code Available 15 EgoSchema: A Diagnostic Benchmark for Very Long-form Video Language Understanding Aug 17, 2023 Diagnostic EgoSchema
Code Code Available 15 Causal Distillation for Language Models Dec 5, 2021 Language Modeling Language Modelling
Code Code Available 15 EgoTaskQA: Understanding Human Tasks in Egocentric Videos Oct 8, 2022 Action Localization counterfactual
Code Code Available 15 AllenAct: A Framework for Embodied AI Research Aug 28, 2020 Deep Reinforcement Learning Embodied Question Answering
Code Code Available 15 Efficient Passage Retrieval with Hashing for Open-domain Question Answering Jun 2, 2021 Natural Questions Open-Domain Question Answering
Code Code Available 15 MatTools: Benchmarking Large Language Models for Materials Science Tools May 16, 2025 Benchmarking Question Answering
Code Code Available 15 EgoTextVQA: Towards Egocentric Scene-Text Aware Video Question Answering Feb 11, 2025 Question Answering Video Question Answering
Code Code Available 15 Carpe Diem: On the Evaluation of World Knowledge in Lifelong Language Models Nov 14, 2023 Continual Learning Question Answering
Code Code Available 15 CARE: Collaborative AI-Assisted Reading Environment Feb 24, 2023 Question Answering text-classification
Code Code Available 15 Efficiently Tuned Parameters are Task Embeddings Oct 21, 2022 Question Answering Text Classification
Code Code Available 15 Capturing Row and Column Semantics in Transformer Based Question Answering over Tables Apr 16, 2021 Question Answering
Code Code Available 15 Adaptive Information Seeking for Open-Domain Question Answering Sep 14, 2021 Open-Domain Question Answering Question Answering
Code Code Available 15 Beyond NED: Fast and Effective Search Space Reduction for Complex Question Answering over Knowledge Bases Aug 19, 2021 Entity Disambiguation Knowledge Graphs
Code Code Available 15 Effective Human-AI Teams via Learned Natural Language Rules and Onboarding Nov 2, 2023 Language Modeling Language Modelling
Code Code Available 15 Structure-aware Domain Knowledge Injection for Large Language Models Jul 23, 2024 Question Answering
Code Code Available 15 Ranked Voting based Self-Consistency of Large Language Models May 16, 2025 Multiple-choice Open-Ended Question Answering
Code Code Available 15 Educational Question Generation of Children Storybooks via Question Type Distribution Learning and Event-Centric Summarization Mar 27, 2022 Question Answering Question Generation
Code Code Available 15 Efficient and Reproducible Biomedical Question Answering using Retrieval Augmented Generation May 12, 2025 Question Answering RAG
Code Code Available 15 EgoToM: Benchmarking Theory of Mind Reasoning from Egocentric Videos Mar 28, 2025 Benchmarking Question Answering
Code Code Available 15 Can Retriever-Augmented Language Models Reason? The Blame Game Between the Retriever and the Language Model Dec 18, 2022 Language Modeling Language Modelling
Code Code Available 15 AdaLoRA: Adaptive Budget Allocation for Parameter-Efficient Fine-Tuning Mar 18, 2023 parameter-efficient fine-tuning Question Answering
Code Code Available 15 Can't Remember Details in Long Documents? You Need Some R&R Mar 8, 2024 Question Answering
Code Code Available 15 Adapting Pretrained Text-to-Text Models for Long Text Sequences Sep 21, 2022 Long-range modeling Question Answering
Code Code Available 15 EgoThink: Evaluating First-Person Perspective Thinking Capability of Vision-Language Models Nov 27, 2023 Attribute Question Answering
Code Code Available 15 Can questions summarize a corpus? Using question generation for characterizing COVID-19 research Sep 19, 2020 Articles Question Answering
Code Code Available 15 Can Question Rewriting Help Conversational Question Answering? Apr 13, 2022 Conversational Question Answering Question Answering
Code Code Available 15 ECBench: Can Multi-modal Foundation Models Understand the Egocentric World? A Holistic Embodied Cognition Benchmark Jan 9, 2025 Fairness Hallucination
Code Code Available 15 Can NLI Models Verify QA Systems’ Predictions? Nov 1, 2021 Natural Language Inference Question Answering
Code Code Available 15 Benchmarking Multimodal Mathematical Reasoning with Explicit Visual Dependency Apr 24, 2025 Benchmarking Math
Code Code Available 15 Can Pre-trained Vision and Language Models Answer Visual Information-Seeking Questions? Feb 23, 2023 Open-Domain Question Answering Question Answering
Code Code Available 15