Hungry Hungry Hippos: Towards Language Modeling with State Space Models Dec 28, 2022 8k Coreference Resolution
Code Code Available 2Advancing Multimodal Large Language Models in Chart Question Answering with Visualization-Referenced Instruction Tuning Jul 29, 2024 Chart Question Answering Question Answering
Code Code Available 2GreaseLM: Graph REASoning Enhanced Language Models for Question Answering Jan 21, 2022 Knowledge Graphs Medical Question Answering
Code Code Available 2Baleen: Robust Multi-Hop Reasoning at Scale via Condensed Retrieval Jan 2, 2021 Claim Verification Question Answering
Code Code Available 2LaMini-LM: A Diverse Herd of Distilled Models from Large-Scale Instructions Apr 27, 2023 Common Sense Reasoning Coreference Resolution
Code Code Available 2BioMistral: A Collection of Open-Source Pretrained Large Language Models for Medical Domains Feb 15, 2024 Few-Shot Learning Medical Question Answering
Code Code Available 2GraphTranslator: Aligning Graph Model to Large Language Model for Open-ended Tasks Feb 11, 2024 Graph Question Answering Instruction Following
Code Code Available 2AutoTrust: Benchmarking Trustworthiness in Large Vision Language Models for Autonomous Driving Dec 19, 2024 Autonomous Driving Benchmarking
Code Code Available 2GOFA: A Generative One-For-All Model for Joint Graph Language Modeling Jul 12, 2024 All Language Modeling
Code Code Available 2BlendSQL: A Scalable Dialect for Unifying Hybrid Question Answering in Relational Algebra Feb 27, 2024 Question Answering
Code Code Available 2Learn Beyond The Answer: Training Language Models with Reflection for Mathematical Reasoning Jun 17, 2024 Data Augmentation Mathematical Reasoning
Code Code Available 2Learning Dense Representations of Phrases at Scale Dec 23, 2020 Open-Domain Question Answering Question Answering
Code Code Available 2Learn to Explain: Multimodal Reasoning via Thought Chains for Science Question Answering Sep 20, 2022 Multimodal Deep Learning Multimodal Reasoning
Code Code Available 2Breaking the Ceiling of the LLM Community by Treating Token Generation as a Classification for Ensembling Jun 18, 2024 Arithmetic Reasoning Language Modeling
Code Code Available 2Leave No Document Behind: Benchmarking Long-Context LLMs with Extended Multi-Doc QA Jun 25, 2024 Benchmarking Long-Context Understanding
Code Code Available 2LevelRAG: Enhancing Retrieval-Augmented Generation with Multi-hop Logic Planning over Rewriting Augmented Searchers Feb 25, 2025 Multi-hop Question Answering Question Answering
Code Code Available 2Grounded 3D-LLM with Referent Tokens May 16, 2024 Dense Captioning Diversity
Code Code Available 2ktrain: A Low-Code Library for Augmented Machine Learning Apr 19, 2020 BIG-bench Machine Learning Classification
Code Code Available 2Lookahead: An Inference Acceleration Framework for Large Language Model with Lossless Generation Accuracy Dec 20, 2023 Language Modeling Language Modelling
Code Code Available 2Automated Evaluation of Retrieval-Augmented Language Models with Task-Specific Exam Generation May 22, 2024 Informativeness Language Modeling
Code Code Available 2GeoChat: Grounded Large Vision-Language Model for Remote Sensing Nov 24, 2023 Instruction Following Language Modeling
Code Code Available 2Generate-on-Graph: Treat LLM as both Agent and KG in Incomplete Knowledge Graph Question Answering Apr 23, 2024 Graph Question Answering Hallucination
Code Code Available 2GeReA: Question-Aware Prompt Captions for Knowledge-based Visual Question Answering Feb 4, 2024 Language Modeling Language Modelling
Code Code Available 2GAMA: A Large Audio-Language Model with Advanced Audio Understanding and Complex Reasoning Abilities Jun 17, 2024 Audio Question Answering Instruction Following
Code Code Available 2Frozen Transformers in Language Models Are Effective Visual Encoder Layers Oct 19, 2023 Action Recognition Image-text Retrieval
Code Code Available 2LLMGA: Multimodal Large Language Model based Generation Assistant Nov 27, 2023 Image Generation Language Modeling
Code Code Available 2GIT: A Generative Image-to-text Transformer for Vision and Language May 27, 2022 Decoder Image Captioning
Code Code Available 2LLoCO: Learning Long Contexts Offline Apr 11, 2024 4k In-Context Learning
Code Code Available 2From CLIP to DINO: Visual Encoders Shout in Multi-modal Large Language Models Oct 13, 2023 Hallucination Image Captioning
Code Code Available 2Free Video-LLM: Prompt-guided Visual Perception for Efficient Training-free Video LLMs Oct 14, 2024 Computational Efficiency Question Answering
Code Code Available 2FrameFusion: Combining Similarity and Importance for Video Token Reduction on Large Visual Language Models Dec 30, 2024 Question Answering Token Reduction
Code Code Available 2Long-Form Video-Language Pre-Training with Multimodal Temporal Contrastive Learning Oct 12, 2022 Contrastive Learning Form
Code Code Available 2Augmenting Multimodal LLMs with Self-Reflective Tokens for Knowledge-based Visual Question Answering Nov 25, 2024 Question Answering Visual Question Answering
Code Code Available 2FreeVA: Offline MLLM as Training-Free Video Assistant May 13, 2024 Fairness Question Answering
Code Code Available 2LOVA3: Learning to Visual Question Answering, Asking and Assessment May 23, 2024 Question Answering Visual Question Answering
Code Code Available 2LSceneLLM: Enhancing Large 3D Scene Understanding Using Adaptive Visual Preferences Dec 2, 2024 Embodied Question Answering Question Answering
Code Code Available 2F-LMM: Grounding Frozen Large Multimodal Models Jun 9, 2024 General Knowledge Instruction Following
Code Code Available 2FlagEvalMM: A Flexible Framework for Comprehensive Multimodal Model Evaluation Jun 10, 2025 Image-text Retrieval Question Answering
Code Code Available 2FortisAVQA and MAVEN: a Benchmark Dataset and Debiasing Framework for Robust Multimodal Reasoning Apr 1, 2025 Audio-visual Question Answering Audio-Visual Question Answering (AVQA)
Code Code Available 2CAT: Enhancing Multimodal Large Language Model to Answer Questions in Dynamic Audio-Visual Scenarios Mar 7, 2024 Audio-visual Question Answering Audio-Visual Question Answering (AVQA)
Code Code Available 2500xCompressor: Generalized Prompt Compression for Large Language Models Aug 6, 2024 Language Modelling Large Language Model
Code Code Available 2From Redundancy to Relevance: Information Flow in LVLMs Across Reasoning Tasks Jun 4, 2024 Image Captioning Language Modelling
Code Code Available 2A Survey on Benchmarks of Multimodal Large Language Models Aug 16, 2024 Question Answering Survey
Code Code Available 2MDETR - Modulated Detection for End-to-End Multi-Modal Understanding Jan 1, 2021 Phrase Grounding Question Answering
Code Code Available 2Fine-Grained Human Feedback Gives Better Rewards for Language Model Training Jun 2, 2023 Language Modeling Language Modelling
Code Code Available 2Chain-of-Table: Evolving Tables in the Reasoning Chain for Table Understanding Jan 9, 2024 Fact Verification In-Context Learning
Code Code Available 2MedAgentsBench: Benchmarking Thinking Models and Agent Frameworks for Complex Medical Reasoning Mar 10, 2025 Benchmarking Medical Question Answering
Code Code Available 2Chain of Preference Optimization: Improving Chain-of-Thought Reasoning in LLMs Jun 13, 2024 Arithmetic Reasoning Fact Verification
Code Code Available 2Atlas: Few-shot Learning with Retrieval Augmented Language Models Aug 5, 2022 Fact Checking Few-Shot Learning
Code Code Available 2FinBERT-QA: Financial Question Answering with pre-trained BERT Language Models Apr 24, 2025 Answer Selection Information Retrieval
Code Code Available 2