ktrain: A Low-Code Library for Augmented Machine Learning (Apr 19, 2020). Tasks: BIG-bench Machine Learning, Classification. Code available.
LaMini-LM: A Diverse Herd of Distilled Models from Large-Scale Instructions (Apr 27, 2023). Tasks: Common Sense Reasoning, Coreference Resolution. Code available.
AnyAnomaly: Zero-Shot Customizable Video Anomaly Detection with LVLM (Mar 6, 2025). Tasks: Anomaly Detection, Language Modeling. Code available.
Knowledge Graph Prompting for Multi-Document Question Answering (Aug 22, 2023). Tasks: graph construction, Open-Domain Question Answering. Code available.
Knowledge Representation Learning: A Quantitative Review (Dec 28, 2018). Tasks: General Classification, Information Retrieval. Code available.
Language Agent Tree Search Unifies Reasoning Acting and Planning in Language Models (Oct 6, 2023). Tasks: Code Generation, Decision Making. Code available.
Lexicon3D: Probing Visual Foundation Models for Complex 3D Scene Understanding (Sep 5, 2024). Tasks: Question Answering, Scene Understanding. Code available.
JourneyDB: A Benchmark for Generative Image Understanding (Jul 3, 2023). Tasks: Image Captioning, Image Comprehension. Code available.
Interleaving Retrieval with Chain-of-Thought Reasoning for Knowledge-Intensive Multi-Step Questions (Dec 20, 2022). Tasks: Hallucination, Question Answering. Code available.
ISR-DPO: Aligning Large Multimodal Models for Videos by Iterative Self-Retrospective DPO (Jun 17, 2024). Tasks: Language Modelling, Question Answering. Code available.
Improving Retrieval-Augmented Generation through Multi-Agent Reinforcement Learning (Jan 25, 2025). Tasks: Answer Generation, Multi-agent Reinforcement Learning. Code available.
Improving Medical Reasoning through Retrieval and Self-Reflection with Retrieval-Augmented Large Language Models (Jan 27, 2024). Tasks: Medical Question Answering, Multiple-choice. Code available.
InstructDoc: A Dataset for Zero-Shot Generalization of Visual Document Understanding with Instructions (Jan 24, 2024). Tasks: document understanding, Question Answering. Code available.
Keeping Yourself is Important in Downstream Tuning Multimodal Large Language Model (Mar 6, 2025). Tasks: General Knowledge, Image Captioning. Code available.
Huatuo-26M, a Large-scale Chinese Medical QA Dataset (May 2, 2023). Tasks: Language Modeling, Language Modelling. Code available.
How Much are Large Language Models Contaminated? A Comprehensive Survey and the LLMSanitize Library (Mar 31, 2024). Tasks: Question Answering. Code available.
Hybrid Transformer with Multi-level Fusion for Multimodal Knowledge Graph Completion (May 4, 2022). Tasks: Information Retrieval, Knowledge Graph Completion. Code available.
HMT: Hierarchical Memory Transformer for Long Context Language Processing (May 9, 2024). Tasks: Language Modeling, Language Modelling. Code available.
Breaking the Ceiling of the LLM Community by Treating Token Generation as a Classification for Ensembling (Jun 18, 2024). Tasks: Arithmetic Reasoning, Language Modeling. Code available.
BlendSQL: A Scalable Dialect for Unifying Hybrid Question Answering in Relational Algebra (Feb 27, 2024). Tasks: Question Answering. Code available.
Hyena Hierarchy: Towards Larger Convolutional Language Models (Feb 21, 2023). Tasks: 2k, 8k. Code available.
KET-RAG: A Cost-Efficient Multi-Granular Indexing Framework for Graph-RAG (Feb 13, 2025). Tasks: Knowledge Graphs, Large Language Model. Code available.
BioMistral: A Collection of Open-Source Pretrained Large Language Models for Medical Domains (Feb 15, 2024). Tasks: Few-Shot Learning, Medical Question Answering. Code available.
Grounded 3D-LLM with Referent Tokens (May 16, 2024). Tasks: Dense Captioning, Diversity. Code available.
VHM: Versatile and Honest Vision Language Model for Remote Sensing Image Analysis (Mar 29, 2024). Tasks: Hallucination, Image Captioning. Code available.
Advancing the Evaluation of Traditional Chinese Language Models: Towards a Comprehensive Benchmark Suite (Sep 15, 2023). Tasks: Question Answering. Code available.
GraphTranslator: Aligning Graph Model to Large Language Model for Open-ended Tasks (Feb 11, 2024). Tasks: Graph Question Answering, Instruction Following. Code available.
GreaseLM: Graph REASoning Enhanced Language Models for Question Answering (Jan 21, 2022). Tasks: Knowledge Graphs, Medical Question Answering. Code available.
Habitat: A Platform for Embodied AI Research (Apr 2, 2019). Tasks: Benchmarking, GPU. Code available.
An Image Grid Can Be Worth a Video: Zero-shot Video Question Answering Using a VLM (Mar 27, 2024). Tasks: Language Modeling, Language Modelling. Code available.
Advancing Multimodal Large Language Models in Chart Question Answering with Visualization-Referenced Instruction Tuning (Jul 29, 2024). Tasks: Chart Question Answering, Question Answering. Code available.
BiomedGPT: A Generalist Vision-Language Foundation Model for Diverse Biomedical Tasks (May 26, 2023). Tasks: Image Captioning, Medical Visual Question Answering. Code available.
GMAI-MMBench: A Comprehensive Multimodal Evaluation Benchmark Towards General Medical AI (Aug 6, 2024). Tasks: Question Answering, Visual Question Answering. Code available.
Grounding-IQA: Multimodal Language Grounding Model for Image Quality Assessment (Nov 26, 2024). Tasks: Image Quality Assessment, Question Answering. Code available.
GMAI-VL & GMAI-VL-5.5M: A Large Vision-Language Model and A Comprehensive Multimodal Dataset Towards General Medical AI (Nov 21, 2024). Tasks: Decision Making, Language Modeling. Code available.
GOFA: A Generative One-For-All Model for Joint Graph Language Modeling (Jul 12, 2024). Tasks: All, Language Modeling. Code available.
Blended RAG: Improving RAG (Retriever-Augmented Generation) Accuracy with Semantic Search and Hybrid Query-Based Retrievers (Mar 22, 2024). Tasks: Information Retrieval. Code available.
GeoChat: Grounded Large Vision-Language Model for Remote Sensing (Nov 24, 2023). Tasks: Instruction Following, Language Modeling. Code available.
How do you know that? Teaching Generative Language Models to Reference Answers to Biomedical Questions (Jul 6, 2024). Tasks: Question Answering, RAG. Code available.
BLIVA: A Simple Multimodal LLM for Better Handling of Text-Rich Visual Questions (Aug 19, 2023). Tasks: MME, Optical Character Recognition (OCR). Code available.
GeReA: Question-Aware Prompt Captions for Knowledge-based Visual Question Answering (Feb 4, 2024). Tasks: Language Modeling, Language Modelling. Code available.
Hungry Hungry Hippos: Towards Language Modeling with State Space Models (Dec 28, 2022). Tasks: 8k, Coreference Resolution. Code available.
BiMediX2: Bio-Medical EXpert LMM for Diverse Medical Modalities (Dec 10, 2024). Tasks: Medical Visual Question Answering, Question Answering. Code available.
GIT: A Generative Image-to-text Transformer for Vision and Language (May 27, 2022). Tasks: Decoder, Image Captioning. Code available.
KG-Rank: Enhancing Large Language Models for Medical QA with Knowledge Graphs and Ranking Techniques (Mar 9, 2024). Tasks: Knowledge Graphs, Long Form Question Answering. Code available.
IndicGenBench: A Multilingual Benchmark to Evaluate Generation Capabilities of LLMs on Indic Languages (Apr 25, 2024). Tasks: Cross-Lingual Question Answering, Diversity. Code available.
LingoQA: Visual Question Answering for Autonomous Driving (Dec 21, 2023). Tasks: Autonomous Driving, Decision Making. Code available.
An Embodied Generalist Agent in 3D World (Nov 18, 2023). Tasks: 3D dense captioning, 3D Question Answering (3D-QA). Code available.
Iteration of Thought: Leveraging Inner Dialogue for Autonomous Large Language Model Reasoning (Sep 19, 2024). Tasks: Language Modeling, Language Modelling. Code available.
From Redundancy to Relevance: Information Flow in LVLMs Across Reasoning Tasks (Jun 4, 2024). Tasks: Image Captioning, Language Modelling. Code available.