PEDANTS: Cheap but Effective and Interpretable Answer Equivalence Feb 17, 2024 Benchmarking Form
Code Code Available 2AI Hospital: Benchmarking Large Language Models in a Multi-agent Medical Interaction Simulator Feb 15, 2024 Benchmarking Diagnostic
Code Code Available 2BioMistral: A Collection of Open-Source Pretrained Large Language Models for Medical Domains Feb 15, 2024 Few-Shot Learning Medical Question Answering
Code Code Available 2GraphTranslator: Aligning Graph Model to Large Language Model for Open-ended Tasks Feb 11, 2024 Graph Question Answering Instruction Following
Code Code Available 2Verif.ai: Towards an Open-Source Scientific Generative Question-Answering System with Referenced and Verifiable Answers Feb 9, 2024 Generative Question Answering Information Retrieval
Code Code Available 2CREMA: Generalizable and Efficient Video-Language Reasoning via Multimodal Modular Fusion Feb 8, 2024 Computational Efficiency Multimodal Reasoning
Code Code Available 2ScreenAI: A Vision-Language Model for UI and Infographics Understanding Feb 7, 2024 Chart Question Answering Language Modeling
Code Code Available 2Position: What Can Large Language Models Tell Us about Time Series Analysis Feb 5, 2024 Decision Making Position
Code Code Available 2GeReA: Question-Aware Prompt Captions for Knowledge-based Visual Question Answering Feb 4, 2024 Language Modeling Language Modelling
Code Code Available 2Improving Medical Reasoning through Retrieval and Self-Reflection with Retrieval-Augmented Large Language Models Jan 27, 2024 Medical Question Answering Multiple-choice
Code Code Available 2InstructDoc: A Dataset for Zero-Shot Generalization of Visual Document Understanding with Instructions Jan 24, 2024 document understanding Question Answering
Code Code Available 2Can AI Assistants Know What They Don't Know? Jan 24, 2024 Math Open-Domain Question Answering
Code Code Available 2Tuning Language Models by Proxy Jan 16, 2024 Domain Adaptation Math
Code Code Available 2MMToM-QA: Multimodal Theory of Mind Question Answering Jan 16, 2024 Question Answering Theory of Mind Modeling
Code Code Available 2EHRAgent: Code Empowers Large Language Models for Few-shot Complex Tabular Reasoning on Electronic Health Records Jan 13, 2024 Code Generation Few-Shot Learning
Code Code Available 2Chain-of-Table: Evolving Tables in the Reasoning Chain for Table Understanding Jan 9, 2024 Fact Verification In-Context Learning
Code Code Available 2Parameter-Efficient Sparsity Crafting from Dense to Mixture-of-Experts for Instruction Tuning on General Tasks Jan 5, 2024 Arithmetic Reasoning Code Generation
Code Code Available 2PeFoMed: Parameter Efficient Fine-tuning of Multimodal Large Language Models for Medical Imaging Jan 5, 2024 Medical Report Generation Medical Visual Question Answering
Code Code Available 2VCoder: Versatile Vision Encoders for Multimodal Large Language Models Dec 21, 2023 Image Captioning Image Generation
Code Code Available 2LingoQA: Visual Question Answering for Autonomous Driving Dec 21, 2023 Autonomous Driving Decision Making
Code Code Available 2Lookahead: An Inference Acceleration Framework for Large Language Model with Lossless Generation Accuracy Dec 20, 2023 Language Modeling Language Modelling
Code Code Available 2Chat-Scene: Bridging 3D Scene and Large Language Models with Object Identifiers Dec 13, 2023 3D Question Answering (3D-QA) Attribute
Code Code Available 2OneLLM: One Framework to Align All Modalities with Language Dec 6, 2023 All Question Answering
Code Code Available 2Towards Learning a Generalist Model for Embodied Navigation Dec 4, 2023 3D Question Answering (3D-QA) Embodied Question Answering
Code Code Available 2LL3DA: Visual Interactive Instruction Tuning for Omni-3D Understanding, Reasoning, and Planning Nov 30, 2023 3D dense captioning Dense Captioning
Code Code Available 2LLaMA-VID: An Image is Worth 2 Tokens in Large Language Models Nov 28, 2023 Image Captioning Question Answering
Code Code Available 2LLMGA: Multimodal Large Language Model based Generation Assistant Nov 27, 2023 Image Generation Language Modeling
Code Code Available 2GeoChat: Grounded Large Vision-Language Model for Remote Sensing Nov 24, 2023 Instruction Following Language Modeling
Code Code Available 2FinMem: A Performance-Enhanced LLM Trading Agent with Layered Memory and Character Design Nov 23, 2023 Decision Making Language Modelling
Code Code Available 2PG-Video-LLaVA: Pixel Grounding Large Video-Language Models Nov 22, 2023 Benchmarking Phrase Grounding
Code Code Available 2An Embodied Generalist Agent in 3D World Nov 18, 2023 3D dense captioning 3D Question Answering (3D-QA)
Code Code Available 2Never Lost in the Middle: Mastering Long-Context Question Answering with Position-Agnostic Decompositional Training Nov 15, 2023 Passage Retrieval Position
Code Code Available 2Learning to Filter Context for Retrieval-Augmented Generation Nov 14, 2023 Extractive Question-Answering Fact Verification
Code Code Available 2Agent Lumos: Unified and Modular Training for Open-Source Language Agents Nov 9, 2023 Math Question Answering
Code Code Available 2DISC-FinLLM: A Chinese Financial Large Language Model based on Multiple Experts Fine-tuning Oct 23, 2023 Language Modeling Language Modelling
Code Code Available 2Frozen Transformers in Language Models Are Effective Visual Encoder Layers Oct 19, 2023 Action Recognition Image-text Retrieval
Code Code Available 2From CLIP to DINO: Visual Encoders Shout in Multi-modal Large Language Models Oct 13, 2023 Hallucination Image Captioning
Code Code Available 2ChatKBQA: A Generate-then-Retrieve Framework for Knowledge Base Question Answering with Fine-tuned Large Language Models Oct 13, 2023 Knowledge Base Question Answering Knowledge Graphs
Code Code Available 2LoftQ: LoRA-Fine-Tuning-Aware Quantization for Large Language Models Oct 12, 2023 Natural Language Understanding Quantization
Code Code Available 2Mini-DALLE3: Interactive Text to Image by Prompting Large Language Models Oct 11, 2023 Code Generation Image Generation
Code Code Available 2Sheared LLaMA: Accelerating Language Model Pre-training via Structured Pruning Oct 10, 2023 Language Modeling Language Modelling
Code Code Available 2Compressing Context to Enhance Inference Efficiency of Large Language Models Oct 9, 2023 Articles Question Answering
Code Code Available 2Language Agent Tree Search Unifies Reasoning Acting and Planning in Language Models Oct 6, 2023 Code Generation Decision Making
Code Code Available 2MathVista: Evaluating Mathematical Reasoning of Foundation Models in Visual Contexts Oct 3, 2023 Chatbot Image Captioning
Code Code Available 2Representation Engineering: A Top-Down Approach to AI Transparency Oct 2, 2023 Question Answering
Code Code Available 2Fine-grained Late-interaction Multi-modal Retrieval for Retrieval Augmented Visual Question Answering Sep 29, 2023 Image to text Passage Retrieval
Code Code Available 2StructChart: On the Schema, Metric, and Augmentation for Visual Chart Understanding Sep 20, 2023 Chart Question Answering Chart Understanding
Code Code Available 2Advancing the Evaluation of Traditional Chinese Language Models: Towards a Comprehensive Benchmark Suite Sep 15, 2023 Question Answering
Code Code Available 2Point-Bind & Point-LLM: Aligning Point Cloud with Multi-modality for 3D Understanding, Generation, and Instruction Following Sep 1, 2023 3D Generation 3D Question Answering (3D-QA)
Code Code Available 2Knowledge Graph Prompting for Multi-Document Question Answering Aug 22, 2023 graph construction Open-Domain Question Answering
Code Code Available 2