Holistic Evaluation of Language Models Nov 16, 2022 Fairness Question Answering
Code Code Available 4BioGPT: Generative Pre-trained Transformer for Biomedical Text Generation and Mining Oct 19, 2022 Document Classification Language Modelling
Code Code Available 4ReAct: Synergizing Reasoning and Acting in Language Models Oct 6, 2022 Decision Making Fact Verification
Code Code Available 4N-Grammer: Augmenting Transformers with latent n-grams Jul 13, 2022 Common Sense Reasoning Coreference Resolution
Code Code Available 4Flamingo: a Visual Language Model for Few-Shot Learning Apr 29, 2022 Few-Shot Learning Generative Visual Question Answering
Code Code Available 4What Makes Good In-Context Examples for GPT-3? Jan 17, 2021 Few-Shot Learning Natural Language Understanding
Code Code Available 4Retrieval-Augmented Generation for Knowledge-Intensive NLP Tasks May 22, 2020 Fact Verification Question Answering
Code Code Available 4Predicting Subjective Features of Questions of QA Websites using BERT Feb 24, 2020 Community Question Answering Question Answering
Code Code Available 4L0: Reinforcement Learning to Become General Agents Jun 30, 2025 Question Answering reinforcement-learning
Code Code Available 3KVzip: Query-Agnostic KV Cache Compression with Context Reconstruction May 29, 2025 Question Answering
Code Code Available 3Impromptu VLA: Open Weights and Open Data for Driving Vision-Language-Action Models May 29, 2025 Autonomous Driving Diagnostic
Code Code Available 3InfoChartQA: A Benchmark for Multimodal Question Answering on Infographic Charts May 25, 2025 Chart Understanding Question Answering
Code Code Available 3LLaMA-Omni2: LLM-based Real-time Spoken Chatbot with Autoregressive Streaming Speech Synthesis May 5, 2025 Chatbot Decoder
Code Code Available 3Ai2 Scholar QA: Organized Literature Synthesis with Attribution Apr 15, 2025 Question Answering Retrieval
Code Code Available 3Retrieval Augmented Generation and Understanding in Vision: A Survey and New Outlook Mar 23, 2025 3D Generation Medical Report Generation
Code Code Available 3MDocAgent: A Multi-Modal Multi-Agent Framework for Document Understanding Mar 18, 2025 document understanding Question Answering
Code Code Available 3VideoMind: A Chain-of-LoRA Agent for Long Video Reasoning Mar 17, 2025 Grounded Video Question Answering Question Answering
Code Code Available 3Reinforcement Learning Outperforms Supervised Fine-Tuning: A Case Study on Audio Question Answering Mar 14, 2025 Audio Question Answering Question Answering
Code Code Available 3SimLingo: Vision-Only Closed-Loop Autonomous Driving with Language-Action Alignment Mar 12, 2025 Autonomous Driving Bench2Drive
Code Code Available 3EgoLife: Towards Egocentric Life Assistant Mar 5, 2025 Question Answering Video Understanding
Code Code Available 3MLLMs Know Where to Look: Training-free Perception of Small Visual Details with Multimodal LLMs Feb 24, 2025 Question Answering Visual Question Answering
Code Code Available 3Baichuan-Audio: A Unified Framework for End-to-End Speech Interaction Feb 24, 2025 Language Modeling Language Modelling
Code Code Available 3Temporal Working Memory: Query-Guided Segment Refinement for Enhanced Multimodal Understanding Feb 9, 2025 Image Captioning Image-text Retrieval
Code Code Available 3VARGPT: Unified Understanding and Generation in a Visual Autoregressive Multimodal Large Language Model Jan 21, 2025 Image Generation Instruction Following
Code Code Available 3CAD-Recode: Reverse Engineering CAD Code from Point Clouds Dec 18, 2024 CAD Reconstruction Decoder
Code Code Available 3DARWIN 1.5: Large Language Models as Materials Science Adapted Learners Dec 16, 2024 Large Language Model Multi-Task Learning
Code Code Available 3Benchmarking Multimodal Retrieval Augmented Generation with Dynamic VQA Dataset and Self-adaptive Planning Agent Nov 5, 2024 Benchmarking Hallucination
Code Code Available 3A Comprehensive Survey of Small Language Models in the Era of Large Language Models: Techniques, Enhancements, Applications, Collaboration with LLMs, and Trustworthiness Nov 4, 2024 Question Answering Text Generation
Code Code Available 3Meta-Chunking: Learning Text Segmentation and Semantic Completion via Logical Perception Oct 16, 2024 Binary Classification Chunking
Code Code Available 3LongMemEval: Benchmarking Chat Assistants on Long-Term Interactive Memory Oct 14, 2024 Benchmarking Large Language Model
Code Code Available 3ReMEmbR: Building and Reasoning Over Long-Horizon Spatio-Temporal Memory for Robot Navigation Sep 20, 2024 Descriptive Question Answering
Code Code Available 3RAGEval: Scenario Specific RAG Evaluation Dataset Generation Framework Aug 2, 2024 Benchmarking Dataset Generation
Code Code Available 3Odyssey: Empowering Minecraft Agents with Open-World Skills Jul 22, 2024 Language Modelling Large Language Model
Code Code Available 3Evaluating Large Language Models with fmeval Jul 15, 2024 Question Answering
Code Code Available 3Searching for Best Practices in Retrieval-Augmented Generation Jul 1, 2024 Question Answering RAG
Code Code Available 3Detecting hallucinations in large language models using semantic entropy Jun 19, 2024 Large Language Model Question Answering
Code Code Available 3VoCo-LLaMA: Towards Vision Compression with Large Language Models Jun 18, 2024 Computational Efficiency Question Answering
Code Code Available 3AvaTaR: Optimizing LLM Agents for Tool Usage via Contrastive Reasoning Jun 17, 2024 Language Modeling Language Modelling
Code Code Available 3VideoGPT+: Integrating Image and Video Encoders for Enhanced Video Understanding Jun 13, 2024 Dense Video Captioning MVBench
Code Code Available 3Flash-VStream: Memory-Based Real-Time Understanding for Long Video Streams Jun 12, 2024 cross-modal alignment Language Modelling
Code Code Available 3Husky: A Unified, Open-Source Language Agent for Multi-Step Reasoning Jun 10, 2024 Multi-hop Question Answering Question Answering
Code Code Available 3CRAG -- Comprehensive RAG Benchmark Jun 7, 2024 Hallucination Language Modelling
Code Code Available 3GNN-RAG: Graph Neural Retrieval for Large Language Model Reasoning May 30, 2024 Graph Question Answering Knowledge Graphs
Code Code Available 3Hawk: Learning to Understand Open-World Video Anomalies May 27, 2024 Anomaly Detection Question Answering
Code Code Available 3Efficient Multimodal Large Language Models: A Survey May 17, 2024 Edge-computing Question Answering
Code Code Available 3From Matching to Generation: A Survey on Generative Information Retrieval Apr 23, 2024 Incremental Learning Information Retrieval
Code Code Available 3MixLoRA: Enhancing Large Language Models Fine-Tuning with LoRA-based Mixture of Experts Apr 22, 2024 Common Sense Reasoning GPU
Code Code Available 3View Selection for 3D Captioning via Diffusion Ranking Apr 11, 2024 3D Object Captioning Hallucination
Code Code Available 3MA-LMM: Memory-Augmented Large Multimodal Model for Long-Term Video Understanding Apr 8, 2024 GPU Multiple-choice
Code Code Available 3Evaluating Text-to-Visual Generation with Image-to-Text Generation Apr 1, 2024 Image to text Question Answering
Code Code Available 3