A Survey of Knowledge Graph Reasoning on Graph Types: Static, Dynamic, and Multimodal Dec 12, 2022 General Knowledge Graph Embedding
Code Code Available 35 A Comprehensive Survey of Small Language Models in the Era of Large Language Models: Techniques, Enhancements, Applications, Collaboration with LLMs, and Trustworthiness Nov 4, 2024 Question Answering Text Generation
Code Code Available 35 Attention Is All You Need Jun 12, 2017 Abstractive Text Summarization All
Code Code Available 35 EgoLife: Towards Egocentric Life Assistant Mar 5, 2025 Question Answering Video Understanding
Code Code Available 35 Retrieval Augmented Generation and Understanding in Vision: A Survey and New Outlook Mar 23, 2025 3D Generation Medical Report Generation
Code Code Available 35 3D-LLM: Injecting the 3D World into Large Language Models Jul 24, 2023 3D Object Captioning 3D Question Answering (3D-QA)
Code Code Available 35 DriveLM: Driving with Graph Visual Question Answering Dec 21, 2023 Autonomous Driving Question Answering
Code Code Available 35 Prompting Is Programming: A Query Language for Large Language Models Dec 12, 2022 Code Generation Language Modeling
Code Code Available 35 PreFLMR: Scaling Up Fine-Grained Late-Interaction Multi-modal Retrievers Feb 13, 2024 Question Answering Retrieval
Code Code Available 35 PCToolkit: A Unified Plug-and-Play Prompt Compression Toolkit of Large Language Models Mar 26, 2024 Code Completion Few-Shot Learning
Code Code Available 35 Ai2 Scholar QA: Organized Literature Synthesis with Attribution Apr 15, 2025 Question Answering Retrieval
Code Code Available 35 ST-MoE: Designing Stable and Transferable Sparse Expert Models Feb 17, 2022 ARC Common Sense Reasoning
Code Code Available 35 Detecting hallucinations in large language models using semantic entropy Jun 19, 2024 Large Language Model Question Answering
Code Code Available 35 ONE-PEACE: Exploring One General Representation Model Toward Unlimited Modalities May 18, 2023 1 Image, 2*2 Stitchi Action Classification
Code Code Available 35 All You May Need for VQA are Image Captions May 4, 2022 All Image Captioning
Code Code Available 35 Odyssey: Empowering Minecraft Agents with Open-World Skills Jul 22, 2024 Language Modelling Large Language Model
Code Code Available 35 Evaluating Hallucinations in Chinese Large Language Models Oct 5, 2023 Hallucination Question Answering
Code Code Available 35 Self-QA: Unsupervised Knowledge Guided Language Model Alignment May 19, 2023 Diversity Language Modeling
Code Code Available 35 CRAG -- Comprehensive RAG Benchmark Jun 7, 2024 Hallucination Language Modelling
Code Code Available 35 MLLMs Know Where to Look: Training-free Perception of Small Visual Details with Multimodal LLMs Feb 24, 2025 Question Answering Visual Question Answering
Code Code Available 35 CRUD-RAG: A Comprehensive Chinese Benchmark for Retrieval-Augmented Generation of Large Language Models Jan 30, 2024 Knowledge Base Construction Question Answering
Code Code Available 35 MixLoRA: Enhancing Large Language Models Fine-Tuning with LoRA-based Mixture of Experts Apr 22, 2024 Common Sense Reasoning GPU
Code Code Available 35 Meta-Chunking: Learning Text Segmentation and Semantic Completion via Logical Perception Oct 16, 2024 Binary Classification Chunking
Code Code Available 35 MDocAgent: A Multi-Modal Multi-Agent Framework for Document Understanding Mar 18, 2025 document understanding Question Answering
Code Code Available 35 RAGEval: Scenario Specific RAG Evaluation Dataset Generation Framework Aug 2, 2024 Benchmarking Dataset Generation
Code Code Available 35 Monkey: Image Resolution and Text Label Are Important Things for Large Multi-modal Models Nov 11, 2023 Image Captioning MMR total
Code Code Available 35 LongMemEval: Benchmarking Chat Assistants on Long-Term Interactive Memory Oct 14, 2024 Benchmarking Large Language Model
Code Code Available 35 M3D: Advancing 3D Medical Image Analysis with Multi-Modal Large Language Models Mar 31, 2024 Image-text Retrieval Language Modeling
Code Code Available 35 Longformer: The Long-Document Transformer Apr 10, 2020 Decoder Language Modeling
Code Code Available 35 A Survey of Large Language Models in Finance (FinLLMs) Feb 4, 2024 Named Entity Recognition (NER) Question Answering
Code Code Available 35 Reinforcement Learning Outperforms Supervised Fine-Tuning: A Case Study on Audio Question Answering Mar 14, 2025 Audio Question Answering Question Answering
Code Code Available 35 ReMEmbR: Building and Reasoning Over Long-Horizon Spatio-Temporal Memory for Robot Navigation Sep 20, 2024 Descriptive Question Answering
Code Code Available 35 MA-LMM: Memory-Augmented Large Multimodal Model for Long-Term Video Understanding Apr 8, 2024 GPU Multiple-choice
Code Code Available 35 Champion Solution for the WSDM2023 Toloka VQA Challenge Jan 22, 2023 Question Answering Visual Grounding
Code Code Available 35 LLaMA-Omni2: LLM-based Real-time Spoken Chatbot with Autoregressive Streaming Speech Synthesis May 5, 2025 Chatbot Decoder
Code Code Available 35 L0: Reinforcement Learning to Become General Agents Jun 30, 2025 Question Answering reinforcement-learning
Code Code Available 35 ERNIE 2.0: A Continual Pre-training Framework for Language Understanding Jul 29, 2019 Chinese Named Entity Recognition Chinese Reading Comprehension
Code Code Available 35 Language Models are Few-Shot Learners May 28, 2020 answerability prediction Articles
Code Code Available 35 CAD-Recode: Reverse Engineering CAD Code from Point Clouds Dec 18, 2024 CAD Reconstruction Decoder
Code Code Available 35 KVzip: Query-Agnostic KV Cache Compression with Context Reconstruction May 29, 2025 Question Answering
Code Code Available 35 Language Models Are Greedy Reasoners: A Systematic Formal Analysis of Chain-of-Thought Oct 3, 2022 Mathematical Reasoning Question Answering
Code Code Available 35 Adaptive-RAG: Learning to Adapt Retrieval-Augmented Large Language Models through Question Complexity Mar 21, 2024 Question Answering RAG
Code Code Available 35 Husky: A Unified, Open-Source Language Agent for Multi-Step Reasoning Jun 10, 2024 Multi-hop Question Answering Question Answering
Code Code Available 35 Impromptu VLA: Open Weights and Open Data for Driving Vision-Language-Action Models May 29, 2025 Autonomous Driving Diagnostic
Code Code Available 35 BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding Oct 11, 2018 Citation Intent Classification Common Sense Reasoning
Code Code Available 35 SVIT: Scaling up Visual Instruction Tuning Jul 9, 2023 Diversity Image Captioning
Code Code Available 35 Evaluating Large Language Models with fmeval Jul 15, 2024 Question Answering
Code Code Available 35 Benchmarking Multimodal Retrieval Augmented Generation with Dynamic VQA Dataset and Self-adaptive Planning Agent Nov 5, 2024 Benchmarking Hallucination
Code Code Available 35 DARWIN 1.5: Large Language Models as Materials Science Adapted Learners Dec 16, 2024 Large Language Model Multi-Task Learning
Code Code Available 35 Hawk: Learning to Understand Open-World Video Anomalies May 27, 2024 Anomaly Detection Question Answering
Code Code Available 35