DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning Jan 22, 2025 Mathematical Reasoning Multi-task Language Understanding
Code Code Available 15From Local to Global: A Graph RAG Approach to Query-Focused Summarization Apr 24, 2024 Query-focused Summarization Question Answering
Code Code Available 13SWIFT:A Scalable lightWeight Infrastructure for Fine-Tuning Aug 10, 2024 Hallucination Optical Character Recognition
Code Code Available 11WebWalker: Benchmarking LLMs in Web Traversal Jan 13, 2025 Benchmarking Open-Domain Question Answering
Code Code Available 11Visually Descriptive Language Model for Vector Graphics Reasoning Apr 9, 2024 Descriptive Language Modeling
Code Code Available 9KAG: Boosting LLMs in Professional Domains via Knowledge Augmented Generation Sep 10, 2024 Knowledge Graphs Question Answering
Code Code Available 9BABILong: Testing the Limits of LLMs with Long Context Reasoning-in-a-Haystack Jun 14, 2024 Question Answering Retrieval-augmented Generation
Code Code Available 9DeepSeek-VL2: Mixture-of-Experts Vision-Language Models for Advanced Multimodal Understanding Dec 13, 2024 Chart Understanding Mixture-of-Experts
Code Code Available 9Llama 2: Open Foundation and Fine-Tuned Chat Models Jul 18, 2023 Arithmetic Reasoning
Code Code Available 8LLM-AutoDiff: Auto-Differentiate Any LLM Workflow Jan 28, 2025 Prompt Engineering Question Answering
Code Code Available 7MiniGPT-v2: large language model as a unified interface for vision-language multi-task learning Oct 14, 2023 Image Classification Image Description
Code Code Available 7Demonstrate-Search-Predict: Composing retrieval and language models for knowledge-intensive NLP Dec 28, 2022 In-Context Learning Language Modelling
Code Code Available 7MemoRAG: Moving towards Next-Gen RAG Via Memory-Inspired Knowledge Discovery Sep 9, 2024 Memorization Question Answering
Code Code Available 7DSPy: Compiling Declarative Language Model Calls into Self-Improving Pipelines Oct 5, 2023 Language Modeling Language Modelling
Code Code Available 7TextGrad: Automatic "Differentiation" via Text Jun 11, 2024 Question Answering Specificity
Code Code Available 7Lumina-mGPT: Illuminate Flexible Photorealistic Text-to-Image Generation with Multimodal Generative Pretraining Aug 5, 2024 Decoder Depth Estimation
Code Code Available 7HippoRAG: Neurobiologically Inspired Long-Term Memory for Large Language Models May 23, 2024 Hippocampus Knowledge Graphs
Code Code Available 7Kimi-Audio Technical Report Apr 25, 2025 Audio Question Answering Question Answering
Code Code Available 7GLM-4-Voice: Towards Intelligent and Human-Like End-to-End Spoken Chatbot Dec 3, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 7Ichigo: Mixed-Modal Early-Fusion Realtime Voice Assistant Oct 20, 2024 Question Answering speech-recognition
Code Code Available 7Search-R1: Training LLMs to Reason and Leverage Search Engines with Reinforcement Learning Mar 12, 2025 Question Answering RAG
Code Code Available 7Chameleon: Mixed-Modal Early-Fusion Foundation Models May 16, 2024 Image Captioning Image Generation
Code Code Available 7When LLMs step into the 3D World: A Survey and Meta-Analysis of 3D Tasks via Multi-modal Large Language Models May 16, 2024 In-Context Learning Question Answering
Code Code Available 7V-JEPA 2: Self-Supervised Video Models Enable Understanding, Prediction and Planning Jun 11, 2025 Action Anticipation Large Language Model
Code Code Available 7LLaMA: Open and Efficient Foundation Language Models Feb 27, 2023 Arithmetic Reasoning Code Generation
Code Code Available 7LLaVA-CoT: Let Vision Language Models Reason Step-by-Step Nov 15, 2024 Logical Reasoning Multimodal Reasoning
Code Code Available 7Scaling Speech-Text Pre-training with Synthetic Interleaved Data Nov 26, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 7PEER: Expertizing Domain-Specific Tasks with a Multi-Agent Framework and Tuning Methods Jul 9, 2024 Information Retrieval LEMMA
Code Code Available 7Training Compute-Optimal Large Language Models Mar 29, 2022 Anachronisms Analogical Similarity
Code Code Available 6Training language models to follow instructions with human feedback Mar 4, 2022 Question Answering
Code Code Available 6LongLoRA: Efficient Fine-tuning of Long-Context Large Language Models Sep 21, 2023 4k GPU
Code Code Available 6Chain-of-Thought Prompting Elicits Reasoning in Large Language Models Jan 28, 2022 Common Sense Reasoning GSM8K
Code Code Available 6Pythia: A Suite for Analyzing Large Language Models Across Training and Scaling Apr 3, 2023 Common Sense Reasoning Coreference Resolution
Code Code Available 6GPT-4 Technical Report Mar 15, 2023 answerability prediction Arithmetic Reasoning
Code Code Available 6Automatic Chain of Thought Prompting in Large Language Models Oct 7, 2022 Diversity Question Answering
Code Code Available 6h2oGPT: Democratizing Large Language Models Jun 13, 2023 Chatbot Fairness
Code Code Available 6Mistral 7B Oct 10, 2023 answerability prediction Arithmetic Reasoning
Code Code Available 6RET-LLM: Towards a General Read-Write Memory for Large Language Models May 23, 2023 Question Answering
Code Code Available 6TextMonkey: An OCR-Free Large Multimodal Model for Understanding Document Mar 7, 2024 document understanding Key Information Extraction
Code Code Available 5Tree of Thoughts: Deliberate Problem Solving with Large Language Models May 17, 2023 Arithmetic Reasoning Decision Making
Code Code Available 5TrustRAG: An Information Assistant with Retrieval Augmented Generation Feb 19, 2025 Answer Generation Chunking
Code Code Available 5Continuous Thought Machines May 8, 2025 Computational Efficiency Question Answering
Code Code Available 5Show-o: One Single Transformer to Unify Multimodal Understanding and Generation Aug 22, 2024 10-shot image generation
Code Code Available 5BRIGHT: A Realistic and Challenging Benchmark for Reasoning-Intensive Retrieval Jul 16, 2024 Question Answering Retrieval
Code Code Available 5Can Generalist Foundation Models Outcompete Special-Purpose Tuning? Case Study in Medicine Nov 28, 2023 Electrical Engineering Experimental Design
Code Code Available 5KBLaM: Knowledge Base augmented Language Model Oct 14, 2024 8k GPU
Code Code Available 5Search-o1: Agentic Search-Enhanced Large Reasoning Models Jan 9, 2025 Code Generation
Code Code Available 5Qwen-VL: A Versatile Vision-Language Model for Understanding, Localization, Text Reading, and Beyond Aug 24, 2023 Chart Question Answering FS-MEVQA
Code Code Available 5RAG-R1 : Incentivize the Search and Reasoning Capabilities of LLMs through Multi-query Parallelism Jun 30, 2025 Question Answering RAG
Code Code Available 5Pixel-SAIL: Single Transformer For Pixel-Grounded Understanding Apr 14, 2025 Question Answering
Code Code Available 5