DataComp-LM: In search of the next generation of training sets for language models Jun 17, 2024 Language Modelling MMLU Code Available 75
Mixture-of-Agents Enhances Large Language Model Capabilities Jun 7, 2024 Language Modeling Language Modelling Code Available 75
Harnessing the Power of LLMs in Practice: A Survey on ChatGPT and Beyond Apr 26, 2023 Language Modelling Natural Language Understanding Code Available 65
TaskWeaver: A Code-First Agent Framework Nov 29, 2023 Natural Language Understanding Code Available 55
How to Design Translation Prompts for ChatGPT: An Empirical Study Apr 5, 2023 Machine Translation Natural Language Understanding Code Available 55
MedCare: Advancing Medical LLMs through Decoupling Clinical Alignment and Knowledge Aggregation Jun 25, 2024 Diversity Natural Language Understanding Code Available 55
Agentic Retrieval-Augmented Generation: A Survey on Agentic RAG Jan 15, 2025 Natural Language Understanding RAG Code Available 55
MING-MOE: Enhancing Medical Multi-Task Learning in Large Language Models with Sparse Mixture of Low-Rank Adapter Experts Apr 13, 2024 Diversity Language Modeling Code Available 55
Zero-Shot Learners for Natural Language Understanding via a Unified Multiple Choice Perspective Oct 16, 2022 Coreference Resolution Multiple-choice Code Available 45
Decoder Tuning: Efficient Language Understanding as Decoding Dec 16, 2022 Decoder Natural Language Understanding Code Available 45
A Survey on Vision-Language-Action Models for Autonomous Driving Jun 30, 2025 Autonomous Driving Autonomous Vehicles Code Available 45
DyLoRA: Parameter Efficient Tuning of Pre-trained Models using Dynamic Search-Free Low-Rank Adaptation Oct 14, 2022 Natural Language Understanding Text Generation Code Available 45
What Makes Good In-Context Examples for GPT-3? Jan 17, 2021 Few-Shot Learning Natural Language Understanding Code Available 45
MARIO: MAth Reasoning with code Interpreter Output -- A Reproducible Pipeline Jan 16, 2024 GSM8K Math Code Available 35
GLM: General Language Model Pretraining with Autoregressive Blank Infilling Mar 18, 2021 Abstractive Text Summarization Classification Code Available 35
Ludwig: a type-based declarative deep learning toolbox Sep 17, 2019 Decoder Deep Learning Code Available 35
Attention Is All You Need Jun 12, 2017 Abstractive Text Summarization All Code Available 35
GameBench: Evaluating Strategic Reasoning Abilities of LLM Agents Jun 7, 2024 Natural Language Understanding Code Available 35
Large Language Model-Brained GUI Agents: A Survey Nov 27, 2024 Code Generation Language Modeling Code Available 35
ERNIE-Gram: Pre-Training with Explicitly N-Gram Masked Language Modeling for Natural Language Understanding Oct 23, 2020 Language Modeling Language Modelling Code Available 35
Arctic-Text2SQL-R1: Simple Rewards, Strong Reasoning in Text-to-SQL May 22, 2025 Natural Language Understanding Reinforcement Learning (RL) Code Available 35
Efficient Large Language Models: A Survey Dec 6, 2023 Natural Language Understanding Survey Code Available 35
SAMWISE: Infusing Wisdom in SAM2 for Text-Driven Video Segmentation Nov 26, 2024 Natural Language Understanding Referring Video Object Segmentation Code Available 35
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding Oct 11, 2018 Citation Intent Classification Common Sense Reasoning Code Available 35
Tree Search for Language Model Agents Jul 1, 2024 Language Modeling Language Modelling Code Available 35
MathOdyssey: Benchmarking Mathematical Problem-Solving Skills in Large Language Models Using Odyssey Math Data Jun 26, 2024 Benchmarking Math Code Available 25
MASSIVE: A 1M-Example Multilingual Natural Language Understanding Dataset with 51 Typologically-Diverse Languages Apr 18, 2022 intent-classification Intent Classification Code Available 25
MetaMath: Bootstrap Your Own Mathematical Questions for Large Language Models Sep 21, 2023 Arithmetic Reasoning GSM8K Code Available 25
Autonomous GIS: the next-generation AI-powered GIS May 10, 2023 Code Generation Information Retrieval Code Available 25
MCP-Solver: Integrating Language Models with Constraint Programming Systems Dec 31, 2024 Natural Language Understanding Code Available 25
3DGraphLLM: Combining Semantic Graphs and Large Language Models for 3D Scene Understanding Dec 24, 2024 Natural Language Understanding Scene Understanding Code Available 25
LoRA-Pro: Are Low-Rank Adapters Properly Optimized? Jul 25, 2024 Code Generation Computational Efficiency Code Available 25
Learning Transferable Visual Models From Natural Language Supervision Feb 26, 2021 Action Recognition Benchmarking Code Available 25
LoRI: Reducing Cross-Task Interference in Multi-Task Low-Rank Adaptation Apr 10, 2025 Code Generation Continual Learning Code Available 25
Make LoRA Great Again: Boosting LoRA with Adaptive Singular Values and Mixture-of-Experts Optimization Alignment Feb 24, 2025 image-classification Image Classification Code Available 25
MindMap: Knowledge Graph Prompting Sparks Graph of Thoughts in Large Language Models Aug 17, 2023 Decision Making Hallucination Code Available 25
It's Not Just Size That Matters: Small Language Models Are Also Few-Shot Learners Sep 15, 2020 Natural Language Understanding Code Available 25
An empirical study of LLaMA3 quantization: from LLMs to MLLMs Apr 22, 2024 Language Modelling Large Language Model Code Available 25
JGLUE: Japanese General Language Understanding Evaluation Jun 1, 2022 FLUE Natural Language Understanding Code Available 25
Encoder vs Decoder: Comparative Analysis of Encoder and Decoder Language Models on Multilingual NLU Tasks Jun 19, 2024 Decoder Language Modeling Code Available 25
GPT Understands, Too Mar 18, 2021 Knowledge Probing Language Modeling Code Available 25
DeBERTaV3: Improving DeBERTa using ELECTRA-Style Pre-Training with Gradient-Disentangled Embedding Sharing Nov 18, 2021 Language Modeling Language Modelling Code Available 25
Improving Text Embeddings for Smaller Language Models Using Contrastive Fine-tuning Aug 1, 2024 Language Modeling Language Modelling Code Available 25
DeBERTa: Decoding-enhanced BERT with Disentangled Attention Jun 5, 2020 Common Sense Reasoning Coreference Resolution Code Available 25
I-BERT: Integer-only BERT Quantization Jan 5, 2021 GPU Natural Language Inference Code Available 25
ColBERT: Efficient and Effective Passage Search via Contextualized Late Interaction over BERT Apr 27, 2020 Document Ranking Information Retrieval Code Available 25
BHASA: A Holistic Southeast Asian Linguistic and Cultural Evaluation Suite for Large Language Models Sep 12, 2023 Diagnostic Natural Language Understanding Code Available 25
Large Language Models Meet Knowledge Graphs for Question Answering: Synthesis and Opportunities May 26, 2025 Knowledge Graphs Natural Language Understanding Code Available 25
LoftQ: LoRA-Fine-Tuning-Aware Quantization for Large Language Models Oct 12, 2023 Natural Language Understanding Quantization Code Available 25
DialogStudio: Towards Richest and Most Diverse Unified Dataset Collection for Conversational AI Jul 19, 2023 Conversational Recommendation Diversity Code Available 25