DataComp-LM: In search of the next generation of training sets for language models Jun 17, 2024 Language Modelling MMLU
Code Code Available 7Mixture-of-Agents Enhances Large Language Model Capabilities Jun 7, 2024 Language Modeling Language Modelling
Code Code Available 7Harnessing the Power of LLMs in Practice: A Survey on ChatGPT and Beyond Apr 26, 2023 Language Modelling Natural Language Understanding
Code Code Available 6Agentic Retrieval-Augmented Generation: A Survey on Agentic RAG Jan 15, 2025 Natural Language Understanding RAG
Code Code Available 5MedCare: Advancing Medical LLMs through Decoupling Clinical Alignment and Knowledge Aggregation Jun 25, 2024 Diversity Natural Language Understanding
Code Code Available 5MING-MOE: Enhancing Medical Multi-Task Learning in Large Language Models with Sparse Mixture of Low-Rank Adapter Experts Apr 13, 2024 Diversity Language Modeling
Code Code Available 5TaskWeaver: A Code-First Agent Framework Nov 29, 2023 Natural Language Understanding
Code Code Available 5How to Design Translation Prompts for ChatGPT: An Empirical Study Apr 5, 2023 Machine Translation Natural Language Understanding
Code Code Available 5A Survey on Vision-Language-Action Models for Autonomous Driving Jun 30, 2025 Autonomous Driving Autonomous Vehicles
Code Code Available 4Decoder Tuning: Efficient Language Understanding as Decoding Dec 16, 2022 Decoder Natural Language Understanding
Code Code Available 4Zero-Shot Learners for Natural Language Understanding via a Unified Multiple Choice Perspective Oct 16, 2022 Coreference Resolution Multiple-choice
Code Code Available 4DyLoRA: Parameter Efficient Tuning of Pre-trained Models using Dynamic Search-Free Low-Rank Adaptation Oct 14, 2022 Natural Language Understanding Text Generation
Code Code Available 4What Makes Good In-Context Examples for GPT-3? Jan 17, 2021 Few-Shot Learning Natural Language Understanding
Code Code Available 4Arctic-Text2SQL-R1: Simple Rewards, Strong Reasoning in Text-to-SQL May 22, 2025 Natural Language Understanding Reinforcement Learning (RL)
Code Code Available 3Large Language Model-Brained GUI Agents: A Survey Nov 27, 2024 Code Generation Language Modeling
Code Code Available 3SAMWISE: Infusing Wisdom in SAM2 for Text-Driven Video Segmentation Nov 26, 2024 Natural Language Understanding Referring Video Object Segmentation
Code Code Available 3Tree Search for Language Model Agents Jul 1, 2024 Language Modeling Language Modelling
Code Code Available 3GameBench: Evaluating Strategic Reasoning Abilities of LLM Agents Jun 7, 2024 Natural Language Understanding
Code Code Available 3MARIO: MAth Reasoning with code Interpreter Output -- A Reproducible Pipeline Jan 16, 2024 GSM8K Math
Code Code Available 3Efficient Large Language Models: A Survey Dec 6, 2023 Natural Language Understanding Survey
Code Code Available 3GLM: General Language Model Pretraining with Autoregressive Blank Infilling Mar 18, 2021 Abstractive Text Summarization Classification
Code Code Available 3ERNIE-Gram: Pre-Training with Explicitly N-Gram Masked Language Modeling for Natural Language Understanding Oct 23, 2020 Language Modeling Language Modelling
Code Code Available 3Ludwig: a type-based declarative deep learning toolbox Sep 17, 2019 Decoder Deep Learning
Code Code Available 3BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding Oct 11, 2018 Citation Intent Classification Common Sense Reasoning
Code Code Available 3Attention Is All You Need Jun 12, 2017 Abstractive Text Summarization All
Code Code Available 3Vision Language Action Models in Robotic Manipulation: A Systematic Review Jul 14, 2025 Dataset Generation Natural Language Understanding
Code Code Available 2Large Language Models Meet Knowledge Graphs for Question Answering: Synthesis and Opportunities May 26, 2025 Knowledge Graphs Natural Language Understanding
Code Code Available 2An Empirical Study of Qwen3 Quantization May 4, 2025 Natural Language Understanding Quantization
Code Code Available 2LoRI: Reducing Cross-Task Interference in Multi-Task Low-Rank Adaptation Apr 10, 2025 Code Generation Continual Learning
Code Code Available 2BEVDriver: Leveraging BEV Maps in LLMs for Robust Closed-Loop Driving Mar 5, 2025 Autonomous Driving Motion Planning
Code Code Available 2Make LoRA Great Again: Boosting LoRA with Adaptive Singular Values and Mixture-of-Experts Optimization Alignment Feb 24, 2025 image-classification Image Classification
Code Code Available 2MCP-Solver: Integrating Language Models with Constraint Programming Systems Dec 31, 2024 Natural Language Understanding
Code Code Available 23DGraphLLM: Combining Semantic Graphs and Large Language Models for 3D Scene Understanding Dec 24, 2024 Natural Language Understanding Scene Understanding
Code Code Available 2Large Language Model Safety: A Holistic Survey Dec 23, 2024 Language Modeling Language Modelling
Code Code Available 2Selective Aggregation for Low-Rank Adaptation in Federated Learning Oct 2, 2024 Federated Learning General Knowledge
Code Code Available 2Balancing LoRA Performance and Efficiency with Simple Shard Sharing Sep 19, 2024 Computational Efficiency GSM8K
Code Code Available 2Improving Text Embeddings for Smaller Language Models Using Contrastive Fine-tuning Aug 1, 2024 Language Modeling Language Modelling
Code Code Available 2LoRA-Pro: Are Low-Rank Adapters Properly Optimized? Jul 25, 2024 Code Generation Computational Efficiency
Code Code Available 2MathOdyssey: Benchmarking Mathematical Problem-Solving Skills in Large Language Models Using Odyssey Math Data Jun 26, 2024 Benchmarking Math
Code Code Available 2Encoder vs Decoder: Comparative Analysis of Encoder and Decoder Language Models on Multilingual NLU Tasks Jun 19, 2024 Decoder Language Modeling
Code Code Available 2SliM-LLM: Salience-Driven Mixed-Precision Quantization for Large Language Models May 23, 2024 Natural Language Understanding Quantization
Code Code Available 2Parameter-Efficient Fine-Tuning with Discrete Fourier Transform May 5, 2024 image-classification Image Classification
Code Code Available 2An empirical study of LLaMA3 quantization: from LLMs to MLLMs Apr 22, 2024 Language Modelling Large Language Model
Code Code Available 2CleanAgent: Automating Data Standardization with LLM-based Agents Mar 13, 2024 Code Generation Natural Language Understanding
Code Code Available 2SciAssess: Benchmarking LLM Proficiency in Scientific Literature Analysis Mar 4, 2024 Benchmarking Drug Discovery
Code Code Available 2The First Place Solution of WSDM Cup 2024: Leveraging Large Language Models for Conversational Multi-Doc QA Feb 28, 2024 Natural Language Understanding Question Answering
Code Code Available 2TURNA: A Turkish Encoder-Decoder Language Model for Enhanced Understanding and Generation Jan 25, 2024 Decoder Language Modeling
Code Code Available 2Unified-IO 2: Scaling Autoregressive Multimodal Models with Vision, Language, Audio, and Action Dec 28, 2023 Decoder Image Generation
Code Code Available 2Reflection-Tuning: Data Recycling Improves LLM Instruction-Tuning Oct 18, 2023 Natural Language Understanding
Code Code Available 2LoftQ: LoRA-Fine-Tuning-Aware Quantization for Large Language Models Oct 12, 2023 Natural Language Understanding Quantization
Code Code Available 2