| A-MEM: Agentic Memory for LLM Agents | Feb 17, 2025 | Large Language Model | CodeCode Available | 4 |
| LLMDet: Learning Strong Open-Vocabulary Object Detectors under the Supervision of Large Language Models | Jan 31, 2025 | Caption GenerationLanguage Modeling | CodeCode Available | 4 |
| Beyond Reward Hacking: Causal Rewards for Large Language Model Alignment | Jan 16, 2025 | Causal Inferencecounterfactual | CodeCode Available | 4 |
| LLM4AD: A Platform for Algorithm Design with Large Language Model | Dec 23, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 4 |
| Liquid: Language Models are Scalable Multi-modal Generators | Dec 5, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 4 |
| A Preview of XiYan-SQL: A Multi-Generator Ensemble Framework for Text-to-SQL | Nov 13, 2024 | DiversityIn-Context Learning | CodeCode Available | 4 |
| SWE-Search: Enhancing Software Agents with Monte Carlo Tree Search and Iterative Refinement | Oct 26, 2024 | Large Language Model | CodeCode Available | 4 |
| Choices are More Important than Efforts: LLM Enables Efficient Multi-Agent Exploration | Oct 3, 2024 | DiversityLanguage Modeling | CodeCode Available | 4 |
| OpenMathInstruct-2: Accelerating AI for Math with Massive Open-Source Instruction Data | Oct 2, 2024 | Arithmetic ReasoningLarge Language Model | CodeCode Available | 4 |
| Data-Prep-Kit: getting your data ready for LLM application development | Sep 26, 2024 | CPULanguage Modeling | CodeCode Available | 4 |
| HLLM: Enhancing Sequential Recommendations via Hierarchical Large Language Models for Item and User Modeling | Sep 19, 2024 | Large Language ModelRecommendation Systems | CodeCode Available | 4 |
| Large Language Model-Based Agents for Software Engineering: A Survey | Sep 4, 2024 | AI AgentLanguage Modeling | CodeCode Available | 4 |
| Medical Graph RAG: Towards Safe Medical Large Language Model via Graph Retrieval-Augmented Generation | Aug 8, 2024 | ChunkingFact Checking | CodeCode Available | 4 |
| When AI Meets Finance (StockAgent): Large Language Model-based Stock Trading in Simulated Real-world Environments | Jul 15, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 4 |
| MAVIS: Mathematical Visual Instruction Tuning with an Automatic Data Engine | Jul 11, 2024 | Contrastive LearningLanguage Modelling | CodeCode Available | 4 |
| SEED-Story: Multimodal Long Story Generation with Large Language Model | Jul 11, 2024 | Image GenerationLanguage Modeling | CodeCode Available | 4 |
| YuLan: An Open-source Large Language Model | Jun 28, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 4 |
| AgentGym: Evolving Large Language Model-based Agents across Diverse Environments | Jun 6, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 4 |
| Skywork-MoE: A Deep Dive into Training Techniques for Mixture-of-Experts Language Models | Jun 3, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 4 |
| MAP-Neo: Highly Capable and Transparent Bilingual Large Language Model Series | May 29, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 4 |
| AutoCoder: Enhancing Code Large Language Model with AIEV-Instruct | May 23, 2024 | Class-level Code GenerationCode Completion | CodeCode Available | 4 |
| LLMC: Benchmarking Large Language Model Quantization with a Versatile Compression Toolkit | May 9, 2024 | BenchmarkingComputational Efficiency | CodeCode Available | 4 |
| SEED-Data-Edit Technical Report: A Hybrid Dataset for Instructional Image Editing | May 7, 2024 | Image ManipulationLanguage Modeling | CodeCode Available | 4 |
| QServe: W4A8KV4 Quantization and System Co-design for Efficient LLM Serving | May 7, 2024 | GPULanguage Modelling | CodeCode Available | 4 |
| Groma: Localized Visual Tokenization for Grounding Multimodal Large Language Models | Apr 19, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 4 |
| MiniGPT4-Video: Advancing Multimodal LLMs for Video Understanding with Interleaved Visual-Textual Tokens | Apr 4, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 4 |
| AutoWebGLM: A Large Language Model-based Web Navigating Agent | Apr 4, 2024 | Decision MakingLanguage Modeling | CodeCode Available | 4 |
| A Survey on Large Language Model-Based Game Agents | Apr 2, 2024 | Decision MakingLanguage Modeling | CodeCode Available | 4 |
| Learning to Generate Instruction Tuning Datasets for Zero-Shot Task Adaptation | Feb 28, 2024 | AttributeExtractive Question-Answering | CodeCode Available | 4 |
| Tower: An Open Multilingual Large Language Model for Translation-Related Tasks | Feb 27, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 4 |
| LLM Inference Unveiled: Survey and Roofline Model Insights | Feb 26, 2024 | Knowledge DistillationLanguage Modelling | CodeCode Available | 4 |
| RepoAgent: An LLM-Powered Open-Source Framework for Repository-level Code Documentation Generation | Feb 26, 2024 | Code Documentation GenerationCode Generation | CodeCode Available | 4 |
| Debug like a Human: A Large Language Model Debugger via Verifying Runtime Execution Step-by-step | Feb 25, 2024 | Code GenerationHumanEval | CodeCode Available | 4 |
| AnyGPT: Unified Multimodal LLM with Discrete Sequence Modeling | Feb 19, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 4 |
| Generative Representational Instruction Tuning | Feb 15, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 4 |
| LISA++: An Improved Baseline for Reasoning Segmentation with Large Language Model | Dec 28, 2023 | Instance SegmentationLanguage Modeling | CodeCode Available | 4 |
| G-LLaVA: Solving Geometric Problem with Multi-Modal Large Language Model | Dec 18, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 4 |
| FoundationPose: Unified 6D Pose Estimation and Tracking of Novel Objects | Dec 13, 2023 | 3D Object Detection3D Object Tracking | CodeCode Available | 4 |
| Video-LLaVA: Learning United Visual Representation by Alignment Before Projection | Nov 16, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 4 |
| SPHINX: The Joint Mixing of Weights, Tasks, and Visual Embeddings for Multi-modal Large Language Models | Nov 13, 2023 | Described Object DetectionLanguage Modeling | CodeCode Available | 4 |
| mPLUG-Owl2: Revolutionizing Multi-modal Large Language Model with Modality Collaboration | Nov 7, 2023 | 1 Image, 2*2 StitchingDecoder | CodeCode Available | 4 |
| DeepSpeed Ulysses: System Optimizations for Enabling Training of Extreme Long Sequence Transformer Models | Sep 25, 2023 | Language ModellingLarge Language Model | CodeCode Available | 4 |
| Safurai 001: New Qualitative Approach for Code LLM Evaluation | Sep 20, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 4 |
| A Survey on Large Language Model based Autonomous Agents | Aug 22, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 4 |
| ChatHaruhi: Reviving Anime Character in Reality via Large Language Model | Aug 18, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 4 |
| LISA: Reasoning Segmentation via Large Language Model | Aug 1, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 4 |
| How is ChatGPT's behavior changing over time? | Jul 18, 2023 | Code GenerationLanguage Modelling | CodeCode Available | 4 |
| INT2.1: Towards Fine-Tunable Quantized Large Language Models with Error Correction through Low-Rank Adaptation | Jun 13, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 4 |
| VisionLLM: Large Language Model is also an Open-Ended Decoder for Vision-Centric Tasks | May 18, 2023 | DecoderLanguage Modeling | CodeCode Available | 4 |
| Phoenix: Democratizing ChatGPT across Languages | Apr 20, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 4 |