| Partially Rewriting a Transformer in Natural Language | Jan 31, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 3 |
| HERMES: A Unified Self-Driving World Model for Simultaneous 3D Scene Understanding and Generation | Jan 24, 2025 | Autonomous DrivingLanguage Modeling | CodeCode Available | 3 |
| VARGPT: Unified Understanding and Generation in a Visual Autoregressive Multimodal Large Language Model | Jan 21, 2025 | Image GenerationInstruction Following | CodeCode Available | 3 |
| Lifelong Learning of Large Language Model based Agents: A Roadmap | Jan 13, 2025 | Incremental LearningLanguage Modeling | CodeCode Available | 3 |
| Valley2: Exploring Multimodal Models with Scalable Vision-Language Design | Jan 10, 2025 | Image CaptioningLanguage Modeling | CodeCode Available | 3 |
| LangFair: A Python Package for Assessing Bias and Fairness in Large Language Model Use Cases | Jan 6, 2025 | FairnessLanguage Modeling | CodeCode Available | 3 |
| A Survey on Large Language Model Acceleration based on KV Cache Management | Dec 27, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 3 |
| DARWIN 1.5: Large Language Models as Materials Science Adapted Learners | Dec 16, 2024 | Large Language ModelMulti-Task Learning | CodeCode Available | 3 |
| ATPrompt: Textual Prompt Learning with Embedded Attributes | Dec 12, 2024 | AttributeLarge Language Model | CodeCode Available | 3 |
| From Individual to Society: A Survey on Social Simulation Driven by Large Language Model-based Agents | Dec 4, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 3 |
| HackSynth: LLM Agent and Evaluation Framework for Autonomous Penetration Testing | Dec 2, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 3 |
| Large Language Model-Brained GUI Agents: A Survey | Nov 27, 2024 | Code GenerationLanguage Modeling | CodeCode Available | 3 |
| Pushing the Limits of Large Language Model Quantization via the Linearity Theorem | Nov 26, 2024 | GPULanguage Modeling | CodeCode Available | 3 |
| BayLing 2: A Multilingual Large Language Model with Efficient Language Alignment | Nov 25, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 3 |
| SemiKong: Curating, Training, and Evaluating A Semiconductor Industry-Specific Large Language Model | Nov 21, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 3 |
| SuffixDecoding: Extreme Speculative Decoding for Emerging AI Applications | Nov 7, 2024 | Code GenerationLanguage Modeling | CodeCode Available | 3 |
| COAT: Compressing Optimizer states and Activation for Memory-Efficient FP8 Training | Oct 25, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 3 |
| LongMemEval: Benchmarking Chat Assistants on Long-Term Interactive Memory | Oct 14, 2024 | BenchmarkingLarge Language Model | CodeCode Available | 3 |
| Baichuan-Omni Technical Report | Oct 11, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 3 |
| Towards Next-Generation LLM-based Recommender Systems: A Survey and Beyond | Oct 10, 2024 | Large Language ModelRecommendation Systems | CodeCode Available | 3 |
| LayerKV: Optimizing Large Language Model Serving with Layer-wise KV Cache Management | Oct 1, 2024 | GPULanguage Modeling | CodeCode Available | 3 |
| Programming Every Example: Lifting Pre-training Data Quality like Experts at Scale | Sep 25, 2024 | Large Language Model | CodeCode Available | 3 |
| LLMServingSim: A HW/SW Co-Simulation Infrastructure for LLM Inference Serving at Scale | Aug 10, 2024 | GPULanguage Modelling | CodeCode Available | 3 |
| OptiMUS-0.3: Using Large Language Models to Model and Solve Optimization Problems at Scale | Jul 29, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 3 |
| Odyssey: Empowering Minecraft Agents with Open-World Skills | Jul 22, 2024 | Language ModellingLarge Language Model | CodeCode Available | 3 |