| OptiMUS: Scalable Optimization Modeling with (MI)LP Solvers and Large Language Models | Feb 15, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 3 |
| 4D Panoptic Scene Graph Generation | May 16, 2024 | 4D Panoptic SegmentationGraph Generation | CodeCode Available | 3 |
| OptiMUS-0.3: Using Large Language Models to Model and Solve Optimization Problems at Scale | Jul 29, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 3 |
| Baichuan-Omni Technical Report | Oct 11, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 3 |
| Evalverse: Unified and Accessible Library for Large Language Model Evaluation | Apr 1, 2024 | Language Model EvaluationLanguage Modeling | CodeCode Available | 3 |
| Developing Generalist Foundation Models from a Multimodal Dataset for 3D Computed Tomography | Mar 26, 2024 | Anomaly DetectionLarge Language Model | CodeCode Available | 3 |
| Baichuan-Audio: A Unified Framework for End-to-End Speech Interaction | Feb 24, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 3 |
| Parallelized Planning-Acting for Efficient LLM-based Multi-Agent Systems | Mar 5, 2025 | Decision MakingLanguage Modeling | CodeCode Available | 3 |
| Enhancing Decision Analysis with a Large Language Model: pyDecision a Comprehensive Library of MCDA Methods in Python | Apr 9, 2024 | Decision MakingLanguage Modeling | CodeCode Available | 3 |
| Odyssey: Empowering Minecraft Agents with Open-World Skills | Jul 22, 2024 | Language ModellingLarge Language Model | CodeCode Available | 3 |
| AvaTaR: Optimizing LLM Agents for Tool Usage via Contrastive Reasoning | Jun 17, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 3 |
| Evaluation Report on MCP Servers | Apr 15, 2025 | Large Language Model | CodeCode Available | 3 |
| OceanGPT: A Large Language Model for Ocean Science Tasks | Oct 3, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 3 |
| OpenGraph: Towards Open Graph Foundation Models | Mar 2, 2024 | Data AugmentationGraph Learning | CodeCode Available | 3 |
| Partially Rewriting a Transformer in Natural Language | Jan 31, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 3 |
| DriveDreamer-2: LLM-Enhanced World Models for Diverse Driving Video Generation | Mar 11, 2024 | Autonomous DrivingLanguage Modeling | CodeCode Available | 3 |
| SemiKong: Curating, Training, and Evaluating A Semiconductor Industry-Specific Large Language Model | Nov 21, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 3 |
| Sentient Agent as a Judge: Evaluating Higher-Order Social Cognition in Large Language Models | May 1, 2025 | Large Language Model | CodeCode Available | 3 |
| Chat-Edit-3D: Interactive 3D Scene Editing via Text Prompts | Jul 9, 2024 | 3D Object Editing3D Reconstruction | CodeCode Available | 3 |
| GNN-RAG: Graph Neural Retrieval for Large Language Model Reasoning | May 30, 2024 | Graph Question AnsweringKnowledge Graphs | CodeCode Available | 3 |
| MoMA: Multimodal LLM Adapter for Fast Personalized Image Generation | Apr 8, 2024 | Image GenerationImage-to-Image Translation | CodeCode Available | 3 |
| Multi-agent Architecture Search via Agentic Supernet | Feb 6, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 3 |
| ATPrompt: Textual Prompt Learning with Embedded Attributes | Dec 12, 2024 | AttributeLarge Language Model | CodeCode Available | 3 |
| AsymLoRA: Harmonizing Data Conflicts and Commonalities in MLLMs | Feb 27, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 3 |
| A Survey on the Optimization of Large Language Model-based Agents | Mar 16, 2025 | Decision MakingLanguage Modeling | CodeCode Available | 3 |
| A Review of Prominent Paradigms for LLM-Based Agents: Tool Use (Including RAG), Planning, and Feedback Learning | Jun 9, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 3 |
| A Survey on Large Language Model Acceleration based on KV Cache Management | Dec 27, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 3 |
| A Survey on the Memory Mechanism of Large Language Model based Agents | Apr 21, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 3 |
| CHESS: Contextual Harnessing for Efficient SQL Synthesis | May 27, 2024 | Large Language ModelPrivacy Preserving | CodeCode Available | 3 |
| A Vision-Language Foundation Model to Enhance Efficiency of Chest X-ray Interpretation | Jan 22, 2024 | BenchmarkingDiagnostic | CodeCode Available | 3 |
| Editable Scene Simulation for Autonomous Driving via Collaborative LLM-Agents | Feb 8, 2024 | Autonomous DrivingLanguage Modeling | CodeCode Available | 3 |
| Multimodal Table Understanding | Jun 12, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 3 |
| M3D: Advancing 3D Medical Image Analysis with Multi-Modal Large Language Models | Mar 31, 2024 | Image-text RetrievalLanguage Modeling | CodeCode Available | 3 |
| HackSynth: LLM Agent and Evaluation Framework for Autonomous Penetration Testing | Dec 2, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 3 |
| Detecting hallucinations in large language models using semantic entropy | Jun 19, 2024 | Large Language ModelQuestion Answering | CodeCode Available | 3 |
| HERMES: A Unified Self-Driving World Model for Simultaneous 3D Scene Understanding and Generation | Jan 24, 2025 | Autonomous DrivingLanguage Modeling | CodeCode Available | 3 |
| A Smart Multimodal Healthcare Copilot with Powerful LLM Reasoning | Jun 3, 2025 | Decision MakingDiagnostic | CodeCode Available | 3 |
| LongMemEval: Benchmarking Chat Assistants on Long-Term Interactive Memory | Oct 14, 2024 | BenchmarkingLarge Language Model | CodeCode Available | 3 |
| MedRAG: Enhancing Retrieval-augmented Generation with Knowledge Graph-Elicited Reasoning for Healthcare Copilot | Feb 6, 2025 | DiagnosticLarge Language Model | CodeCode Available | 3 |
| LLMServingSim: A HW/SW Co-Simulation Infrastructure for LLM Inference Serving at Scale | Aug 10, 2024 | GPULanguage Modelling | CodeCode Available | 3 |
| APPL: A Prompt Programming Language for Harmonious Integration of Programs and Large Language Model Prompts | Jun 19, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 3 |
| Llemma: An Open Language Model For Mathematics | Oct 16, 2023 | Arithmetic ReasoningAutomated Theorem Proving | CodeCode Available | 3 |
| Cobra: Extending Mamba to Multi-Modal Large Language Model for Efficient Inference | Mar 21, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 3 |
| COAT: Compressing Optimizer states and Activation for Memory-Efficient FP8 Training | Oct 25, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 3 |
| DARWIN 1.5: Large Language Models as Materials Science Adapted Learners | Dec 16, 2024 | Large Language ModelMulti-Task Learning | CodeCode Available | 3 |
| AnyTool: Self-Reflective, Hierarchical Agents for Large-Scale API Calls | Feb 6, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 3 |
| Deep Learning and LLM-based Methods Applied to Stellar Lightcurve Classification | Apr 16, 2024 | Feature EngineeringLanguage Modeling | CodeCode Available | 3 |
| GroundingGPT:Language Enhanced Multi-modal Grounding Model | Jan 11, 2024 | Language ModellingLarge Language Model | CodeCode Available | 3 |
| Evolution of Heuristics: Towards Efficient Automatic Algorithm Design Using Large Language Model | Jan 4, 2024 | Combinatorial OptimizationLanguage Modeling | CodeCode Available | 3 |
| CRAG -- Comprehensive RAG Benchmark | Jun 7, 2024 | HallucinationLanguage Modelling | CodeCode Available | 3 |