| TinyLlama: An Open-Source Small Language Model | Jan 4, 2024 | Computational Efficiency, Language Modeling | Code Available | 11 |
| SkyReels-V2: Infinite-length Film Generative Model | Apr 17, 2025 | Large Language Model, model | Code Available | 9 |
| OpenVLA: An Open-Source Vision-Language-Action Model | Jun 13, 2024 | Imitation Learning, Language Modelling | Code Available | 9 |
| ORPO: Monolithic Preference Optimization without Reference Model | Mar 12, 2024 | model | Code Available | 9 |
| PowerPM: Foundation Model for Power Systems | Aug 7, 2024 | Contrastive Learning, model | Code Available | 7 |
| GenAD: Generalized Predictive Model for Autonomous Driving | Mar 14, 2024 | Autonomous Driving, model | Code Available | 7 |
| SoftTiger: A Clinical Foundation Model for Healthcare Workflows | Mar 1, 2024 | Language Modelling, Large Language Model | Code Available | 7 |
| VMamba: Visual State Space Model | Jan 18, 2024 | Computational Efficiency, Language Modeling | Code Available | 7 |
| SGLang: Efficient Execution of Structured Language Model Programs | Dec 12, 2023 | Few-Shot Learning, Language Modeling | Code Available | 6 |
| Direct Preference Optimization: Your Language Model is Secretly a Reward Model | May 29, 2023 | Language Modeling, Language Modelling | Code Available | 6 |
| Gorilla: Large Language Model Connected with Massive APIs | May 24, 2023 | Hallucination, Language Modeling | Code Available | 6 |
| GLM-130B: An Open Bilingual Pre-trained Model | Oct 5, 2022 | Language Modeling, Language Modelling | Code Available | 6 |
| Matrix-Game: Interactive World Foundation Model | Jun 23, 2025 | Minecraft, model | Code Available | 5 |
| Reservoir-enhanced Segment Anything Model for Subsurface Diagnosis | Apr 26, 2025 | Anomaly Detection, GPR | Code Available | 5 |
| Magma: A Foundation Model for Multimodal AI Agents | Feb 18, 2025 | Autonomous Web Navigation, Image to text | Code Available | 5 |
| Cosmos World Foundation Model Platform for Physical AI | Jan 7, 2025 | model, Position | Code Available | 5 |
| KBLaM: Knowledge Base augmented Language Model | Oct 14, 2024 | 8k, GPU | Code Available | 5 |
| Uni-Mol2: Exploring Molecular Pretraining Model at Scale | Jun 21, 2024 | model | Code Available | 5 |
| Evolutionary Optimization of Model Merging Recipes | Mar 19, 2024 | Evolutionary Algorithms, Math | Code Available | 5 |
| VideoMamba: State Space Model for Efficient Video Understanding | Mar 11, 2024 | Action Classification, Mamba | Code Available | 5 |
| Repetition Improves Language Model Embeddings | Feb 23, 2024 | Language Modeling, Language Modelling | Code Available | 5 |
| CogAgent: A Visual Language Model for GUI Agents | Dec 14, 2023 | Language Modeling | Code Available | 5 |
| LLaMA-Adapter V2: Parameter-Efficient Visual Instruction Model | Apr 28, 2023 | Instruction Following, model | Code Available | 5 |
| Assessing Language Model Deployment with Risk Cards | Mar 31, 2023 | Language Modeling, Language Modelling | Code Available | 5 |
| WorldVLA: Towards Autoregressive Action World Model | Jun 26, 2025 | Action Generation, model | Code Available | 4 |
| RewardBench 2: Advancing Reward Model Evaluation | Jun 2, 2025 | Instruction Following, model | Code Available | 4 |
| Unified Reward Model for Multimodal Understanding and Generation | Mar 7, 2025 | Image Generation, model | Code Available | 4 |
| SpargeAttention: Accurate and Training-free Sparse Attention Accelerating Any Model Inference | Feb 25, 2025 | model, Video Generation | Code Available | 4 |
| Molecular-driven Foundation Model for Oncologic Pathology | Jan 28, 2025 | Benchmarking, Diagnostic | Code Available | 4 |
| DiffuEraser: A Diffusion Model for Video Inpainting | Jan 17, 2025 | model, Optical Flow Estimation | Code Available | 4 |
| EdgeTAM: On-Device Track Anything Model | Jan 13, 2025 | model, Video Segmentation | Code Available | 4 |
| MOS: Model Surgery for Pre-Trained Model-Based Class-Incremental Learning | Dec 12, 2024 | class-incremental learning, Class Incremental Learning | Code Available | 4 |
| Weighted-Reward Preference Optimization for Implicit Model Fusion | Dec 4, 2024 | model | Code Available | 4 |
| Multimodal Whole Slide Foundation Model for Pathology | Nov 29, 2024 | Cross-Modal Retrieval, model | Code Available | 4 |
| LLM2CLIP: Powerful Language Model Unlocks Richer Visual Representation | Nov 7, 2024 | Contrastive Learning, Image Captioning | Code Available | 4 |
| TokenFormer: Rethinking Transformer Scaling with Tokenized Model Parameters | Oct 30, 2024 | model | Code Available | 4 |
| LAMBDA: A Large Model Based Data Agent | Jul 24, 2024 | model | Code Available | 4 |
| YuLan: An Open-source Large Language Model | Jun 28, 2024 | Language Modeling, Language Modelling | Code Available | 4 |
| NeMo-Aligner: Scalable Toolkit for Efficient Model Alignment | May 2, 2024 | model, parameter-efficient fine-tuning | Code Available | 4 |
| Self-Play Preference Optimization for Language Model Alignment | May 1, 2024 | Language Modeling, Language Modelling | Code Available | 4 |
| UniTS: A Unified Multi-Task Time Series Model | Feb 29, 2024 | Anomaly Detection, Imputation | Code Available | 4 |
| Diffusion Model-Based Image Editing: A Survey | Feb 27, 2024 | Denoising, Image Generation | Code Available | 4 |
| LLM Inference Unveiled: Survey and Roofline Model Insights | Feb 26, 2024 | Knowledge Distillation, Language Modelling | Code Available | 4 |
| Spirit LM: Interleaved Spoken and Written Language Model | Feb 8, 2024 | Language Modeling, Language Modelling | Code Available | 4 |
| Image Fusion via Vision-Language Model | Feb 3, 2024 | Decoder, Language Modeling | Code Available | 4 |
| KTO: Model Alignment as Prospect Theoretic Optimization | Feb 2, 2024 | Attribute, model | Code Available | 4 |
| OtterHD: A High-Resolution Multi-modality Model | Nov 7, 2023 | model, Visual Question Answering | Code Available | 4 |
| LISA: Reasoning Segmentation via Large Language Model | Aug 1, 2023 | Language Modeling, Language Modelling | Code Available | 4 |
| Recognize Anything: A Strong Image Tagging Model | Jun 6, 2023 | model, Semantic Parsing | Code Available | 4 |
| Reasoning with Language Model is Planning with World Model | May 24, 2023 | Language Modeling, Language Modelling | Code Available | 4 |