| Cosmos World Foundation Model Platform for Physical AI | Jan 7, 2025 | modelPosition | CodeCode Available | 5 | 5 |
| Reservoir-enhanced Segment Anything Model for Subsurface Diagnosis | Apr 26, 2025 | Anomaly DetectionGPR | CodeCode Available | 5 | 5 |
| CogAgent: A Visual Language Model for GUI Agents | Dec 14, 2023 | Language Modeling | CodeCode Available | 5 | 5 |
| LLaMA-Adapter V2: Parameter-Efficient Visual Instruction Model | Apr 28, 2023 | Instruction Followingmodel | CodeCode Available | 5 | 5 |
| ChatDoctor: A Medical Chat Model Fine-Tuned on a Large Language Model Meta-AI (LLaMA) Using Medical Domain Knowledge | Mar 24, 2023 | Information RetrievalLanguage Modeling | CodeCode Available | 4 | 5 |
| LLM2CLIP: Powerful Language Model Unlocks Richer Visual Representation | Nov 7, 2024 | Contrastive LearningImage Captioning | CodeCode Available | 4 | 5 |
| LLM Inference Unveiled: Survey and Roofline Model Insights | Feb 26, 2024 | Knowledge DistillationLanguage Modelling | CodeCode Available | 4 | 5 |
| EdgeTAM: On-Device Track Anything Model | Jan 13, 2025 | modelVideo Segmentation | CodeCode Available | 4 | 5 |
| LISA: Reasoning Segmentation via Large Language Model | Aug 1, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 4 | 5 |
| KTO: Model Alignment as Prospect Theoretic Optimization | Feb 2, 2024 | Attributemodel | CodeCode Available | 4 | 5 |