| LAMBDA: A Large Model Based Data Agent | Jul 24, 2024 | model | CodeCode Available | 4 | 5 |
| KTO: Model Alignment as Prospect Theoretic Optimization | Feb 2, 2024 | Attributemodel | CodeCode Available | 4 | 5 |
| SpargeAttention: Accurate and Training-free Sparse Attention Accelerating Any Model Inference | Feb 25, 2025 | modelVideo Generation | CodeCode Available | 4 | 5 |
| Desiderata for next generation of ML model serving | Oct 26, 2022 | modelPosition | CodeCode Available | 4 | 5 |
| Spirit LM: Interleaved Spoken and Written Language Model | Feb 8, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 4 | 5 |
| Thin-Plate Spline Motion Model for Image Animation | Mar 27, 2022 | Face ReenactmentImage Animation | CodeCode Available | 4 | 5 |
| Image Fusion via Vision-Language Model | Feb 3, 2024 | DecoderLanguage Modeling | CodeCode Available | 4 | 5 |
| HuaTuo: Tuning LLaMA Model with Chinese Medical Knowledge | Apr 14, 2023 | model | CodeCode Available | 4 | 5 |
| Human Motion Diffusion Model | Sep 29, 2022 | 3D Generationmodel | CodeCode Available | 4 | 5 |
| LISA: Reasoning Segmentation via Large Language Model | Aug 1, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 4 | 5 |
| Galactica: A Large Language Model for Science | Nov 16, 2022 | AnachronismsBias Detection | CodeCode Available | 4 | 5 |
| Recognize Anything: A Strong Image Tagging Model | Jun 6, 2023 | modelSemantic Parsing | CodeCode Available | 4 | 5 |
| OtterHD: A High-Resolution Multi-modality Model | Nov 7, 2023 | modelVisual Question Answering | CodeCode Available | 4 | 5 |
| NeMo-Aligner: Scalable Toolkit for Efficient Model Alignment | May 2, 2024 | modelparameter-efficient fine-tuning | CodeCode Available | 4 | 5 |
| Multimodal Whole Slide Foundation Model for Pathology | Nov 29, 2024 | Cross-Modal Retrievalmodel | CodeCode Available | 4 | 5 |
| Reasoning with Language Model is Planning with World Model | May 24, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 4 | 5 |
| RewardBench 2: Advancing Reward Model Evaluation | Jun 2, 2025 | Instruction Followingmodel | CodeCode Available | 4 | 5 |
| EdgeTAM: On-Device Track Anything Model | Jan 13, 2025 | modelVideo Segmentation | CodeCode Available | 4 | 5 |
| LLM Inference Unveiled: Survey and Roofline Model Insights | Feb 26, 2024 | Knowledge DistillationLanguage Modelling | CodeCode Available | 4 | 5 |
| Molecular-driven Foundation Model for Oncologic Pathology | Jan 28, 2025 | BenchmarkingDiagnostic | CodeCode Available | 4 | 5 |
| ChatDoctor: A Medical Chat Model Fine-Tuned on a Large Language Model Meta-AI (LLaMA) Using Medical Domain Knowledge | Mar 24, 2023 | Information RetrievalLanguage Modeling | CodeCode Available | 4 | 5 |
| Diffusion Model-Based Image Editing: A Survey | Feb 27, 2024 | DenoisingImage Generation | CodeCode Available | 4 | 5 |
| DiffusionDet: Diffusion Model for Object Detection | Nov 17, 2022 | Denoisingmodel | CodeCode Available | 4 | 5 |
| DiffuEraser: A Diffusion Model for Video Inpainting | Jan 17, 2025 | modelOptical Flow Estimation | CodeCode Available | 4 | 5 |
| LLM2CLIP: Powerful Language Model Unlocks Richer Visual Representation | Nov 7, 2024 | Contrastive LearningImage Captioning | CodeCode Available | 4 | 5 |