| Personalize Segment Anything Model with One Shot | May 4, 2023 | Image Generation, model | Code Available | 3 |
| Discovering Language Model Behaviors with Model-Written Evaluations | Dec 19, 2022 | Language Modeling, Language Modelling | Code Available | 3 |
| Reasoning with Language Model Prompting: A Survey | Dec 19, 2022 | Arithmetic Reasoning, Common Sense Reasoning | Code Available | 3 |
| MedSegDiff: Medical Image Segmentation with Diffusion Probabilistic Model | Nov 1, 2022 | Anomaly Detection, Brain Tumor Segmentation | Code Available | 3 |
| Model-Free Opponent Shaping | May 3, 2022 | model | Code Available | 3 |
| Jukebox: A Generative Model for Music | Apr 30, 2020 | model | Code Available | 3 |
| Model-based Asynchronous Hyperparameter and Neural Architecture Search | Mar 24, 2020 | AutoML, Bayesian Optimization | Code Available | 3 |
| First Order Motion Model for Image Animation | Feb 29, 2020 | Image Animation, model | Code Available | 3 |
| EfficientNet: Rethinking Model Scaling for Convolutional Neural Networks | May 28, 2019 | Action Recognition, Domain Generalization | Code Available | 3 |
| RecGPT: A Foundation Model for Sequential Recommendation | Jun 6, 2025 | Decoder, model | Code Available | 2 |
| Model-Preserving Adaptive Rounding | May 29, 2025 | model, Quantization | Code Available | 2 |
| VeriThinker: Learning to Verify Makes Reasoning Model Efficient | May 23, 2025 | model | Code Available | 2 |
| Structure-Aligned Protein Language Model | May 22, 2025 | Contrastive Learning, Language Modeling | Code Available | 2 |
| SEED: Speaker Embedding Enhancement Diffusion Model | May 22, 2025 | model, Speaker Recognition | Code Available | 2 |
| Mergenetic: a Simple Evolutionary Model Merging Library | May 16, 2025 | Evolutionary Algorithms, model | Code Available | 2 |
| Diffusion Model Quantization: A Review | May 8, 2025 | model, Quantization | Code Available | 2 |
| RWKV-X: A Linear Complexity Hybrid Language Model | Apr 30, 2025 | Language Modeling, Language Modelling | Code Available | 2 |
| DiMeR: Disentangled Mesh Reconstruction Model | Apr 24, 2025 | Image to 3D, model | Code Available | 2 |
| Leveraging Reasoning Model Answers to Enhance Non-Reasoning Model Capability | Apr 13, 2025 | model | Code Available | 2 |
| Learned Image Compression with Dictionary-based Entropy Model | Apr 1, 2025 | Image Compression, model | Code Available | 2 |
| A Neural Symbolic Model for Space Physics | Mar 11, 2025 | Large Language Model, model | Code Available | 2 |
| voc2vec: A Foundation Model for Non-Verbal Vocalization | Feb 22, 2025 | model | Code Available | 2 |
| Optimizing Model Selection for Compound AI Systems | Feb 20, 2025 | model, Model Selection | Code Available | 2 |
| Continuous Diffusion Model for Language Modeling | Feb 17, 2025 | Language Modeling, Language Modelling | Code Available | 2 |
| Automated Capability Discovery via Model Self-Exploration | Feb 11, 2025 | model | Code Available | 2 |
| WaferLLM: Large Language Model Inference at Wafer Scale | Feb 6, 2025 | GPU, Language Modeling | Code Available | 2 |
| Reusing Embeddings: Reproducible Reward Model Research in Large Language Model Alignment without GPUs | Feb 4, 2025 | Code Generation, Language Modeling | Code Available | 2 |
| Diffusion Model as a Noise-Aware Latent Reward Model for Step-Level Preference Optimization | Feb 3, 2025 | model | Code Available | 2 |
| AIN: The Arabic INclusive Large Multimodal Model | Jan 31, 2025 | document understanding, model | Code Available | 2 |
| DiffGraph: Heterogeneous Graph Diffusion Model | Jan 4, 2025 | Denoising, Graph Generation | Code Available | 2 |
| Metadata Conditioning Accelerates Language Model Pre-training | Jan 3, 2025 | Language Modeling, Language Modelling | Code Available | 2 |
| Large Language Model Safety: A Holistic Survey | Dec 23, 2024 | Language Modeling, Language Modelling | Code Available | 2 |
| Tests for model misspecification in simulation-based inference: from local distortions to global model checks | Dec 19, 2024 | Anomaly Detection, model | Code Available | 2 |
| Large Language Model Enhanced Recommender Systems: A Survey | Dec 18, 2024 | Language Modeling, Language Modelling | Code Available | 2 |
| Maya: An Instruction Finetuned Multilingual Multimodal Model | Dec 10, 2024 | model | Code Available | 2 |
| EMOv2: Pushing 5M Vision Model Frontier | Dec 9, 2024 | Image Generation, model | Code Available | 2 |
| BianCang: A Traditional Chinese Medicine Large Language Model | Nov 17, 2024 | Diagnostic, Language Modeling | Code Available | 2 |
| Is Your LLM Secretly a World Model of the Internet? Model-Based Planning for Web Agents | Nov 10, 2024 | model | Code Available | 2 |
| Model merging with SVD to tie the Knots | Oct 25, 2024 | model | Code Available | 2 |
| Improve Vision Language Model Chain-of-thought Reasoning | Oct 21, 2024 | Language Modeling, Language Modelling | Code Available | 2 |
| CompassJudger-1: All-in-one Judge Model Helps Model Evaluation and Evolution | Oct 21, 2024 | All, model | Code Available | 2 |
| A Multimodal Vision Foundation Model for Clinical Dermatology | Oct 19, 2024 | Diagnostic, Lesion Segmentation | Code Available | 2 |
| Process Reward Model with Q-Value Rankings | Oct 15, 2024 | Decision Making, Language Modeling | Code Available | 2 |
| MatMamba: A Matryoshka State Space Model | Oct 9, 2024 | model, Representation Learning | Code Available | 2 |
| Learning Truncated Causal History Model for Video Restoration | Oct 4, 2024 | Deblurring, Denoising | Code Available | 2 |
| Scaling Smart: Accelerating Large Language Model Pre-training with Small Model Initialization | Sep 19, 2024 | GPU, Language Modeling | Code Available | 2 |
| HSIGene: A Foundation Model For Hyperspectral Image Generation | Sep 19, 2024 | Data Augmentation, Denoising | Code Available | 2 |
| Language Model Powered Digital Biology with BRAD | Sep 4, 2024 | Chatbot, Code Generation | Code Available | 2 |
| Causal Agent based on Large Language Model | Aug 13, 2024 | Language Modeling, Language Modelling | Code Available | 2 |
| XMainframe: A Large Language Model for Mainframe Modernization | Aug 5, 2024 | Code Summarization, Language Modeling | Code Available | 2 |