| WaferLLM: Large Language Model Inference at Wafer Scale | Feb 6, 2025 | GPULanguage Modeling | CodeCode Available | 2 |
| Reusing Embeddings: Reproducible Reward Model Research in Large Language Model Alignment without GPUs | Feb 4, 2025 | Code GenerationLanguage Modeling | CodeCode Available | 2 |
| Diffusion Model as a Noise-Aware Latent Reward Model for Step-Level Preference Optimization | Feb 3, 2025 | model | CodeCode Available | 2 |
| AIN: The Arabic INclusive Large Multimodal Model | Jan 31, 2025 | document understandingmodel | CodeCode Available | 2 |
| DiffGraph: Heterogeneous Graph Diffusion Model | Jan 4, 2025 | DenoisingGraph Generation | CodeCode Available | 2 |
| Metadata Conditioning Accelerates Language Model Pre-training | Jan 3, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| Large Language Model Safety: A Holistic Survey | Dec 23, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| Tests for model misspecification in simulation-based inference: from local distortions to global model checks | Dec 19, 2024 | Anomaly Detectionmodel | CodeCode Available | 2 |
| Large Language Model Enhanced Recommender Systems: A Survey | Dec 18, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| Maya: An Instruction Finetuned Multilingual Multimodal Model | Dec 10, 2024 | model | CodeCode Available | 2 |
| EMOv2: Pushing 5M Vision Model Frontier | Dec 9, 2024 | Image Generationmodel | CodeCode Available | 2 |
| BianCang: A Traditional Chinese Medicine Large Language Model | Nov 17, 2024 | DiagnosticLanguage Modeling | CodeCode Available | 2 |
| Is Your LLM Secretly a World Model of the Internet? Model-Based Planning for Web Agents | Nov 10, 2024 | model | CodeCode Available | 2 |
| Model merging with SVD to tie the Knots | Oct 25, 2024 | model | CodeCode Available | 2 |
| Improve Vision Language Model Chain-of-thought Reasoning | Oct 21, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| CompassJudger-1: All-in-one Judge Model Helps Model Evaluation and Evolution | Oct 21, 2024 | Allmodel | CodeCode Available | 2 |
| A Multimodal Vision Foundation Model for Clinical Dermatology | Oct 19, 2024 | DiagnosticLesion Segmentation | CodeCode Available | 2 |
| Process Reward Model with Q-Value Rankings | Oct 15, 2024 | Decision MakingLanguage Modeling | CodeCode Available | 2 |
| MatMamba: A Matryoshka State Space Model | Oct 9, 2024 | modelRepresentation Learning | CodeCode Available | 2 |
| Learning Truncated Causal History Model for Video Restoration | Oct 4, 2024 | DeblurringDenoising | CodeCode Available | 2 |
| Scaling Smart: Accelerating Large Language Model Pre-training with Small Model Initialization | Sep 19, 2024 | GPULanguage Modeling | CodeCode Available | 2 |
| HSIGene: A Foundation Model For Hyperspectral Image Generation | Sep 19, 2024 | Data AugmentationDenoising | CodeCode Available | 2 |
| Language Model Powered Digital Biology with BRAD | Sep 4, 2024 | ChatbotCode Generation | CodeCode Available | 2 |
| Causal Agent based on Large Language Model | Aug 13, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| XMainframe: A Large Language Model for Mainframe Modernization | Aug 5, 2024 | Code SummarizationLanguage Modeling | CodeCode Available | 2 |