| A Causal World Model Underlying Next Token Prediction: Exploring GPT in a Controlled Environment | Dec 10, 2024 | model | —Unverified | 0 |
| TT-MPD: Test Time Model Pruning and Distillation | Dec 10, 2024 | Knowledge Distillationmodel | —Unverified | 0 |
| Bumblebee: Foundation Model for Particle Physics Discovery | Dec 10, 2024 | model | —Unverified | 0 |
| Bidirectional Mamba state-space model for anomalous diffusion | Dec 10, 2024 | Mambamodel | —Unverified | 0 |
| IntellectSeeker: A Personalized Literature Management System with the Probabilistic Model and Large Language Model | Dec 10, 2024 | ArticlesFew-Shot Learning | CodeCode Available | 0 |
| SUPERMERGE: An Approach For Gradient-Based Model Merging | Dec 9, 2024 | model | —Unverified | 0 |
| See Further When Clear: Curriculum Consistency Model | Dec 9, 2024 | model | —Unverified | 0 |
| Copyright-Protected Language Generation via Adaptive Model Fusion | Dec 9, 2024 | Code Generationmodel | CodeCode Available | 0 |
| Pre-trained protein language model for codon optimization | Dec 8, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Chimera: Improving Generalist Model with Domain-Specific Experts | Dec 8, 2024 | Mathmodel | —Unverified | 0 |