| SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model | Feb 4, 2025 | Instruction FollowingLanguage Modeling | —Unverified | 0 |
| Prompt-based Depth Pruning of Large Language Models | Feb 4, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Reusing Embeddings: Reproducible Reward Model Research in Large Language Model Alignment without GPUs | Feb 4, 2025 | Code GenerationLanguage Modeling | CodeCode Available | 2 |
| JingFang: A Traditional Chinese Medicine Large Language Model of Expert-Level Medical Diagnosis and Syndrome Differentiation-Based Treatment | Feb 4, 2025 | DiagnosticLanguage Modeling | —Unverified | 0 |
| Reviving The Classics: Active Reward Modeling in Large Language Model Alignment | Feb 4, 2025 | Computational EfficiencyExperimental Design | CodeCode Available | 2 |
| Unlocking Efficient Large Inference Models: One-Bit Unrolling Tips the Scales | Feb 4, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Rethinking Homogeneity of Vision and Text Tokens in Large Vision-and-Language Models | Feb 4, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Analyzing Similarity Metrics for Data Selection for Language Model Pretraining | Feb 4, 2025 | DecoderLanguage Modeling | —Unverified | 0 |
| LLM-USO: Large Language Model-based Universal Sizing Optimizer | Feb 4, 2025 | Bayesian OptimizationLanguage Modeling | —Unverified | 0 |
| MPIC: Position-Independent Multimodal Context Caching System for Efficient MLLM Serving | Feb 4, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |