| CITER: Collaborative Inference for Efficient Large Language Model Decoding with Token-Level Routing | Feb 4, 2025 | Collaborative InferenceLanguage Modeling | CodeCode Available | 1 |
| Simulating Rumor Spreading in Social Networks using LLM Agents | Feb 3, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Fine-Tuning Discrete Diffusion Models with Policy Gradient Methods | Feb 3, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Speculative Ensemble: Fast Large Language Model Ensemble via Speculation | Feb 1, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Low-Rank Adapting Models for Sparse Autoencoders | Jan 31, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Scalable-Softmax Is Superior for Attention | Jan 31, 2025 | Information RetrievalLanguage Modeling | CodeCode Available | 1 |
| WILDCHAT-50M: A Deep Dive Into the Role of Synthetic Data in Post-Training | Jan 30, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| 2SSP: A Two-Stage Framework for Structured Pruning of LLMs | Jan 29, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| RadioLLM: Introducing Large Language Model into Cognitive Radio via Hybrid Prompt and Token Reprogrammings | Jan 28, 2025 | DenoisingDomain Generalization | CodeCode Available | 1 |
| Atla Selene Mini: A General Purpose Evaluation Model | Jan 27, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 |