| MiMo: Unlocking the Reasoning Potential of Language Model -- From Pretraining to Posttraining | May 12, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 5 |
| On the Robustness of Reward Models for Language Model Alignment | May 12, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Comet: Accelerating Private Inference for Large Language Model by Predicting Activation Sparsity | May 12, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Semantic Retention and Extreme Compression in LLMs: Can We Have Both? | May 12, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| DynamicRAG: Leveraging Outputs of Large Language Model as Feedback for Dynamic Reranking in Retrieval-Augmented Generation | May 12, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| Kalman Filter Enhanced GRPO for Reinforcement Learning-Based Language Model Reasoning | May 12, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Matrix Is All You Need | May 11, 2025 | AllGPU | —Unverified | 0 |
| Impact of SMILES Notational Inconsistencies on Chemical Language Model Performance | May 11, 2025 | feature selectionLanguage Modeling | CodeCode Available | 0 |
| Web Page Classification using LLMs for Crawling Support | May 11, 2025 | ClassificationLanguage Modeling | CodeCode Available | 0 |
| TrumorGPT: Graph-Based Retrieval-Augmented Large Language Model for Fact-Checking | May 11, 2025 | Fact CheckingFew-Shot Learning | —Unverified | 0 |