| Numerical Schemes for Signature Kernels | Feb 12, 2025 | GPU | CodeCode Available | 0 |
| Bag of Tricks for Inference-time Computation of LLM Reasoning | Feb 11, 2025 | GPU | CodeCode Available | 1 |
| Memory Analysis on the Training Course of DeepSeek Models | Feb 11, 2025 | GPUMixture-of-Experts | —Unverified | 0 |
| Fast-COS: A Fast One-Stage Object Detector Based on Reparameterized Attention Vision Transformer for Autonomous Driving | Feb 11, 2025 | Autonomous DrivingComputational Efficiency | —Unverified | 0 |
| Small Language Model Makes an Effective Long Text Extractor | Feb 11, 2025 | GPULanguage Modeling | CodeCode Available | 1 |
| Memory Is Not the Bottleneck: Cost-Efficient Continual Learning via Weight Space Consolidation | Feb 11, 2025 | class-incremental learningClass Incremental Learning | —Unverified | 0 |
| Exploiting Sparsity for Long Context Inference: Million Token Contexts on Commodity GPUs | Feb 10, 2025 | GPU | CodeCode Available | 0 |
| Accelerating Outlier-robust Rotation Estimation by Stereographic Projection | Feb 10, 2025 | GPU | —Unverified | 0 |
| MoETuner: Optimized Mixture of Expert Serving with Balanced Expert Placement and Token Routing | Feb 10, 2025 | GPUMixture-of-Experts | —Unverified | 0 |
| MERGE^3: Efficient Evolutionary Merging on Consumer-grade GPUs | Feb 9, 2025 | GPU | CodeCode Available | 1 |