| Oscillation-Reduced MXFP4 Training for Vision Transformers | Feb 28, 2025 | GPUQuantization | CodeCode Available | 1 |
| Scalable Signature Kernel Computations for Long Time Series via Local Neumann Series Expansions | Feb 27, 2025 | GPUTime Series | CodeCode Available | 1 |
| Dynamic Low-Rank Sparse Adaptation for Large Language Models | Feb 20, 2025 | CPUGPU | CodeCode Available | 1 |
| Myna: Masking-Based Contrastive Learning of Musical Representations | Feb 18, 2025 | Contrastive LearningData Augmentation | CodeCode Available | 1 |
| AdaSplash: Adaptive Sparse Flash Attention | Feb 17, 2025 | GPULanguage Modeling | CodeCode Available | 1 |
| CalibQuant: 1-Bit KV Cache Quantization for Multimodal LLMs | Feb 15, 2025 | Computational EfficiencyGPU | CodeCode Available | 1 |
| Small Language Model Makes an Effective Long Text Extractor | Feb 11, 2025 | GPULanguage Modeling | CodeCode Available | 1 |
| Bag of Tricks for Inference-time Computation of LLM Reasoning | Feb 11, 2025 | GPU | CodeCode Available | 1 |
| MERGE^3: Efficient Evolutionary Merging on Consumer-grade GPUs | Feb 9, 2025 | GPU | CodeCode Available | 1 |
| SyMANTIC: An Efficient Symbolic Regression Method for Interpretable and Parsimonious Model Discovery in Science and Beyond | Feb 5, 2025 | feature selectionGPU | CodeCode Available | 1 |