| LoLA: Low-Rank Linear Attention With Sparse Caching | May 29, 2025 | 4k, 8k | Unverified | 0 |
| LlamaRL: A Distributed Asynchronous Reinforcement Learning Framework for Efficient Large-scale LLM Trainin | May 29, 2025 | GPU, Reinforcement Learning (RL) | Unverified | 0 |
| Holistic Large-Scale Scene Reconstruction via Mixed Gaussian Splatting | May 29, 2025 | 3D Scene Reconstruction, GPU | Code Available | 1 |
| Accelerating AllReduce with a Persistent Straggler | May 29, 2025 | GPU | Code Available | 1 |
| LODGE: Level-of-Detail Large-Scale Gaussian Splatting with Efficient Rendering | May 29, 2025 | 3DGS, GPU | Unverified | 0 |
| ZPressor: Bottleneck-Aware Compression for Scalable Feed-Forward 3DGS | May 29, 2025 | 3DGS, GPU | Code Available | 2 |
| LUMION: Fast Fault Recovery for ML Jobs Using Programmable Optical Fabrics | May 29, 2025 | GPU | Unverified | 0 |
| CF-DETR: Coarse-to-Fine Transformer for Real-Time Object Detection | May 29, 2025 | GPU, object-detection | Unverified | 0 |
| Fast Feature Matching of UAV Images via Matrix Band Reduction-based GPU Data Schedule | May 28, 2025 | CPU, GPU | Unverified | 0 |
| Speculative Decoding Meets Quantization: Compatibility Evaluation and Hierarchical Framework Design | May 28, 2025 | GPU, Quantization | Code Available | 1 |