| The Streaming Batch Model for Efficient and Fault-Tolerant Heterogeneous Execution | Jan 16, 2025 | CPU, GPU | Unverified | 0 |
| FASP: Fast and Accurate Structured Pruning of Large Language Models | Jan 16, 2025 | GPU, Model Compression | Unverified | 0 |
| GS-LIVO: Real-Time LiDAR, Inertial, and Visual Multi-sensor Fused Odometry with Gaussian Mapping | Jan 15, 2025 | GPU, Sensor Fusion | Unverified | 0 |
| Resource-Constrained Federated Continual Learning: What Does Matter? | Jan 15, 2025 | Continual Learning, GPU | Unverified | 0 |
| Towards Lightweight and Stable Zero-shot TTS with Self-distilled Representation Disentanglement | Jan 15, 2025 | Computational Efficiency, CPU | Unverified | 0 |
| Towards Lightweight Time Series Forecasting: a Patch-wise Transformer with Weak Data Enriching | Jan 14, 2025 | GPU, Time Series | Unverified | 0 |
| Keras Sig: Efficient Path Signature Computation on GPU in Keras 3 | Jan 14, 2025 | Benchmarking, C++ code | Unverified | 0 |
| Physics-Informed Latent Neural Operator for Real-time Predictions of Complex Physical Systems | Jan 14, 2025 | GPU, Operator learning | Unverified | 0 |
| CuAsmRL: Optimizing GPU SASS Schedules via Deep Reinforcement Learning | Jan 14, 2025 | Deep Reinforcement Learning, GPU | Code Available | 1 |
| Hierarchical Autoscaling for Large Language Model Serving with Chiron | Jan 14, 2025 | GPU, Language Modeling | Unverified | 0 |