| Efficient On-Chip Implementation of 4D Radar-Based 3D Object Detection on Hailo-8L | May 1, 2025 | 3D Object DetectionAutonomous Driving | —Unverified | 0 |
| Aggregating empirical evidence from data strategy studies: a case on model quantization | May 1, 2025 | GPUQuantization | —Unverified | 0 |
| Sionna RT: Technical Report | Apr 30, 2025 | GPU | —Unverified | 0 |
| Towards Easy and Realistic Network Infrastructure Testing for Large-scale Machine Learning | Apr 29, 2025 | CPUGPU | —Unverified | 0 |
| TF1-EN-3M: Three Million Synthetic Moral Fables for Training Small, Open Language Models | Apr 29, 2025 | BenchmarkingDataset Generation | CodeCode Available | 0 |
| semi-PD: Towards Efficient LLM Serving via Phase-Wise Disaggregated Computation and Unified Storage | Apr 28, 2025 | GPULarge Language Model | —Unverified | 0 |
| Efficient Domain-adaptive Continual Pretraining for the Process Industry in the German Language | Apr 28, 2025 | Continual PretrainingGPU | —Unverified | 0 |
| FlashOverlap: A Lightweight Design for Efficiently Overlapping Communication and Computation | Apr 28, 2025 | GPU | —Unverified | 0 |
| Accelerating Mixture-of-Experts Training with Adaptive Expert Replication | Apr 28, 2025 | GPUMixture-of-Experts | —Unverified | 0 |
| NSFlow: An End-to-End FPGA Framework with Scalable Dataflow Architecture for Neuro-Symbolic AI | Apr 27, 2025 | GPU | —Unverified | 0 |