| TorchFX: A modern approach to Audio DSP with PyTorch and GPU acceleration | Apr 11, 2025 | Audio Signal ProcessingBenchmarking | CodeCode Available | 2 |
| Apt-Serve: Adaptive Request Scheduling on Hybrid Cache for Scalable LLM Inference Serving | Apr 10, 2025 | GPULarge Language Model | CodeCode Available | 1 |
| DGOcc: Depth-aware Global Query-based Network for Monocular 3D Occupancy Prediction | Apr 10, 2025 | GPUPrediction | —Unverified | 0 |
| PoGO: A Scalable Proof of Useful Work via Quantized Gradient Descent and Merkle Proofs | Apr 10, 2025 | GPUQuantization | —Unverified | 0 |
| Search-contempt: a hybrid MCTS algorithm for training AlphaZero-like engines with better computational efficiency | Apr 10, 2025 | Computational EfficiencyGPU | —Unverified | 0 |
| GPT Carry-On: Training Foundation Model for Customization Could Be Simple, Scalable and Affordable | Apr 10, 2025 | GPUMath | —Unverified | 0 |
| A Comparison of Deep Learning Methods for Cell Detection in Digital Cytology | Apr 9, 2025 | Cell DetectionComputational Efficiency | CodeCode Available | 0 |
| CRYSIM: Prediction of Symmetric Structures of Large Crystals with GPU-based Ising Machines | Apr 9, 2025 | Bayesian OptimizationGPU | CodeCode Available | 0 |
| Accelerating LLM Inference Throughput via Asynchronous KV Cache Prefetching | Apr 8, 2025 | GPUScheduling | —Unverified | 0 |
| GPU-accelerated Evolutionary Many-objective Optimization Using Tensorized NSGA-III | Apr 8, 2025 | Computational EfficiencyCPU | CodeCode Available | 3 |