| Matrix Is All You Need | May 11, 2025 | All, GPU | Unverified | 0 |
| Streaming Krylov-Accelerated Stochastic Gradient Descent | May 11, 2025 | GPU, Stochastic Optimization | Unverified | 0 |
| JaxRobotarium: Training and Deploying Multi-Robot Policies in 10 Minutes | May 10, 2025 | Benchmarking, GPU | Code Available | 1 |
| QoS-Efficient Serving of Multiple Mixture-of-Expert LLMs Using Partial Runtime Reconfiguration | May 10, 2025 | GPU, Mixture-of-Experts | Unverified | 0 |
| Challenging GPU Dominance: When CPUs Outperform for On-Device LLM Inference | May 9, 2025 | CPU, GPU | Unverified | 0 |
| FloE: On-the-Fly MoE Inference on Memory-constrained GPU | May 9, 2025 | CPU, GPU | Unverified | 0 |
| Fast Differentiable Modal Simulation of Non-linear Strings, Membranes, and Plates | May 9, 2025 | Audio Synthesis, CPU | Code Available | 1 |
| Boosting Performance on ARC is a Matter of Perspective | May 8, 2025 | ARC, GPU | Unverified | 0 |
| UltraGauss: Ultrafast Gaussian Reconstruction of 3D Ultrasound Volumes | May 8, 2025 | 3D Reconstruction, Computational Efficiency | Unverified | 0 |
| Steepest Descent Density Control for Compact 3D Gaussian Splatting | May 8, 2025 | 3DGS, GPU | Unverified | 0 |
| Leveraging Simultaneous Usage of Edge GPU Hardware Engines for Video Face Detection and Recognition | May 7, 2025 | Face Detection, Face Recognition | Unverified | 0 |
| FastMap: Revisiting Dense and Scalable Structure from Motion | May 7, 2025 | GPU | Code Available | 3 |
| Plexus: Taming Billion-edge Graphs with 3D Parallel GNN Training | May 7, 2025 | CPU, GPU | Unverified | 0 |
| Edge-GPU Based Face Tracking for Face Detection and Recognition Acceleration | May 7, 2025 | CPU, Face Detection | Unverified | 0 |
| Supporting renewable energy planning and operation with data-driven high-resolution ensemble weather forecast | May 7, 2025 | CPU, GPU | Unverified | 0 |
| LONGER: Scaling Up Long Sequence Modeling in Industrial Recommenders | May 7, 2025 | GPU, Recommendation Systems | Unverified | 0 |
| Prism: Unleashing GPU Sharing for Cost-Efficient Multi-LLM Serving | May 6, 2025 | GPU, Scheduling | Unverified | 0 |
| Can Large Language Models Predict Parallel Code Performance? | May 6, 2025 | GPU | Unverified | 0 |
| NBF at SemEval-2025 Task 5: Light-Burst Attention Enhanced System for Multilingual Subject Recommendation | May 6, 2025 | GPU, Retrieval | Unverified | 0 |
| Anant-Net: Breaking the Curse of Dimensionality with Scalable and Interpretable Neural Surrogate for High-Dimensional PDEs | May 6, 2025 | GPU, Kolmogorov-Arnold Networks | Unverified | 0 |
| AnomalyMatch: Discovering Rare Objects of Interest with Semi-supervised and Active Learning | May 6, 2025 | Active Learning, Anomaly Detection | Code Available | 0 |
| RetroInfer: A Vector-Storage Approach for Scalable Long-Context LLM Inference | May 5, 2025 | CPU, GPU | Unverified | 0 |
| Quantitative Analysis of Performance Drop in DeepSeek Model Quantization | May 5, 2025 | GPU, Quantization | Code Available | 0 |
| A UNet Model for Accelerated Preprocessing of CRISM Hyperspectral Data for Mineral Identification on Mars | May 4, 2025 | GPU | Unverified | 0 |
| Sparfels: Fast Reconstruction from Sparse Unposed Imagery | May 4, 2025 | GPU | Unverified | 0 |