| Parallel Branch Model Predictive Control on GPUs | Jun 16, 2025 | CPU, GPU | Unverified | 0 |
| Versatile and Fast Location-Based Private Information Retrieval with Fully Homomorphic Encryption over the Torus | Jun 15, 2025 | CPU, GPU | Code Available | 0 |
| SecONNds: Secure Outsourced Neural Network Inference on ImageNet | Jun 13, 2025 | CPU, GPU | Code Available | 0 |
| HPCTransCompile: An AI Compiler Generated Dataset for High-Performance CUDA Transpilation and LLM Preliminary Exploration | Jun 12, 2025 | CPU, Data Augmentation | Unverified | 0 |
| RT-VC: Real-Time Zero-Shot Voice Conversion with Speech Articulatory Coding | Jun 12, 2025 | CPU, Voice Conversion | Unverified | 0 |
| MNN-LLM: A Generic Inference Engine for Fast Large Language Model Deployment on Mobile Devices | Jun 12, 2025 | CPU, GPU | Unverified | 0 |
| Plug-and-Play Linear Attention for Pre-trained Image and Video Restoration Models | Jun 10, 2025 | CPU, Deblurring | Code Available | 0 |
| GPU-accelerated Modeling of Biological Regulatory Networks | Jun 10, 2025 | CPU, global-optimization | Unverified | 0 |
| Implementing Keyword Spotting on the MCUX947 Microcontroller with Integrated NPU | Jun 10, 2025 | CPU, Keyword Spotting | Unverified | 0 |
| JavelinGuard: Low-Cost Transformer Architectures for LLM Security | Jun 9, 2025 | CPU, Large Language Model | Unverified | 0 |
| Cost-Efficient LLM Training with Lifetime-Aware Tensor Offloading via GPUDirect Storage | Jun 6, 2025 | CPU, GPU | Unverified | 0 |
| BestServe: Serving Strategies with Optimal Goodput in Collocation and Disaggregation Architectures | Jun 6, 2025 | Benchmarking, CPU | Unverified | 0 |
| Memory Access Characterization of Large Language Models in CPU Environment and its Potential Impacts | Jun 2, 2025 | CPU | Unverified | 0 |
| PointODE: Lightweight Point Cloud Learning with Neural Ordinary Differential Equations on Edge | May 31, 2025 | CPU | Unverified | 0 |
| CPINN-ABPI: Physics-Informed Neural Networks for Accurate Power Estimation in MPSoCs | May 28, 2025 | Computational Efficiency, CPU | Unverified | 0 |
| Improving QA Efficiency with DistilBERT: Fine-Tuning and Inference on mobile Intel CPUs | May 28, 2025 | Computational Efficiency, CPU | Unverified | 0 |
| Fast Feature Matching of UAV Images via Matrix Band Reduction-based GPU Data Schedule | May 28, 2025 | CPU, GPU | Unverified | 0 |
| FastMamba: A High-Speed and Efficient Mamba Accelerator on FPGA with Accurate Quantization | May 25, 2025 | Computational Efficiency, CPU | Unverified | 0 |
| TextDiffuser-RL: Efficient and Robust Text Layout Optimization for High-Fidelity Text-to-Image Synthesis | May 25, 2025 | CPU, GPU | Unverified | 0 |
| KernelOracle: Predicting the Linux Scheduler's Next Move with Deep Learning | May 21, 2025 | CPU, Deep Learning | Code Available | 0 |
| Harnessing Large Language Models Locally: Empirical Results and Implications for AI PC | May 21, 2025 | CPU, Quantization | Code Available | 0 |
| Not All Models Suit Expert Offloading: On Local Routing Consistency of Mixture-of-Expert Models | May 21, 2025 | All, CPU | Code Available | 0 |
| Machine Learning for Consistency Violation Faults Analysis | May 20, 2025 | CPU | Unverified | 0 |
| FreeKV: Boosting KV Cache Retrieval for Efficient LLM Inference | May 19, 2025 | CPU, GPU | Unverified | 0 |
| MPRM: A Markov Path-based Rule Miner for Efficient and Interpretable Knowledge Graph Reasoning | May 18, 2025 | CPU, Knowledge Graphs | Unverified | 0 |