| Hear Your Code Fail, Voice-Assisted Debugging for Python | Jul 20, 2025 | CPU, Medical Diagnosis | Unverified | 0 |
| 3C-FBI: A Combinatorial method using Convolutions for Circle Fitting in Blurry Images | Jul 15, 2025 | CPU, Density Estimation | Unverified | 0 |
| Inter2Former: Dynamic Hybrid Attention for Efficient High-Precision Interactive | Jul 13, 2025 | CPU, Interactive Segmentation | Unverified | 0 |
| MathOptAI.jl: Embed trained machine learning predictors into JuMP models | Jul 3, 2025 | CPU, Gaussian Processes | Code Available | 2 |
| LoRA Fine-Tuning Without GPUs: A CPU-Efficient Meta-Generation Framework for LLMs | Jul 2, 2025 | CPU, GPU | Unverified | 0 |
| AUTOMATIC ROOM LIGHT CONTROLLER MANAGEMENT SYSTEM. | Jun 25, 2025 | 4k, CPU | Unverified | 0 |
| Causal-Aware Intelligent QoE Optimization for VR Interaction with Adaptive Keyframe Extraction | Jun 24, 2025 | Causal Inference, CPU | Unverified | 0 |
| MNN-AECS: Energy Optimization for LLM Decoding on Mobile Devices via Adaptive Core Selection | Jun 24, 2025 | CPU, Large Language Model | Unverified | 0 |
| Variational Bayesian Channel Estimation and Data Detection for Cell-Free Massive MIMO with Low-Resolution Quantized Fronthaul Links | Jun 23, 2025 | CPU, Quantization | Unverified | 0 |
| LIGHTHOUSE: Fast and precise distance to shoreline calculations from anywhere on earth | Jun 23, 2025 | CPU | Code Available | 1 |
| ConsumerBench: Benchmarking Generative AI Applications on End-User Devices | Jun 21, 2025 | Benchmarking, CPU | Code Available | 1 |
| Speeding up Local Optimization in Vehicle Routing with Tensor-based GPU Acceleration | Jun 20, 2025 | Attribute, Computational Efficiency | Unverified | 0 |
| Wavelet-based Global Orientation and Surface Reconstruction for Point Clouds | Jun 19, 2025 | CPU, Surface Reconstruction | Unverified | 0 |
| Distributed Activity Detection for Cell-Free Hybrid Near-Far Field Communications | Jun 17, 2025 | Action Detection, Activity Detection | Unverified | 0 |
| Parallel Branch Model Predictive Control on GPUs | Jun 16, 2025 | CPU, GPU | Unverified | 0 |
| Versatile and Fast Location-Based Private Information Retrieval with Fully Homomorphic Encryption over the Torus | Jun 15, 2025 | CPU, GPU | Code Available | 0 |
| SecONNds: Secure Outsourced Neural Network Inference on ImageNet | Jun 13, 2025 | CPU, GPU | Code Available | 0 |
| HPCTransCompile: An AI Compiler Generated Dataset for High-Performance CUDA Transpilation and LLM Preliminary Exploration | Jun 12, 2025 | CPU, Data Augmentation | Unverified | 0 |
| RT-VC: Real-Time Zero-Shot Voice Conversion with Speech Articulatory Coding | Jun 12, 2025 | CPU, Voice Conversion | Unverified | 0 |
| MNN-LLM: A Generic Inference Engine for Fast Large Language Model Deployment on Mobile Devices | Jun 12, 2025 | CPU, GPU | Unverified | 0 |
| GPU-accelerated Modeling of Biological Regulatory Networks | Jun 10, 2025 | CPU, global-optimization | Unverified | 0 |
| Plug-and-Play Linear Attention for Pre-trained Image and Video Restoration Models | Jun 10, 2025 | CPU, Deblurring | Code Available | 0 |
| Implementing Keyword Spotting on the MCUX947 Microcontroller with Integrated NPU | Jun 10, 2025 | CPU, Keyword Spotting | Unverified | 0 |
| JavelinGuard: Low-Cost Transformer Architectures for LLM Security | Jun 9, 2025 | CPU, Large Language Model | Unverified | 0 |
| Cost-Efficient LLM Training with Lifetime-Aware Tensor Offloading via GPUDirect Storage | Jun 6, 2025 | CPU, GPU | Unverified | 0 |
| BestServe: Serving Strategies with Optimal Goodput in Collocation and Disaggregation Architectures | Jun 6, 2025 | Benchmarking, CPU | Unverified | 0 |
| FlashDMoE: Fast Distributed MoE in a Single Kernel | Jun 5, 2025 | 16k, CPU | Code Available | 3 |
| Memory Access Characterization of Large Language Models in CPU Environment and its Potential Impacts | Jun 2, 2025 | CPU | Unverified | 0 |
| PointODE: Lightweight Point Cloud Learning with Neural Ordinary Differential Equations on Edge | May 31, 2025 | CPU | Unverified | 0 |
| Fast Feature Matching of UAV Images via Matrix Band Reduction-based GPU Data Schedule | May 28, 2025 | CPU, GPU | Unverified | 0 |
| Improving QA Efficiency with DistilBERT: Fine-Tuning and Inference on mobile Intel CPUs | May 28, 2025 | Computational Efficiency, CPU | Unverified | 0 |
| CPINN-ABPI: Physics-Informed Neural Networks for Accurate Power Estimation in MPSoCs | May 28, 2025 | Computational Efficiency, CPU | Unverified | 0 |
| TailorKV: A Hybrid Framework for Long-Context Inference via Tailored KV Cache Optimization | May 26, 2025 | CPU, GPU | Code Available | 1 |
| TextDiffuser-RL: Efficient and Robust Text Layout Optimization for High-Fidelity Text-to-Image Synthesis | May 25, 2025 | CPU, GPU | Unverified | 0 |
| FastMamba: A High-Speed and Efficient Mamba Accelerator on FPGA with Accurate Quantization | May 25, 2025 | Computational Efficiency, CPU | Unverified | 0 |
| Why Not Replace? Sustaining Long-Term Visual Localization via Handcrafted-Learned Feature Collaboration on CPU | May 24, 2025 | CPU, Keypoint Detection | Code Available | 1 |
| QuickVideo: Real-Time Long Video Understanding with System Algorithm Co-Design | May 22, 2025 | CPU, GPU | Code Available | 2 |
| Not All Models Suit Expert Offloading: On Local Routing Consistency of Mixture-of-Expert Models | May 21, 2025 | All, CPU | Code Available | 0 |
| KernelOracle: Predicting the Linux Scheduler's Next Move with Deep Learning | May 21, 2025 | CPU, Deep Learning | Code Available | 0 |
| Harnessing Large Language Models Locally: Empirical Results and Implications for AI PC | May 21, 2025 | CPU, Quantization | Code Available | 0 |
| Machine Learning for Consistency Violation Faults Analysis | May 20, 2025 | CPU | Unverified | 0 |
| FreeKV: Boosting KV Cache Retrieval for Efficient LLM Inference | May 19, 2025 | CPU, GPU | Unverified | 0 |
| ZenFlow: Enabling Stall-Free Offloading Training via Asynchronous Updates | May 18, 2025 | CPU, GPU | Unverified | 0 |
| MPRM: A Markov Path-based Rule Miner for Efficient and Interpretable Knowledge Graph Reasoning | May 18, 2025 | CPU, Knowledge Graphs | Unverified | 0 |
| A Heuristic Algorithm Based on Beam Search and Iterated Local Search for the Maritime Inventory Routing Problem | May 17, 2025 | CPU | Unverified | 0 |
| Scalability of Reinforcement Learning Methods for Dispatching in Semiconductor Frontend Fabs: A Comparison of Open-Source Models with Real Industry Datasets | May 16, 2025 | CPU, Scheduling | Unverified | 0 |
| From Embeddings to Accuracy: Comparing Foundation Models for Radiographic Classification | May 16, 2025 | Computational Efficiency, CPU | Unverified | 0 |
| SpecOffload: Unlocking Latent GPU Capacity for LLM Inference on Resource-Constrained Devices | May 15, 2025 | CPU, GPU | Code Available | 1 |
| Lossless Compression for LLM Tensor Incremental Snapshots | May 14, 2025 | CPU | Unverified | 0 |
| Single-shot prediction of parametric partial differential equations | May 14, 2025 | CPU, GPU | Unverified | 0 |