SOTAVerified

CPU

Papers

Showing 476500 of 2231 papers

TitleStatusHype
Parallel Branch Model Predictive Control on GPUs0
Versatile and Fast Location-Based Private Information Retrieval with Fully Homomorphic Encryption over the TorusCode0
SecONNds: Secure Outsourced Neural Network Inference on ImageNetCode0
HPCTransCompile: An AI Compiler Generated Dataset for High-Performance CUDA Transpilation and LLM Preliminary Exploration0
RT-VC: Real-Time Zero-Shot Voice Conversion with Speech Articulatory Coding0
MNN-LLM: A Generic Inference Engine for Fast Large Language Model Deployment on Mobile Devices0
Plug-and-Play Linear Attention for Pre-trained Image and Video Restoration ModelsCode0
GPU-accelerated Modeling of Biological Regulatory Networks0
Implementing Keyword Spotting on the MCUX947 Microcontroller with Integrated NPU0
JavelinGuard: Low-Cost Transformer Architectures for LLM Security0
Cost-Efficient LLM Training with Lifetime-Aware Tensor Offloading via GPUDirect Storage0
BestServe: Serving Strategies with Optimal Goodput in Collocation and Disaggregation Architectures0
Memory Access Characterization of Large Language Models in CPU Environment and its Potential Impacts0
PointODE: Lightweight Point Cloud Learning with Neural Ordinary Differential Equations on Edge0
CPINN-ABPI: Physics-Informed Neural Networks for Accurate Power Estimation in MPSoCs0
Improving QA Efficiency with DistilBERT: Fine-Tuning and Inference on mobile Intel CPUs0
Fast Feature Matching of UAV Images via Matrix Band Reduction-based GPU Data Schedule0
FastMamba: A High-Speed and Efficient Mamba Accelerator on FPGA with Accurate Quantization0
TextDiffuser-RL: Efficient and Robust Text Layout Optimization for High-Fidelity Text-to-Image Synthesis0
KernelOracle: Predicting the Linux Scheduler's Next Move with Deep LearningCode0
Harnessing Large Language Models Locally: Empirical Results and Implications for AI PCCode0
Not All Models Suit Expert Offloading: On Local Routing Consistency of Mixture-of-Expert ModelsCode0
Machine Learning for Consistency Violation Faults Analysis0
FreeKV: Boosting KV Cache Retrieval for Efficient LLM Inference0
MPRM: A Markov Path-based Rule Miner for Efficient and Interpretable Knowledge Graph Reasoning0
Show:102550
← PrevPage 20 of 90Next →

No leaderboard results yet.