SOTAVerified

CPU

Papers

Showing 351-400 of 2231 papers

Title | Status | Hype
LSM-GNN: Large-scale Storage-based Multi-GPU GNN Training by Optimizing Data Transfer Scheme | - | 0
Regression prediction algorithm for energy consumption regression in cloud computing based on horned lizard algorithm optimised convolutional neural network-bidirectional gated recurrent unit | - | 0
OCTolyzer: Fully automatic toolkit for segmentation and feature extracting in optical coherence tomography and scanning laser ophthalmoscopy data | Code | 1
Mixture of Experts with Mixture of Precisions for Tuning Quality of Service | - | 0
Mixed-precision Neural Networks on RISC-V Cores: ISA extensions for Multi-Pumped Soft SIMD Operations | Code | 1
Attention in SRAM on Tenstorrent Grayskull | Code | 1
RISC-V RVV efficiency for ANN algorithms | - | 0
FastSAM-3DSlicer: A 3D-Slicer Extension for 3D Volumetric Segment Anything Model with Uncertainty Quantification | Code | 1
ARTEMIS: A Mixed Analog-Stochastic In-DRAM Accelerator for Transformer Neural Networks | - | 0
MEMO: Fine-grained Tensor Management For Ultra-long Context LLM Training | - | 0
PipeInfer: Accelerating LLM Inference using Asynchronous Pipelined Speculation | - | 0
A Bag of Tricks for Scaling CPU-based Deep FFMs to more than 300m Predictions per Second | - | 0
TensorTEE: Unifying Heterogeneous TEE Granularity for Efficient Secure Collaborative Tensor Computing | - | 0
Analyzing Machine Learning Performance in a Hybrid Quantum Computing and HPC Environment | - | 0
Inference Performance Optimization for Large Language Models on CPUs | Code | 3
Fast On-device LLM Inference with NPUs | Code | 5
Accelerating MRI Uncertainty Estimation with Mask-based Bayesian Neural Network | - | 0
Benchmarking End-To-End Performance of AI-Based Chip Placement Algorithms | - | 0
Supporting Cross-language Cross-project Bug Localization Using Pre-trained Language Models | - | 0
Unified Anomaly Detection methods on Edge Device using Knowledge Distillation and Quantization | - | 0
FLY-TTS: Fast, Lightweight and High-Quality End-to-End Text-to-Speech Synthesis | - | 0
Graph Neural Network as Computationally Efficient Emulator of Ice-sheet and Sea-level System Model (ISSM) | - | 0
T-MAC: CPU Renaissance via Table Lookup for Low-Bit LLM Deployment on Edge | Code | 4
Mooncake: A KVCache-centric Disaggregated Architecture for LLM Serving | Code | 7
SLOctolyzer: Fully automatic analysis toolkit for segmentation and feature extracting in scanning laser ophthalmoscopy images | Code | 1
Towards Dynamic Resource Allocation and Client Scheduling in Hierarchical Federated Learning: A Two-Phase Deep Reinforcement Learning Approach | - | 0
Enhancing Dropout-based Bayesian Neural Networks with Multi-Exit on FPGA | Code | 1
UpDLRM: Accelerating Personalized Recommendation using Real-World PIM Architecture | - | 0
GPU-Accelerated DCOPF using Gradient-Based Optimization | Code | 0
Sparse High Rank Adapters | - | 0
Multispectral Snapshot Image Registration Using Learned Cross Spectral Disparity Estimation and a Deep Guided Occlusion Reconstruction Network | Code | 0
Endor: Hardware-Friendly Sparse Format for Offloaded LLM Inference | - | 0
PixRO: Pixel-Distributed Rotational Odometry with Gaussian Belief Propagation | - | 0
Deep Symbolic Optimization for Combinatorial Optimization: Accelerating Node Selection by Discovering Potential Heuristics | Code | 0
GEB-1.3B: Open Lightweight Large Language Model | - | 0
Practical offloading for fine-tuning LLM on commodity GPU via learned sparse projectors | Code | 0
ProTrain: Efficient LLM Training via Memory-Aware Techniques | - | 0
PreSto: An In-Storage Data Preprocessing System for Training Recommendation Models | - | 0
fSEAD: a Composable FPGA-based Streaming Ensemble Anomaly Detection Library | Code | 0
PowerInfer-2: Fast Large Language Model Inference on a Smartphone | Code | 9
Investigating Memory Failure Prediction Across CPU Architectures | - | 0
Ensemble Method for System Failure Detection Using Large-Scale Telemetry Data | - | 0
MEFT: Memory-Efficient Fine-Tuning through Sparse Adapter | Code | 1
Multi-Stage Speech Bandwidth Extension with Flexible Sampling Rate Control | Code | 2
Position Paper: Think Globally, React Locally -- Bringing Real-time Reference-based Website Phishing Detection on macOS | - | 0
Proof of Quality: A Costless Paradigm for Trustless Generative AI Model Inference on Blockchains | - | 0
LoReTrack: Efficient and Accurate Low-Resolution Transformer Tracking | Code | 1
MINet: Multi-scale Interactive Network for Real-time Salient Object Detection of Strip Steel Surface Defects | Code | 1
Aerial Inspection of High-Voltage Power Lines Using YOLOv8 Real-Time Object Detector | Code | 0
Improving Simulation Regression Efficiency using a Machine Learning-based Method in Design Verification | - | 0
Page 8 of 45

No leaderboard results yet.