SOTAVerified

CPU

Papers

Showing 401–450 of 2231 papers

| Title | Status | Hype |
|---|---|---|
| Chameleon: a Heterogeneous and Disaggregated Accelerator System for Retrieval-Augmented Language Models | Code | 1 |
| ChamNet: Towards Efficient Network Design through Platform-Aware Model Adaptation | Code | 1 |
| PatrickStar: Parallel Training of Pre-trained Models via Chunk-based Memory Management | Code | 1 |
| PDFFlow: hardware accelerating parton density access | Code | 1 |
| Edge-MoE: Memory-Efficient Multi-Task Vision Transformer Architecture with Task-level Sparsity via Mixture-of-Experts | Code | 1 |
| Edge-Detect: Edge-centric Network Intrusion Detection using Deep Neural Network | Code | 1 |
| An introductory guide to aligning networks using SANA, the Simulated Annealing Network Aligner | Code | 1 |
| Pix2Prof: fast extraction of sequential information from galaxy imagery via a deep natural language 'captioning' model | Code | 1 |
| Cleora: A Simple, Strong and Scalable Graph Embedding Scheme | Code | 1 |
| Closed-Form Diffeomorphic Transformations for Time Series Alignment | Code | 1 |
| An open-source deep learning algorithm for efficient and fully-automatic analysis of the choroid in optical coherence tomography | Code | 1 |
| PLSSVM: A (multi-)GPGPU-accelerated Least Squares Support Vector Machine | Code | 1 |
| Dynamic Perceiver for Efficient Visual Recognition | Code | 1 |
| Dynamic-OFA: Runtime DNN Architecture Switching for Performance Scaling on Heterogeneous Embedded Platforms | Code | 1 |
| Dynamic Sparse Training with Structured Sparsity | Code | 1 |
| EfficientBioAI: Making Bioimaging AI Models Efficient in Energy, Latency and Representation | Code | 1 |
| Preoperative brain tumor imaging: models and software for segmentation and standardized reporting | Code | 1 |
| Collage: Seamless Integration of Deep Learning Backends with Automatic Placement | Code | 1 |
| Collapsible Linear Blocks for Super-Efficient Super Resolution | Code | 1 |
| DRL-Based Federated Self-Supervised Learning for Task Offloading and Resource Allocation in ISAC-Enabled Vehicle Edge Computing | Code | 1 |
| Combining Self-Training and Hybrid Architecture for Semi-supervised Abdominal Organ Segmentation | Code | 1 |
| PyTorch-Direct: Enabling GPU Centric Data Access for Very Large Graph Neural Network Training with Irregular Accesses | Code | 1 |
| DuoDecoding: Hardware-aware Heterogeneous Speculative Decoding with Dynamic Multi-Sequence Drafting | Code | 1 |
| Dorylus: Affordable, Scalable, and Accurate GNN Training with Distributed CPU Servers and Serverless Threads | Code | 1 |
| Injecting Domain Adaptation with Learning-to-hash for Effective and Efficient Zero-shot Dense Retrieval | Code | 1 |
| DQO-MAP: Dual Quadrics Multi-Object mapping with Gaussian Splatting | Code | 1 |
| Dynamic Low-Rank Sparse Adaptation for Large Language Models | Code | 1 |
| Efficient Camera Exposure Control for Visual Odometry via Deep Reinforcement Learning | Code | 1 |
| Real-Time Semantic Background Subtraction | Code | 1 |
| Differentiable Time-Frequency Scattering on GPU | Code | 1 |
| A Causal U-net based Neural Beamforming Network for Real-Time Multi-Channel Speech Enhancement | Code | 1 |
| RL-Scope: Cross-Stack Profiling for Deep Reinforcement Learning Workloads | Code | 1 |
| Comparing Popular Simulation Environments in the Scope of Robotics and Reinforcement Learning | Code | 1 |
| Diet deep generative audio models with structured lottery | Code | 1 |
| APB2FaceV2: Real-Time Audio-Guided Multi-Face Reenactment | Code | 1 |
| ADTrack: Target-Aware Dual Filter Learning for Real-Time Anti-Dark UAV Tracking | Code | 1 |
| A Perceptually-Motivated Approach for Low-Complexity, Real-Time Enhancement of Fullband Speech | Code | 1 |
| Scalable Memory Protection in the PENGLAI Enclave | Code | 1 |
| DiffTune: Optimizing CPU Simulator Parameters with Learned Differentiable Surrogates | Code | 1 |
| Deep learning approach to left ventricular non-compaction measurement | Code | 1 |
| Search-Based Regular Expression Inference on a GPU | Code | 1 |
| SECDA: Efficient Hardware/Software Co-Design of FPGA-based DNN Accelerators for Edge Inference | Code | 1 |
| Asynchronous Methods for Deep Reinforcement Learning | Code | 1 |
| Design and Implementation of an FPGA-Based Hardware Accelerator for Transformer | Code | 1 |
| DGNN-Booster: A Generic FPGA Accelerator Framework For Dynamic Graph Neural Network Inference | Code | 1 |
| Distributed Deep Neural-Network-Based Middleware for Cyber-Attacks Detection in Smart IoT Ecosystem: A Novel Framework and Performance Evaluation Approach | Code | 1 |
| Depth Quality-Inspired Feature Manipulation for Efficient RGB-D Salient Object Detection | Code | 1 |
| Depth Quality-Inspired Feature Manipulation for Efficient RGB-D and Video Salient Object Detection | Code | 1 |
| ConsumerBench: Benchmarking Generative AI Applications on End-User Devices | Code | 1 |
| Denoising Autoencoders for fast Combinatorial Black Box Optimization | Code | 1 |
