SOTAVerified

CPU

Papers

Showing 401-450 of 2231 papers

| Title | Status | Hype |
| --- | --- | --- |
| Chameleon: a Heterogeneous and Disaggregated Accelerator System for Retrieval-Augmented Language Models | Code | 1 |
| ChamNet: Towards Efficient Network Design through Platform-Aware Model Adaptation | Code | 1 |
| GEM: Online Globally consistent dense elevation mapping for unstructured terrain | Code | 1 |
| Fully automated analysis of muscle architecture from B-mode ultrasound images with deep learning | Code | 1 |
| Generating QM1B with PySCF_IPU | Code | 1 |
| Latent Replay for Real-Time Continual Learning | Code | 1 |
| An introductory guide to aligning networks using SANA, the Simulated Annealing Network Aligner | Code | 1 |
| CT-ICP: Real-time Elastic LiDAR Odometry with Loop Closure | Code | 1 |
| Cleora: A Simple, Strong and Scalable Graph Embedding Scheme | Code | 1 |
| Closed-Form Diffeomorphic Transformations for Time Series Alignment | Code | 1 |
| Glinthawk: A Two-Tiered Architecture for Offline LLM Inference | Code | 1 |
| Scaling up HBM Efficiency of Top-K SpMV for Approximate Embedding Similarity on FPGAs | Code | 1 |
| Ginex: SSD-enabled Billion-scale Graph Neural Network Training on a Single Machine via Provably Optimal In-memory Caching | Code | 1 |
| CryptGPU: Fast Privacy-Preserving Machine Learning on the GPU | Code | 1 |
| DarkneTZ: Towards Model Privacy at the Edge using Trusted Execution Environments | Code | 1 |
| Learning from Event Cameras with Sparse Spiking Convolutional Neural Networks | Code | 1 |
| CPU frequency scheduling of real-time applications on embedded devices with temporal encoding-based deep reinforcement learning | Code | 1 |
| Collage: Seamless Integration of Deep Learning Backends with Automatic Placement | Code | 1 |
| Collapsible Linear Blocks for Super-Efficient Super Resolution | Code | 1 |
| GPU Accelerated Exhaustive Search for Optimal Ensemble of Black-Box Optimization Algorithms | Code | 1 |
| CPU- and GPU-based Distributed Sampling in Dirichlet Process Mixtures for Large-scale Analysis | Code | 1 |
| Cross-Camera Convolutional Color Constancy | Code | 1 |
| Large Graph Convolutional Network Training with GPU-Oriented Data Communication Architecture | Code | 1 |
| SLIDE : In Defense of Smart Algorithms over Hardware Acceleration for Large-Scale Deep Learning Systems | Code | 1 |
| GPU-Accelerated WFST Beam Search Decoder for CTC-based Speech Recognition | Code | 1 |
| GPU-Accelerated Viterbi Exact Lattice Decoder for Batched Online and Offline Speech Recognition | Code | 1 |
| Marius: Learning Massive Graph Embeddings on a Single Machine | Code | 1 |
| Kimera: an Open-Source Library for Real-Time Metric-Semantic Localization and Mapping | Code | 1 |
| gpuRIR: A python library for Room Impulse Response simulation with GPU acceleration | Code | 1 |
| GPU optimization of the 3D Scale-invariant Feature Transform Algorithm and a Novel BRIEF-inspired 3D Fast Descriptor | Code | 1 |
| A Causal U-net based Neural Beamforming Network for Real-Time Multi-Channel Speech Enhancement | Code | 1 |
| Graph-Cut RANSAC | Code | 1 |
| Comparing Popular Simulation Environments in the Scope of Robotics and Reinforcement Learning | Code | 1 |
| Hardware System Implementation for Human Detection using HOG and SVM Algorithm | Code | 1 |
| APB2FaceV2: Real-Time Audio-Guided Multi-Face Reenactment | Code | 1 |
| ADTrack: Target-Aware Dual Filter Learning for Real-Time Anti-Dark UAV Tracking | Code | 1 |
| A Perceptually-Motivated Approach for Low-Complexity, Real-Time Enhancement of Fullband Speech | Code | 1 |
| Habitizing Diffusion Planning for Efficient and Effective Decision Making | Code | 1 |
| ApHMM: Accelerating Profile Hidden Markov Models for Fast and Energy-Efficient Genome Analysis | Code | 1 |
| Harmony: Overcoming the Hurdles of GPU Memory Capacity to Train Massive DNN Models on Commodity Servers | Code | 1 |
| Efficient Hyperparameter Optimization in Deep Learning Using a Variable Length Genetic Algorithm | Code | 1 |
| Harnessing Deep Learning and HPC Kernels via High-Level Loop and Tensor Abstractions on CPU Architectures | Code | 1 |
| KML: Using Machine Learning to Improve Storage Systems | Code | 1 |
| Asynchronous Methods for Deep Reinforcement Learning | Code | 1 |
| Convolutional Sequence to Sequence Learning | Code | 1 |
| Correlation Filters for Unmanned Aerial Vehicle-Based Aerial Tracking: A Review and Experimental Evaluation | Code | 1 |
| L3: Accelerator-Friendly Lossless Image Format for High-Resolution, High-Throughput DNN Training | Code | 1 |
| Consistent and Asymptotically Statistically-Efficient Solution to Camera Motion Estimation | Code | 1 |
| Constraint-aware and Ranking-distilled Token Pruning for Efficient Transformer Inference | Code | 1 |
| ConsumerBench: Benchmarking Generative AI Applications on End-User Devices | Code | 1 |

No leaderboard results yet.