SOTAVerified

CPU

Papers

Showing 951–1000 of 2231 papers

| Title | Status | Hype |
| --- | --- | --- |
| Benchmarking State-of-the-Art Deep Learning Software Tools | | 0 |
| Fully Learnable Group Convolution for Acceleration of Deep Neural Networks | | 0 |
| FusionAI: Decentralized Training and Deploying LLMs with Massive Consumer-Level GPUs | | 0 |
| FusionANNS: An Efficient CPU/GPU Cooperative Processing Architecture for Billion-scale Approximate Nearest Neighbor Search | | 0 |
| Fusion of multispectral satellite imagery using a cluster of graphics processing unit | | 0 |
| FusionStitching: Boosting Memory Intensive Computations for Deep Learning Workloads | | 0 |
| Finite volume method network for acceleration of unsteady computational fluid dynamics: non-reacting and reacting flows | | 0 |
| GANDSE: Generative Adversarial Network based Design Space Exploration for Neural Network Accelerator Design | | 0 |
| eDKM: An Efficient and Accurate Train-time Weight Clustering for Large Language Models | | 0 |
| Edinburgh's Submissions to the 2020 Machine Translation Efficiency Task | | 0 |
| A Convolutional Neural Network Cascade for Face Detection | | 0 |
| GNNear: Accelerating Full-Batch Training of Graph Neural Networks with Near-Memory Processing | | 0 |
| Edge-GPU Based Face Tracking for Face Detection and Recognition Acceleration | | 0 |
| GEB-1.3B: Open Lightweight Large Language Model | | 0 |
| Benchmarking of CPU-intensive Stream Data Processing in The Edge Computing Systems | | 0 |
| EDEN: Enabling Energy-Efficient, High-Performance Deep Neural Network Inference Using Approximate DRAM | | 0 |
| Analog CMOS-based Resistive Processing Unit for Deep Neural Network Training | | 0 |
| Scalability of Reinforcement Learning Methods for Dispatching in Semiconductor Frontend Fabs: A Comparison of Open-Source Models with Real Industry Datasets | | 0 |
| Generative AI on the Edge: Architecture and Performance Evaluation | | 0 |
| Generative Design by Reinforcement Learning: Enhancing the Diversity of Topology Optimization Designs | | 0 |
| ED-Batch: Efficient Automatic Batching of Dynamic Neural Networks via Learned Finite State Machines | | 0 |
| GeneSys: Enabling Continuous Learning through Neural Network Evolution in Hardware | | 0 |
| Benchmarking End-To-End Performance of AI-Based Chip Placement Algorithms | | 0 |
| An Adaptive Device-Edge Co-Inference Framework Based on Soft Actor-Critic | | 0 |
| ECG Biometric Authentication Using Self-Supervised Learning for IoT Edge Sensors | | 0 |
| DyNet: Dynamic Convolution for Accelerating Convolution Neural Networks | | 0 |
| GHOST: A Graph Neural Network Accelerator using Silicon Photonics | | 0 |
| Benchmarking Edge AI Platforms for High-Performance ML Inference | | 0 |
| Cross-Stack Workload Characterization of Deep Recommendation Systems | | 0 |
| Joint Optimization of the Deployment and Resource Allocation of UAVs in Vehicular Edge Computing and Networks | | 0 |
| Hierarchical Federated Learning in Wireless Networks: Pruning Tackles Bandwidth Scarcity and System Heterogeneity | | 0 |
| DynaSplit: A Hardware-Software Co-Design Framework for Energy-Aware Inference on Edge | | 0 |
| Dynamic Vision Sensor integration on FPGA-based CNN accelerators for high-speed visual classification | | 0 |
| Global Neighbor Sampling for Mixed CPU-GPU Training on Giant Graphs | | 0 |
| GnetDet: Object Detection Optimized on a 224mW CNN Accelerator Chip at the Speed of 106FPS | | 0 |
| GnetSeg: Semantic Segmentation Model Optimized on a 224mW CNN Accelerator Chip at the Speed of 318FPS | | 0 |
| Heterogeneous Acceleration Pipeline for Recommendation System Training | | 0 |
| Dynamic Transformer for Efficient Machine Translation on Embedded Devices | | 0 |
| Dynamic Superblock Pruning for Fast Learned Sparse Retrieval | | 0 |
| A Multi-Agent System Approach to Load-Balancing and Resource Allocation for Distributed Computing | | 0 |
| HFL: Hybrid Fuzzing on the Linux Kernel | | 0 |
| Google Coral-based edge computing person reidentification using human parsing combined with analytical method | | 0 |
| GossipGraD: Scalable Deep Learning using Gossip Communication based Asynchronous Gradient Descent | | 0 |
| GPGPU Acceleration of the KAZE Image Feature Extraction Algorithm | | 0 |
| A General Framework for Constrained Bayesian Optimization using Information-based Search | | 0 |
| GPTVQ: The Blessing of Dimensionality for LLM Quantization | | 0 |
| CWD: A Machine Learning based Approach to Detect Unknown Cloud Workloads | | 0 |
| GPU Accelerated Cascade Hashing Image Matching for Large Scale 3D Reconstruction | | 0 |
| AMPLE: Event-Driven Accelerator for Mixed-Precision Inference of Graph Neural Networks | | 0 |
| Accelerating Framework of Transformer by Hardware Design and Model Compression Co-Optimization | | 0 |

No leaderboard results yet.