SOTAVerified

CPU

Papers

Showing 651700 of 2231 papers

TitleStatusHype
A Simple Sparse Matrix Vector Multiplication Approach to Padded ConvolutionCode0
An Integrated Artificial Intelligence Operating System for Advanced Low-Altitude Aviation Applications0
Improving Accuracy and Generalization for Efficient Visual Tracking0
A Runtime-Adaptive Transformer Neural Network Accelerator on FPGAsCode0
KVPR: Efficient LLM Inference with I/O-Aware KV Cache Partial RecomputationCode0
A Data-Driven Approach to Dataflow-Aware Online Scheduling for Graph Neural Network Inference0
Plastic Arbor: a modern simulation framework for synaptic plasticity x2013 from single synapses to networks of morphological neuronsCode0
OPMOS: Ordered Parallel Algorithm for Multi-Objective Shortest-Paths0
SMM-Conv: Scalar Matrix Multiplication with Zero Packing for Accelerated Convolution0
Deep operator network models for predicting post-burn contraction0
Llama Guard 3-1B-INT4: Compact and Efficient Safeguard for Human-AI Conversations0
MoE-Lightning: High-Throughput MoE Inference on Memory-constrained GPUs0
Generative AI on the Edge: Architecture and Performance Evaluation0
Towards Accurate and Efficient Sub-8-Bit Integer Training0
Pie: Pooling CPU Memory for LLM Inference0
Offline Adaptation of Quadruped Locomotion using Diffusion ModelsCode0
Input-Based Ensemble-Learning Method for Dynamic Memory Configuration of Serverless Computing Functions0
TinyML Security: Exploring Vulnerabilities in Resource-Constrained Machine Learning Systems0
Project Tracyn: Generative Artificial Intelligence based Peripherals Trace Synthesizer0
P-MOSS: Learned Scheduling For Indexes Over NUMA Servers Using Low-Level Hardware Statistics0
DeepContext: A Context-aware, Cross-platform, and Cross-framework Tool for Performance Profiling and Analysis of Deep Learning Workloads0
Map++: Towards User-Participatory Visual SLAM Systems with Efficient Map Expansion and Sharing0
AI-Ready Energy Modelling for Next Generation RANCode0
NEO: Saving GPU Memory Crisis with CPU Offloading for Online LLM InferenceCode0
DynaSplit: A Hardware-Software Co-Design Framework for Energy-Aware Inference on Edge0
Conditioned quantum-assisted deep generative surrogate for particle-calorimeter interactions0
Cora: Accelerating Stateful Network Applications with SmartNICs0
AI-assisted Agile Propagation Modeling for Real-time Digital Twin Wireless Networks0
Accelerated Bayesian parameter estimation and model selection for gravitational waves with normalizing flows0
Deep Optimizer States: Towards Scalable Training of Transformer Models Using Interleaved OffloadingCode0
Multi-objective Optimization in CPU Design Space Exploration: Attention is All You Need0
Structured Connectivity for 6G Reflex Arc: Task-Oriented Virtual User and New Uplink-Downlink Tradeoff0
Sensing-Communication-Computing-Control Closed-Loop Optimization for 6G Unmanned Robotic Systems0
ExpertFlow: Optimized Expert Activation and Token Allocation for Efficient Mixture-of-Experts Inference0
FastAttention: Extend FlashAttention2 to NPUs and Low-resource GPUs0
AI-focused HPC Data Centers Can Provide More Power Grid Flexibility and at Lower Cost0
Accelerate Coastal Ocean Circulation Model with AI Surrogate0
CoreGuard: Safeguarding Foundational Capabilities of LLMs Against Model Stealing in Edge Deployment0
Towards Arbitrary QUBO Optimization: Analysis of Classical and Quantum-Activated Feedforward Neural Networks0
A Transformer Based Generative Chemical Language AI Model for Structural Elucidation of Organic Compounds0
Unveiling Molecular Secrets: An LLM-Augmented Linear Model for Explainable and Calibratable Molecular Property PredictionCode0
Superpipeline: A Universal Approach for Reducing GPU Memory Usage in Large ModelsCode0
Bukva: Russian Sign Language AlphabetCode0
ActNAS : Generating Efficient YOLO Models using Activation NAS0
Dense Optimizer : An Information Entropy-Guided Structural Search Method for Dense-like Neural Network Design0
KV Prediction for Improved Time to First Token0
An Innovative Solution: AI-Based Digital Screen-Integrated Tables for Educational Settings0
Fast Object Detection with a Machine Learning Edge Device0
Dolphin: A Programmable Framework for Scalable Neurosymbolic Learning0
Predictive Attractor Models0
Show:102550
← PrevPage 14 of 45Next →

No leaderboard results yet.