SOTAVerified

CPU

Papers

Showing 251-300 of 2231 papers

Title | Status | Hype
AI-assisted Agile Propagation Modeling for Real-time Digital Twin Wireless Networks | - | 0
Cora: Accelerating Stateful Network Applications with SmartNICs | - | 0
ShadowKV: KV Cache in Shadows for High-Throughput Long-Context LLM Inference | Code | 3
Accelerated Bayesian parameter estimation and model selection for gravitational waves with normalizing flows | - | 0
Deep Optimizer States: Towards Scalable Training of Transformer Models Using Interleaved Offloading | Code | 0
Sensing-Communication-Computing-Control Closed-Loop Optimization for 6G Unmanned Robotic Systems | - | 0
Structured Connectivity for 6G Reflex Arc: Task-Oriented Virtual User and New Uplink-Downlink Tradeoff | - | 0
Multi-objective Optimization in CPU Design Space Exploration: Attention is All You Need | - | 0
Rawsamble: Overlapping and Assembling Raw Nanopore Signals using a Hash-based Seeding Mechanism | Code | 2
ExpertFlow: Optimized Expert Activation and Token Allocation for Efficient Mixture-of-Experts Inference | - | 0
AI-focused HPC Data Centers Can Provide More Power Grid Flexibility and at Lower Cost | - | 0
FastAttention: Extend FlashAttention2 to NPUs and Low-resource GPUs | - | 0
MagicPIG: LSH Sampling for Efficient LLM Generation | Code | 3
InternLM2.5-StepProver: Advancing Automated Theorem Proving via Expert Iteration on Large-Scale LEAN Problems | Code | 4
Accelerate Coastal Ocean Circulation Model with AI Surrogate | - | 0
syren-new: Precise formulae for the linear and nonlinear matter power spectra with massive neutrinos and dynamical dark energy | Code | 1
CoreGuard: Safeguarding Foundational Capabilities of LLMs Against Model Stealing in Edge Deployment | - | 0
Towards Arbitrary QUBO Optimization: Analysis of Classical and Quantum-Activated Feedforward Neural Networks | - | 0
A Transformer Based Generative Chemical Language AI Model for Structural Elucidation of Organic Compounds | - | 0
ActNAS : Generating Efficient YOLO Models using Activation NAS | - | 0
Bukva: Russian Sign Language Alphabet | Code | 0
Superpipeline: A Universal Approach for Reducing GPU Memory Usage in Large Models | Code | 0
Unveiling Molecular Secrets: An LLM-Augmented Linear Model for Explainable and Calibratable Molecular Property Prediction | Code | 0
KV Prediction for Improved Time to First Token | Code | 0
Octopus Inspired Optimization Algorithm: Multi-Level Structures and Parallel Computing Strategies | Code | 1
Dense Optimizer : An Information Entropy-Guided Structural Search Method for Dense-like Neural Network Design | - | 0
An Innovative Solution: AI-Based Digital Screen-Integrated Tables for Educational Settings | - | 0
Large Language Model Inference Acceleration: A Comprehensive Hardware Perspective | Code | 1
Fast Object Detection with a Machine Learning Edge Device | - | 0
Dolphin: A Programmable Framework for Scalable Neurosymbolic Learning | - | 0
Predictive Attractor Models | - | 0
Modeling the Energy Consumption of the HEVC Software Encoding Process using Processor events | - | 0
Lotus: learning-based online thermal and latency variation management for two-stage detectors on edge devices | Code | 0
A Low-Cost, High-Speed, and Robust Bin Picking System for Factory Automation Enabled by a Non-Stop, Multi-View, and Active Vision Scheme | - | 0
Accelerating PoT Quantization on Edge Devices | Code | 0
Simulation-based inference with the Python Package sbijax | - | 0
The impact of climate policy uncertainty on financial market resilience: Evidence from China | - | 0
TensorSocket: Shared Data Loading for Deep Learning Training | - | 0
Data-Prep-Kit: getting your data ready for LLM application development | Code | 4
FlowMAC: Conditional Flow Matching for Audio Coding at Low Bit Rates | - | 0
Behaviour4All: in-the-wild Facial Behaviour Analysis Toolkit | - | 0
FusionANNS: An Efficient CPU/GPU Cooperative Processing Architecture for Billion-scale Approximate Nearest Neighbor Search | - | 0
CNN Mixture-of-Depths | - | 0
Distributed Channel Estimation and Optimization for 6D Movable Antenna: Unveiling Directional Sparsity | - | 0
Benchmarking Edge AI Platforms for High-Performance ML Inference | - | 0
Efficient Tabular Data Preprocessing of ML Pipelines | - | 0
FAMOUS: Flexible Accelerator for the Attention Mechanism of Transformer on UltraScale+ FPGAs | - | 0
OATS: Outlier-Aware Pruning Through Sparse and Low Rank Decomposition | Code | 1
EFA-YOLO: An Efficient Feature Attention Model for Fire and Flame Detection | - | 0
Magika: AI-Powered Content-Type Detection | Code | 11
Page 6 of 45

No leaderboard results yet.