SOTAVerified

CPU

Papers

Showing 551600 of 2231 papers

TitleStatusHype
PyGraph: Robust Compiler Support for CUDA Graphs in PyTorch0
Adaptive Machine Learning for Resource-Constrained EnvironmentsCode0
V-Seek: Accelerating LLM Reasoning on Open-hardware Server-class RISC-V Platforms0
SpeCache: Speculative Key-Value Caching for Efficient Generation of LLMs0
BurTorch: Revisiting Training from First Principles by Coupling Autodiff, Math Optimization, and Systems0
Audio Compression using Periodic Gabor with Biorthogonal Exchange: Implementation Using the Zak Transform0
Robust Learning-Based Sparse Recovery for Device Activity Detection in Grant-Free Random Access Cell-Free Massive MIMO: Enhancing Resilience to Impairments0
Sometimes Painful but Certainly Promising: Feasibility and Trade-offs of Language Model Inference at the Edge0
Efficient Neural Clause-Selection Reinforcement0
Are We There Yet? A Measurement Study of Efficiency for LLM Applications on Mobile Devices0
Coordinated Energy-Trajectory Economic Model Predictive Control for Autonomous Surface Vehicles under Disturbances0
HGO-YOLO: Advancing Anomaly Behavior Detection with Hierarchical Features and Lightweight Optimized Detection0
The impact of external uncertainties on the extreme return connectedness between food, fossil energy, and clean energy markets0
Spillover effects between climate policy uncertainty, energy markets, and food markets: A time-frequency analysis0
LapSum -- One Method to Differentiate Them All: Ranking, Sorting and Top-k Selection0
Real-Time Semantic Segmentation of Aerial Images Using an Embedded U-Net: A Comparison of CPU, GPU, and FPGA Workflows0
Partial Convolution Meets Visual Attention0
Benchmarking Dynamic SLO Compliance in Distributed Computing Continuum SystemsCode0
Deterministic Global Optimization of the Acquisition Function in Bayesian Optimization: To Do or Not To Do?0
CoServe: Efficient Collaboration-of-Experts (CoE) Model Inference with Limited Memory0
Evaluation of adaptive sampling methods in scenario generation for virtual safety impact assessment of pre-crash safety systems0
AMPLE: Event-Driven Accelerator for Mixed-Precision Inference of Graph Neural Networks0
TeleRAG: Efficient Retrieval-Augmented Generation Inference with Lookahead Retrieval0
Striving for Faster and Better: A One-Layer Architecture with Auto Re-parameterization for Low-Light Image EnhancementCode0
LLMs Have Rhythm: Fingerprinting Large Language Models Using Inter-Token Times and Network Traffic Analysis0
AutoHete: An Automatic and Efficient Heterogeneous Training System for LLMs0
SparseTransX: Efficient Training of Translation-Based Knowledge Graph Embeddings Using Sparse Matrix OperationsCode0
A Universal Framework for Compressing Embeddings in CTR PredictionCode0
Safe Beyond the Horizon: Efficient Sampling-based MPC with Neural Control Barrier Functions0
Distributed U-net model and Image Segmentation for Lung Cancer Detection0
Determining Layer-wise Sparsity for Large Language Models Through a Theoretical Perspective0
Object-Pose Estimation With Neural Population Codes0
On-Device LLMs for Home Assistant: Dual Role in Intent Detection and Response Generation0
A^2ATS: Retrieval-Based KV Cache Reduction via Windowed Rotary Position Embedding and Query-Aware Vector Quantization0
Robust 6DoF Pose Tracking Considering Contour and Interior Correspondence Uncertainty for AR Assembly Guidance0
Representation Learning on Out of Distribution in Tabular Data0
Weighted-Sum Energy Efficiency Maximization in User-Centric Uplink Cell-Free Massive MIMO0
DVFS-Aware DNN Inference on GPUs: Latency Modeling and Performance Analysis0
Crypto Miner Attack: GPU Remote Code Execution Attacks0
Klotski: Efficient Mixture-of-Expert Inference via Expert-Aware Multi-Batch PipelineCode0
Decoding Complexity: Intelligent Pattern Exploration with CHPDA (Context Aware Hybrid Pattern Detection Algorithm)0
fMoE: Fine-Grained Expert Offloading for Large Mixture-of-Experts Serving0
VistaFlow: Photorealistic Volumetric Reconstruction with Dynamic Resolution Management via Q-Learning0
Unrealized Expectations: Comparing AI Methods vs Classical Algorithms for Maximum Independent Set0
Accessible and Portable LLM Inference by Compiling Computational Graphs into SQL0
Ilargi: a GPU Compatible Factorized ML Model Training Framework0
Impulsive Relative Motion Control with Continuous-Time Constraint Satisfaction for Cislunar Space Missions0
adabmDCA 2.0 -- a flexible but easy-to-use package for Direct Coupling AnalysisCode0
DeepExtractor: Time-domain reconstruction of signals and glitches in gravitational wave data with deep learning0
Smart Cubing for Graph Search: A Comparative Study0
Show:102550
← PrevPage 12 of 45Next →

No leaderboard results yet.