SOTAVerified

CPU

Papers

Showing 151200 of 2231 papers

TitleStatusHype
Crypto Miner Attack: GPU Remote Code Execution Attacks0
Decoding Complexity: Intelligent Pattern Exploration with CHPDA (Context Aware Hybrid Pattern Detection Algorithm)0
Klotski: Efficient Mixture-of-Expert Inference via Expert-Aware Multi-Batch PipelineCode0
fMoE: Fine-Grained Expert Offloading for Large Mixture-of-Experts Serving0
VistaFlow: Photorealistic Volumetric Reconstruction with Dynamic Resolution Management via Q-Learning0
Unrealized Expectations: Comparing AI Methods vs Classical Algorithms for Maximum Independent Set0
Accessible and Portable LLM Inference by Compiling Computational Graphs into SQL0
Ilargi: a GPU Compatible Factorized ML Model Training Framework0
Impulsive Relative Motion Control with Continuous-Time Constraint Satisfaction for Cislunar Space Missions0
DeepExtractor: Time-domain reconstruction of signals and glitches in gravitational wave data with deep learning0
adabmDCA 2.0 -- a flexible but easy-to-use package for Direct Coupling AnalysisCode0
Smart Cubing for Graph Search: A Comparative Study0
Return of the Encoder: Maximizing Parameter Efficiency for SLMsCode1
Billion-scale Similarity Search Using a Hybrid Indexing Approach with Advanced Filtering0
STMDNet: A Lightweight Directional Framework for Motion Pattern Recognition of Tiny TargetsCode0
HEPPO: Hardware-Efficient Proximal Policy Optimization -- A Universal Pipelined Architecture for Generalized Advantage Estimation0
Multi-Tenant SmartNICs for In-Network Preprocessing of Recommender Systems0
Sublinear Variational Optimization of Gaussian Mixture Models with Millions to Billions of Parameters0
Glinthawk: A Two-Tiered Architecture for Offline LLM InferenceCode1
MOFA: Discovering Materials for Carbon Capture with a GenAI- and Simulation-Based Workflow0
No More Sliding Window: Efficient 3D Medical Image Segmentation with Differentiable Top-k Patch Sampling0
PixelBrax: Learning Continuous Control from Pixels End-to-End on the GPUCode0
The Streaming Batch Model for Efficient and Fault-Tolerant Heterogeneous Execution0
Towards Lightweight and Stable Zero-shot TTS with Self-distilled Representation Disentanglement0
Keras Sig: Efficient Path Signature Computation on GPU in Keras 30
A Federated Deep Learning Framework for Cell-Free RSMA Networks0
TakuNet: an Energy-Efficient CNN for Real-Time Inference on Embedded UAV systems in Emergency Response ScenariosCode2
Optimizing Distributed Deployment of Mixture-of-Experts Model Inference in Serverless Computing0
TimeRL: Efficient Deep Reinforcement Learning with Polyhedral Dependence Graphs0
A GPU Implementation of Multi-Guiding Spark Fireworks Algorithm for Efficient Black-Box Neural Network OptimizationCode0
Finite Element Method for HJB in Option Pricing with Stock Borrowing Fees0
Predicting two-dimensional spatiotemporal chaotic patterns with optimized high-dimensional hybrid reservoir computing0
Learning from Ambiguous Data with Hard Labels0
FED: Fast and Efficient Dataset Deduplication Framework with GPU AccelerationCode0
Minimal Interaction Seperated Tuning: A New Paradigm for Visual Adaptation0
Enhancing Deployment-Time Predictive Model Robustness for Code Analysis and OptimizationCode0
Human-like Bots for Tactical Shooters Using Compute-Efficient Sensors0
FPGA-based Acceleration of Neural Network for Image Classification using Vitis AI0
Dynamic Optimization of Storage Systems Using Reinforcement Learning Techniques0
Pushing the Envelope of Low-Bit LLM via Dynamic Error Compensation0
Assessing Text Classification Methods for Cyberbullying Detection on Social Media Platforms0
Dovetail: A CPU/GPU Heterogeneous Speculative Decoding for LLM inference0
TPCH: Tensor-interacted Projection and Cooperative Hashing for Multi-view ClusteringCode0
High-Rank Irreducible Cartesian Tensor Decomposition and Bases of Equivariant SpacesCode0
Unsupervised Learning Approach for Beamforming in Cell-Free Integrated Sensing and Communication0
Data-Juicer 2.0: Cloud-Scale Adaptive Data Processing for and with Foundation ModelsCode9
Power- and Fragmentation-aware Online Scheduling for GPU DatacentersCode0
Hybrid Network- and User-Centric Scalable Cell-Free Massive MIMO for Fronthaul Signaling MinimizationCode0
WebLLM: A High-Performance In-Browser LLM Inference EngineCode11
Energy consumption of code small language models serving with runtime engines and execution providers0
Show:102550
← PrevPage 4 of 45Next →

No leaderboard results yet.