SOTAVerified

Inference Optimization

Papers

Showing 2130 of 56 papers

TitleStatusHype
Representing Edge Flows on Graphs via Sparse Cell ComplexesCode0
Sub-MoE: Efficient Mixture-of-Expert LLMs Compression via Subspace Expert MergingCode0
Residual-Based Error Corrector Operator to Enhance Accuracy and Reliability of Neural Operator Surrogates of Nonlinear Variational Boundary-Value Problems0
Efficiency optimization of large-scale language models based on deep learning in natural language processing tasks0
Energy-Efficient Transformer Inference: Optimization Strategies for Time Series Classification0
SNDCNN: Self-normalizing deep CNNs with scaled exponential linear units for speech recognition0
Faster MoE LLM Inference for Extremely Large Models0
Federated Learning While Providing Model as a Service: Joint Training and Inference Optimization0
FluidML: Fast and Memory Efficient Inference Optimization0
Hellinger-Kantorovich Gradient Flows: Global Exponential Decay of Entropy Functionals0
Show:102550
← PrevPage 3 of 6Next →

No leaderboard results yet.