SOTAVerified

Inference Optimization

Papers

Showing 2650 of 56 papers

TitleStatusHype
SNDCNN: Self-normalizing deep CNNs with scaled exponential linear units for speech recognition0
Faster MoE LLM Inference for Extremely Large Models0
Federated Learning While Providing Model as a Service: Joint Training and Inference Optimization0
FluidML: Fast and Memory Efficient Inference Optimization0
Hellinger-Kantorovich Gradient Flows: Global Exponential Decay of Entropy Functionals0
Hybrid Offline-online Scheduling Method for Large Language Model Inference Optimization0
Inference Optimization of Foundation Models on AI Accelerators0
Bifocal Neural ASR: Exploiting Keyword Spotting for Inference Optimization0
The Ultimate Guide to Fine-Tuning LLMs from Basics to Breakthroughs: An Exhaustive Review of Technologies, Research, Best Practices, Applied Research Challenges and Opportunities0
Investigations on the inference optimization techniques and their impact on multiple hardware platforms for Semantic Segmentation0
SySMOL: Co-designing Algorithms and Hardware for Neural Networks with Heterogeneous Precisions0
Learning to Infer0
An approach to optimize inference of the DIART speaker diarization pipeline0
Networked Signal and Information Processing0
Integrating Pre-Trained Speech and Language Models for End-to-End Speech Recognition0
Advances and Open Challenges in Federated Foundation Models0
The Foundation Cracks: A Comprehensive Study on Bugs and Testing Practices in LLM Libraries0
Bayesian Active Learning in the Presence of Nuisance Parameters0
Revisiting SMoE Language Models by Evaluating Inefficiencies with Task Specific Expert Pruning0
Robust Zero-Shot Text-to-Speech Synthesis with Reverse Inference Optimization0
SBbadger: Biochemical Reaction Networks with Definable Degree Distributions0
Scaling the Vocabulary of Non-autoregressive Models for Efficient Generative Retrieval0
Self-Constrained Inference Optimization on Structural Groups for Human Pose Estimation0
A bi-partite generative model framework for analyzing and simulating large scale multiple discrete-continuous travel behaviour data0
Deep Signal Recovery with One-Bit Quantization0
Show:102550
← PrevPage 2 of 3Next →

No leaderboard results yet.